Mark-Specific Hazard Ratio Model with Missing Multivariate Marks

Michal Juraska; Peter B Gilbert

doi:10.1007/s10985-015-9353-9

. Author manuscript; available in PMC: 2017 Oct 1.

Published in final edited form as: Lifetime Data Anal. 2015 Oct 28;22(4):606–625. doi: 10.1007/s10985-015-9353-9

Mark-Specific Hazard Ratio Model with Missing Multivariate Marks

Michal Juraska ¹, Peter B Gilbert ²

PMCID: PMC4848257 NIHMSID: NIHMS734070 PMID: 26511033

Abstract

An objective of randomized placebo-controlled preventive HIV vaccine efficacy trials is to assess the relationship between vaccine effects to prevent HIV acquisition and continuous genetic distances of the exposing HIVs to multiple HIV strains represented in the vaccine. The set of genetic distances, only observed in failures, is collectively termed the ‘mark.’ The objective has motivated a recent study of a multivariate mark-specific hazard ratio model in the competing risks failure time analysis framework. Marks of interest, however, are commonly subject to substantial missingness, largely due to rapid post-acquisition viral evolution. In this article, we investigate the mark-specific hazard ratio model with missing multivariate marks and develop two inferential procedures based on (i) inverse probability weighting (IPW) of the complete cases, and (ii) augmentation of the IPW estimating functions by leveraging auxiliary data predictive of the mark. Asymptotic properties and finite-sample performance of the inferential procedures are presented. This research also provides general inferential methods for semiparametric density ratio/biased sampling models with missing data. We apply the developed procedures to data from the HVTN 502 ‘Step’ HIV vaccine efficacy trial.

Keywords: Augmented inverse probability weighting, Biased sampling model, Competing risks, Cox model, Density ratio model, Missing data, Semiparametric model

1 Introduction

In studies with time-to-event endpoints, treatment efficacy is commonly assessed by comparing treatment-specific hazard rates of failure. In preventive HIV vaccine efficacy trials, HIV-uninfected volunteers are randomly allocated to receive vaccine or placebo, and the primary objective assesses vaccine efficacy (VE), typically defined as one minus the (vaccine/placebo) hazard ratio of HIV acquisition (Halloran et al, 1997). The vaccine trial population is exposed to circulating HIVs of extensive sequence diversity, and, in general, viral vaccines stimulate protective immune responses to HIVs with protein sequences matching those inserted in the vaccine, but fail to stimulate protective immune responses to viruses exhibiting sequence divergence from the vaccine strains. Therefore, a secondary objective assesses VE to prevent acquisition of infection with certain HIV genotypes characterized by a continuous ‘mark’ variable measuring genetic divergence of an infecting HIV to the vaccine strain(s) (a mark, defined by Cox and Oakes (1984), is only observed in failures). The mark-specific vaccine efficacy is defined as one minus the mark-specific vaccine-to-placebo hazard ratio of infection (Gilbert et al, 2008).

A biologically relevant definition of the mark variable is required to reflect the potential of the vaccine, or lack thereof, to elicit protective immune responses to divergent viruses from the vaccine strains; under an adequate mark definition, the vaccine putatively confers a higher level of protection against acquisition of HIVs with smaller distances to the vaccine strains. Multiple immunologically relevant genetic distances have been proposed, based on (i) different conceptual definitions, (ii) different HIV sequence regions, (iii) different techniques to predict HIV amino acid sites that stimulate antibody/T-cell responses, and (iv) different HIV reference sequences (Rolland et al, 2011); thus, multivariate mark variables have been considered in order to assess the dependency of VE on the joint distance distribution. We analyze bivariate genetic distances in the application in Section 7.

Considering settings with mark data observed in all failures, Gilbert et al (2004, 2008) developed estimation and testing procedures for the univariate continuous mark-specific hazard ratio. Sun et al (2009) proposed inferential procedures for the univariate continuous mark-specific proportional hazards model that allows covariate adjustment. Sun et al (2013) extended this approach to accommodate multivariate marks. Juraska and Gilbert (2013) developed estimation and testing procedures for a continuous mark-specific hazard ratio model that allows multivariate marks and improves efficiency of estimation relative to alternative approaches.

However, the aforementioned work for continuous marks did not account for missingness of the multivariate mark in failures, which commonly arises in preventive HIV vaccine trials. The goal is to characterize if and how the vaccine blocks acquisition with certain HIV strains; therefore, it is critical to measure ‘near acquisition’ or ‘early’ HIV sequences, prior to the onset of post-acquisition viral evolution, to well-approximate the transmitting strain(s). Thus, the mark of particular scientific interest is defined based on sequences measured early after acquisition, which is commonly missing in a sizeable fraction of infected subjects. The missingness rate primarily depends on the frequency of HIV testing (typically 6-monthly). Although more frequent testing is planned in future trials, the missingness rate will remain substantial. Viral load may also be associated with missingness as levels below a limit of detection preclude sequencing of the virus. Other reasons for a missing early mark include a missing blood specimen or a technical failure in the HIV sequencing procedure. Section 7 describes an early-mark definition used in a vaccine trial.

A convenient approach in this setting is using the aforementioned ‘complete-case’ analysis methods; however, their correct application requires validity of the missing completely at random (MCAR) assumption. In addition, the complete-case analysis may be inefficient since data on subjects with missing early marks are ignored; the lack of efficiency may be severe if the missingness rate is high. Goetghebeur and Ryan (1995) studied proportional cause-specific hazards models with missing causes of failure (discrete marks). Lu and Tsiatis (2001) developed multiple imputation methods in this setting. Sun and Gilbert (2012) proposed estimation methods in the continuous mark-specific proportional hazards model with missing at random univariate marks; higher mark dimensions were not accommodated due to numerical limitations posed by the employed kernel smoothing procedure.

In this article, we extend the Juraska and Gilbert (2013) estimation and testing methods for continuous mark-specific hazard ratios to accommodate missing at random marks that are univariate or multivariate. We propose two consistent estimation procedures in semiparametric density ratio/biased sampling models, which, in the presence of complete biased sampling data, were studied by Vardi (1985), Qin (1998), Gilbert et al (1999) and Gilbert (2000). The estimation approaches are based on (i) weighting of the complete cases by the inverse of the probabilities of observing the mark of interest (Horvitz and Thompson, 1952), and (ii) augmentation of the inverse probability weighted (IPW) estimating functions by leveraging correlations between the mark and auxiliary data to “impute” the expected profile score vectors for subjects with both complete and incomplete mark data (using the general theory of Robins et al (1994)). Although motivated by the HIV vaccine trial application, this work addresses the general problem of inference in semiparametric density ratio/biased sampling models with missing data.

The remainder of the article is organized as follows. Notation, assumptions, the estimand of interest and the semiparametric model for the mark-specific hazard ratio are introduced in Section 2. We describe two estimation procedures in Section 3, followed by the presentation of the asymptotic properties of the proposed estimators in Section 4, which include double robustness. Hypothesis testing procedures are introduced in Section 5. We summarize our findings from the simulation study in Section 6 and apply the proposed inferential methods to data from the ‘Step’ HIV vaccine trial in Section 7. Theorem proofs are available in Supplement A.

2 Mark-specific hazard ratio model and missing marks

2.1 Notation

Let T and C denote the times to failure and censoring, respectively. The observed right-censored failure time is X = min(T,C) with the failure indicator δ = I(T ≤ C). Let V ∈ ℝ^s denote a continuous multivariate mark variable; without loss of generality, the support of each component of V is taken to be [0, 1]. In the competing risks framework, V is only observable if δ = 1. Juraska and Gilbert (2013) studied the case where V was assumed to be fully observed in all failures. We now allow V to be missing; in the HIV vaccine trial setting, we define V to be a multivariate distance based on a sequence measured in the early phase of HIV infection according to some operational definition. Thus, for infected subjects unobserved in the early phase, the mark V is missing. If δ = 1, we define the indicator R of observing the mark V as follows: let R = 1 if all components of V are observed and let R = 0 otherwise (hence, we consider the ‘all-or-none’ type of missingness because a missing sequence results in each component of the multivariate distance to be missing). Let Z denote the indicator of assignment to the treatment group (in vaccine trials, Z = 1 indicates vaccine and Z = 0 indicates placebo). We denote by A a vector of auxiliary covariates. Our approach only requires that A be observed in infected subjects (δ = 1) because only those observations of A are used by the method for predicting (i) the probability of observing V, and (ii) the expected profile score vector. Marks Ṽ based on ‘late’ sequences, their corresponding measurement times, and/or ‘late’ viral load measurements can be included as a subset of A. Let n be the total sample size and (X_i,δ_i,δ_iR_i, $δ_{i} R_{i} V_{i}^{T}$ , Z_i, $δ_{i} A_{i}^{T}$ ), i = 1, …, n, i.i.d. replicates of (X, δ, δR, δRV ^T,Z, δA^T ). The observed data consist of the observations (X_i, R_i, $R_{i} V_{i}^{T}$ , Z_i, $A_{i}^{T}$ ) for individuals with δ_i = 1 and the observations (X_i,Z_i) for those with δ_i = 0. If δ = 1, denote W = (Z,A^T )^T.

2.2 Assumptions

We assume C is conditionally independent of both T and V given Z, that is, C ⫫ T|Z and C ⫫ V |Z, and assume T ⫫ V |Z. Juraska and Gilbert (2013) demonstrated plausibility of the posed assumptions in the HIV vaccine trial setting, showed their necessity and sufficiency for identifiability of the Euclidean parameters in the time-independent density ratio model introduced in Section 2.4, proposed a Kolmogorov-Smirnov-type test for assessing validity of the assumption T ⫫ V |Z, and demonstrated robustness to levels of correlation between T and V detectable with moderate power (such correlations are of particular interest because, in these scenarios, the violation of T ⫫ V |Z may remain undetected).

We make the following assumptions about the missing mark mechanism:

P (R = 1 ∣ δ = 1, W) = P (R = 1 ∣ V, δ = 1, W)

(1)

and

π (W) : = P (R = 1 ∣ δ = 1, W) \geq ε with probability 1 for some ε > 0.

(2)

Condition (1) assumes that the mark V is missing at random (Rubin, 1976). Condition (2) ensures that an $n^{\frac{1}{2}}$ -consistent estimator for ϕ in (7) exists (Robins et al, 1994). The probability of observing V is largely affected by the frequency of HIV testing. Viral load may also be associated with missingness as noted in the Introduction.

2.3 Estimand of interest

We define the conditional multivariate mark-specific hazard function as

h (t, v ∣ Z = z) = lim_{Δ_{1}, Δ_{21}, \dots, Δ_{2 s} ↘ 0} \frac{P (T \in [t, t + Δ_{1}), V \in \prod_{i = 1}^{s} [v_{i}, v_{i} + Δ_{2 i}) ∣ T \geq t, Z = z)}{Δ_{1} Δ_{21} \dots Δ_{2 s}},

(3)

which naturally extends the cause-specific hazard function in the presence of finitely many causes of failure (Prentice et al, 1978). Gilbert et al (2008) defined the mark-specific vaccine efficacy as

V E (t, v) = 1 - h (t, v ∣ Z = 1) / h (t, v ∣ Z = 0),

(4)

which measures the multiplicative vaccine effect to reduce the instantaneous risk of infection by the exposing strain v at time t.

2.4 Mark-specific hazard ratio model

Following Juraska and Gilbert (2013), the mark-specific hazard function (3) factors as

h (t, v ∣ Z = z) = f (v ∣ T = t, Z = z) h (t ∣ Z = z),

(5)

where f(·|T = t,Z = z) is the conditional density function of V given T = t and Z = z, and h(·|Z = z) is the ordinary marginal hazard function ignoring mark data. Consequently, the mark-specific hazard ratio can be written as

\frac{h (t, v ∣ Z = 1)}{h (t, v ∣ Z = 0)} = \frac{f (v ∣ T = t, Z = 1)}{f (v ∣ T = t, Z = 0)} \frac{h (t ∣ Z = 1)}{h (t ∣ Z = 0)},

(6)

allowing separate estimation for each of the two ratios in (6). For the mark density ratio, following the assumption T ⫫ V|Z, we consider the semiparametric density ratio model (Qin, 1998)

f (v ∣ Z = 1) / f (v ∣ Z = 0) = g (v, ϕ)

(7)

with ϕ ∈ ℝ^d the parameter of interest and g(v,ϕ) a known time-independent weight function, continuously differentiable in ϕ. The nuisance parameter f(v|Z = 0) is treated nonparametrically. The assumption (T,C) ⫫ V|Z yields the equality f(v|Z = z) = f(v|Z = z, δ = 1), and hence the parameter ϕ in (7) is estimable using data in failures only. A popular choice of the weight function g(v,ϕ) is exp{α + g̃(v, β)} where ϕ = (α, β^T)^T and g̃(v, β) is a polynomial function because it yields model (7) equivalent to a retrospective logistic regression model (Prentice and Pyke, 1979).

For the marginal hazard ratio in (6), we posit the Cox model h(t|Z = 1)=h(t|Z = 0) = e^γ, estimating γ by the maximum partial likelihood method or the more efficient method of Lu and Tsiatis (2008) leveraging auxiliary data predictive of the failure time (we implemented the Lu and Tsiatis method in the R speff2trial package). Additional, possibly time-dependent covariates, can be included in the Cox model, allowing, for example, the overall treatment effect to vary over time (further discussed in Juraska and Gilbert (2013)).

3 Estimation procedures in the density ratio model

Due to factorization (6) of the mark-specific hazard ratio, missing marks only impact estimation of the mark density ratio. Following Qin (1998) and denoting f(v) = f(v|Z = 0), in the presence of complete mark data in failures, the semiparametric log likelihood for the parameter of interest ϕ is l(ϕ) = Σ_i_∈_I log f(V_i) + Σ_i∈I₁ log g(V_i,ϕ) with the nuisance parameter f(·) subject to constraints f(V_i) ≥ 0, Σ_i_∈_I f(V_i) = 1, and Σ_i_∈_I f(V_i)g(V_i,ϕ) = 1 where I = {k : δ_k = 1} and I₁ = {k : δ_k = 1 ^ Z_k = 1}. To maximize l(ϕ), the Lagrange multiplier method yields the profile score functions for ϕ and the Lagrange multiplier λ ∈ [0, 1] (that arises by profiling out f(·))

\begin{array}{l} U_{ϕ} (ϕ, λ) & = \sum_{i \in I} (- \frac{λ \dot{g} (V_{i}, ϕ)}{1 + λ (g (V_{i}, ϕ) - 1)} + Z_{i} \frac{\dot{g} (V_{i}, ϕ)}{g (V_{i}, ϕ)}) \\ U_{λ} (ϕ, λ) & = - \sum_{i \in I} \frac{g (V_{i}, ϕ) - 1}{1 + λ (g (V_{i}, ϕ) - 1)} \end{array}

where ġ(v, ϕ) = ∂g(v,ϕ)/∂ϕ. Denote U(ϕ, λ) = (U_ϕ (ϕ, λ)^T, U_λ(ϕ, λ))^T. If the mark is completely observed in failures, the maximum profile likelihood estimator (ϕ̂^T, λ̂)^T for (ϕ^T, λ)^T is defined as the solution to the system U(ϕ, λ) = 0. The goal is to extend the result to allow missing marks.

3.1 Inverse probability weighted complete-case estimator

The idea of the inverse probability weighted (IPW) complete-case estimator, originally proposed by Horvitz and Thompson (1952), is based on weighting of the complete cases by the inverse of the probabilities π(W_i) or their estimates. We suppose a correctly specified parametric model π(W, ψ) for π (W), i.e.,

π (W) = π (W, ψ_{0})

(8)

where ψ₀ is an unknown parameter vector and π(·, ψ) is a known smooth function taking values in [ε, 1] for some ε > 0. Typically we posit a logistic model logit{π (W, ψ)} = ψ^T c(W) where c is a vector function defined on the support of W. The maximum likelihood estimator ψ̂ for ψ is then obtained by solving Σ_i_∈_I S_ψ_,i(ψ) = 0 where S_ψ_,i(ψ) = (R_i − π (W_i, ψ)) δ logit π(W_i, ψ )/δψ.

To estimate the parameter of interest ψ (and λ), we define $U_{i}^{ipw} (ϕ, λ, ψ) = R_{i} U_{i} (ϕ, λ) / π (W_{i}, ψ)$ and $U^{ipw} (ϕ, λ, ψ) = \sum_{i \in I} U_{i}^{ipw} (ϕ, λ, ψ)$ , where U_i(ϕ, λ) is the i-th individual’s contribution to the profile score function U(ϕ,λ). The IPW estimator for (ϕ^T, λ)^T is defined as the solution to the IPW estimating equations U^ipw(ϕ, λ, ψ̂) = 0 and denoted by ${({\hat{ϕ}}_{ipw}^{T}, {\hat{λ}}_{ipw})}^{T}$ .

3.2 Augmented inverse probability weighted complete-case estimator

The IPW estimator fails to make optimal use of the available data and may be inefficient even if the model posited for π(w) is correct. Theorem 1 shows that the estimator ${({\hat{ϕ}}_{ipw}^{T}, {\hat{λ}}_{ipw})}^{T}$ is consistent if the probabilities π (w) are modeled correctly; otherwise it may be biased. To improve efficiency and robustness to mis-specification of the missingness model, Robins et al (1994) proposed adding an augmented term to the IPW estimating function in a general framework. In the density ratio model (7), a more efficient estimator for (ϕ^T, λ)^T can be obtained by adding information about the conditional expectation E[U(ϕ, λ)|δ = 1,W] into the estimation procedure (suppressing subscript i). Assumption (1) implies that E[U(ϕ, λ)|δ = 1,W] = E[U(ϕ, λ)|R = 1, δ = 1,W]. We posit a parametric model q(W,ϕ, λ, ν) for E[U(ϕ, λ)|δ = 1,W], i.e., for (ϕ^T, λ)^T in a compact set $ϕ_{λ}^{*}$ ,

E [U (ϕ, λ) ∣ δ = 1, W] = q (W, ϕ, λ, ν)

(9)

where ν = ν (ϕ, λ) is an unknown parameter vector and q(·,ϕ, λ, ν) is a known smooth function in (ϕ^T, λ)^T. Denote ν̂ = ν̂ (ϕ, λ) a uniformly consistent estimator for ν(ϕ, λ) in $(ϕ, λ) \in ϕ_{λ}^{*}$ . We define

U_{i}^{aug} (ϕ, λ, ψ, ν) = U_{i} (ϕ, λ) \frac{R_{i}}{π (W_{i}, ψ)} + q (W_{i}, ϕ, λ, ν) (1 - \frac{R_{i}}{π (W_{i}, ψ)}), i \in I,

and $U^{aug} (ϕ, λ, ϕ, ν) = \sum_{i \in I} U_{i}^{aug} (ϕ, λ, ψ, ν)$ . The augmented (AUG) IPW estimator for (ϕ^T, λ)^T is defined as the solution to the augmented IPW estimating equations U^aug(ϕ, λ, ϕ^, ν̂) = 0 and denoted by ${({\hat{ϕ}}_{aug}^{T}, {\hat{λ}}_{aug})}^{T}$ .

4 Asymptotic properties

Define x^⊗2 = xx^T for x ∈ ℝ^p. Let $H_{i} (ω) = {(U_{i}^{ipw} {(ϕ, λ, ψ)}^{T}, S_{ψ, i} {(ψ)}^{T})}^{T}$ with ω = (ϕ^T, λ, ψ^T )^T. We assume the regularity conditions expressed as Condition A in Supplement A. In addition, let $m_{0} = \sum_{i = 1}^{n} δ_{i} (1 - Z_{i}), m_{1} = \sum_{i = 1}^{n} δ_{i} Z_{i}$ and m = m₀ + m₁. Denote ξ_mi = m_i/m and assume that ξ_mi → ξ_i > 0, i = 0, 1, as m → ∞.

4.1 Asymptotic properties of the inverse probability weighted complete-case estimator

The following two theorems show that ${({\hat{ϕ}}_{ipw}^{T}, {\hat{λ}}_{ipw})}^{T}$ is a consistent estimator and characterize its asymptotic distribution.

Theorem 1

If (1), (2), (8), and Condition A in Supplement A are true, ${(ϕ_{ipw}^{T}, λ_{ipw})}^{T} \overset{P}{\to} {(ϕ^{T}, λ)}^{T}$ as m → ∞.

Theorem 2

If (1), (2), (8), and Condition A in Supplement A are true, then (i) with probability approaching 1, ${({\hat{ϕ}}_{ipw}^{T}, {\hat{λ}}_{ipw})}^{T}$ exists and is unique; (ii) $m^{\frac{1}{2}} ({({\hat{ϕ}}_{ipw}^{T}, {\hat{λ}}_{ipw})}^{T} - {(ϕ^{T}, λ)}^{T})$ is asymptotically normal with mean 0 and variance J⁻¹D(J⁻¹)^T, which can be consistently estimated by Ĵ ⁻¹ D̂ (Ĵ⁻¹)^T, where

\begin{array}{l} J = E {\partial U_{i}^{ipw} (ϕ, λ, ψ_{0}) / \partial (ϕ^{T}, λ)}, & D = E {resid {(U_{i}^{ipw}, S_{ψ, i})}^{\otimes 2}}, \\ U_{i}^{ipw} = U_{i}^{ipw} (ϕ, λ, ψ_{0}), & S_{ψ, i} = S_{ψ, i} (ψ_{0}), \end{array}

with resid(A,B) = A – E(AB^T )(E(BB^T ))⁻¹B the residual vector from the population least squares regressions of the components of A on B,

\hat{J} = m^{- 1} \sum_{i \in I} \partial U_{i}^{ipw} ({\hat{ϕ}}_{ipw}, {\hat{λ}}_{ipw}, \hat{ψ}) / \partial (ϕ^{T}, λ), \hat{D} = m^{- 1} \sum_{i \in I} Resid {(U_{i}^{ipw}, S_{ψ, i})}^{\otimes 2},

with $Resid (U_{i}^{ipw}, S_{ψ, i})$ the residual vector for subject i from the least squares regressions of the components of $U_{i}^{ipw} ({\hat{ϕ}}_{ipw}, {\hat{λ}}_{ipw}, \hat{ψ})$ on S_ψ,i(ψ̂), i ∈ I.

4.2 Asymptotic properties of the augmented IPW complete-case estimator

In Condition A, we now consider $H_{i} (ω) = {(U_{i}^{aug} {(ϕ, λ, ψ, ν)}^{T}, S_{ψ, i} {(ψ)}^{T})}^{T}$ with ω = (ϕ^T, λ, ψ^T )^T. The next theorem shows that ${({\hat{ϕ}}_{aug}^{T}, {\hat{λ}}_{aug})}^{T}$ remains a consistent estimator if either π(w, ψ) or q(w,ϕ, λ, ν) is correctly specified, the double robustness property.

Theorem 3

Under validity of (1), (2), and Condition A in Supplement A, ${(ϕ_{aug}^{T}, λ_{aug})}^{T} \overset{P}{\to} {(ϕ^{T}, λ)}^{T}$ as m → ∞ if either π(w, ψ) or q(w,ϕ, λ, ν) is correctly specified.

In Theorem 4, we characterize the asymptotic distribution of the AUG estimator ${(ϕ_{aug}^{T}, λ_{aug})}^{T}$ .

Theorem 4

If (1), (2), (8), (9) with ν = ν₀, and Condition A in Supplement A are true, then $m^{\frac{1}{2}} ({({\hat{ϕ}}_{aug}^{T}, {\hat{λ}}_{aug})}^{T} - {(ϕ^{T}, λ)}^{T})$ is asymptotically normal with mean 0 and variance $J_{*}^{- 1} D_{*} {(J_{*}^{- 1})}^{T}$ , which can be consistently estimated by ${\hat{J}}_{*}^{- 1} {\hat{D}}_{*} {({\hat{J}}_{*}^{- 1})}^{T}$ , where

\begin{array}{l} J_{*} = E {\partial U_{i}^{aug} (ϕ, λ, ψ_{0}, ν_{0}) / \partial (ϕ^{T}, λ)}, & D_{*} = E {resid {(U_{i}^{aug}, S_{ψ, i})}^{\otimes 2}}, \\ U_{i}^{aug} = U_{i}^{aug} (ϕ, λ, ψ_{0}, ν_{0}), & S_{ψ, i} = S_{ψ, i} (ψ_{0}), \end{array}

with resid(A,B) = A – E(AB^T )(E(BB^T ))⁻¹B the residual vector from the population least squares regressions of the components of A on B,

{\hat{J}}_{*} = m^{- 1} \sum_{i \in I} \partial U_{i}^{aug} ({\hat{ϕ}}_{aug}, {\hat{λ}}_{aug}, \hat{ψ}, \hat{ν}) / \partial (ϕ^{T}, λ), {\hat{D}}_{*} = m^{- 1} \sum_{i \in I} Resid {(U_{i}^{aug}, S_{ψ, i})}^{\otimes 2},

with $Resid (U_{i}^{aug}, S_{ψ, i})$ the residual vector for subject i from the least squares regressions of the components of $U_{i}^{aug} ({\hat{ϕ}}_{aug}, {\hat{λ}}_{aug}, \hat{ψ}, \hat{ν})$ on S_ψ,i(ψ̂), i ∈ I.

The joint asymptotic distribution of the Euclidean parameter estimators in the density ratio and Cox models is required for asymptotic-based analytic inference about V E(t, v). To this end, analogous IPW and AUG versions of Theorem 1 (and Corollary 1) in Juraska and Gilbert (2013) can be established by replacing the complete-case profile score U_i(ϕ, λ) with $U_{i}^{ipw} (ϕ, λ, \hat{ψ})$ and $U_{i}^{aug} (ϕ, λ, \hat{ψ}, \hat{ν})$ , respectively. The results subsequently lead to the construction of IPW and AUG versions of asymptotic pointwise Wald confidence interval for V E(t, v).

5 Hypothesis testing

Under complete mark data in failures, Juraska and Gilbert (2013) proposed Wald-type procedures for testing whether the mark-specific hazard ratio is (i) unity or (ii) independent of the mark. Problems (i) and (ii) are equivalent to the null hypothesis of zero VE against any exposing virus and VE not varying by viral divergence, respectively. In the presence of missing marks, we consider IPW and AUG versions of the Wald test statistics that are the same as those in Juraska and Gilbert (2013) except they replace the complete-mark estimator ϕ̂ and its variance with ϕ̂_ipw or ϕ̂_aug and the corresponding variance.

6 Simulation study

We conducted a simulation study to investigate finite-sample performance of the proposed estimation and testing procedures in the presence of missing marks. The augmented IPW estimator (AUG) for V E(v) = 1 – e^α⁺^βv⁺^γ, v ∈ [0, 1], i.e., assuming a monotone decline of V E(v) in v, is compared to the complete-case estimator (CC), which ignores information about failures with missing marks, and the IPW complete-case estimator. We additionally compare the aforementioned estimators to the (unobtainable) full data likelihood estimator (Full) that uses the complete set of marks before a fraction of them is deleted. Finally, size and power of the associated Wald tests are examined.

The simulation setup emulates typical placebo-controlled HIV VE trials with a 2–3 fold higher infection rate to aid efficiency of the simulation study. We specify exponential failure times T with rates η and ηe^γ in the placebo and vaccine group, respectively, where η = log(0.7)/(−3) is chosen so that 70% of failure times in the placebo group are administratively censored at year 3. Independently of T, we specify censoring times C to be Uniform(0, 15) in each group. The observed time on study X is defined as min(T, C, 3). The vaccine-to-placebo assignment ratio is 1:1. A univariate complete mark V for placebo and vaccine failures is specified to follow distributions with density functions f(v|Z = 0) = {2e⁻²^v/(1 − e⁻²)} I(0 ≤ v ≤ 1) and f(v|Z = 1) = f(v|Z = 0)e^α⁺^βv, respectively, where, for a given value of β, the value of the parameter α = α(β) is defined as the solution to $\int_{0}^{1} f (v ∣ Z = 0) e^{α + β v} d v = 1$ . This mechanism ensures that the mark V follows model (7) with g(v, α, β) = e^α⁺^βv.

We consider simulation scenarios (S1)–(S4) characterized, respectively, by the following values of (β, γ): (0, 0), (0.5, −0.8), (1.2, −0.2) and (2.1, −1.3). The per-arm sample size N = 1481 corresponds to the expected number of N_pI = 400 observed placebo failures by year 3. In the vaccine group, the expected number of observed failures N_vI additionally depends on the marginal log hazard ratio, and thus, in scenarios (S1)–(S4), N_vI = 400, 188, 334, and 116. The pertaining V E(v) functions in (S1)–(S4) are displayed in Supplementary Figure 1; for ease of orientation, in subsequent tables and figures, scenarios (S1)–(S4) are interchangeably labeled by their corresponding V E(0) and V E(1) values. The results are based on 10⁴ replicated data sets.

6.1 Assessment of the IPW and AUG estimation and testing procedures under correctly specified missing mark models

We generate complete-case indicators R with conditional probabilities π(W) = P(R = 1|δ = 1, W) satisfying the models

\begin{array}{l} (M 1) : logit π (W, ψ) = ψ_{0} + ψ_{1} Z + ψ_{2} A + ψ_{3} Z A, \\ (M 2) : logit π (W, ψ) = ψ_{0} + ψ_{1} Z + ψ_{2} A^{*} + ψ_{3} Z A^{*}, \\ (M 3) : logit π (W, ψ) = ψ_{0} + ψ_{1} Z . \end{array}

We assume a continuous auxiliary variable A representing, e.g., a distance based on a sequence measured in the post-seroconversion phase, that, conditional on V, follows the model

A = {(1 + κ)}^{- 1} (V + κ U), κ > 0,

(10)

where U ~ Uniform(0, 1), independent of V. The parameter κ governs the level of association between V and A. For each of (M1)–(M3), we evaluate AUG estimators corresponding to six scenarios with varying levels of correlation between V and A: AUG-1 for κ = 0.2 corresponding to ρ ≈ 0.98, AUG-2 for κ = 0.4 with ρ ≈ 0.92, AUG-3 for κ = 0.8 with ρ ≈ 0.76, AUG-4 for κ = 1.6 with ρ ≈ 0.5, AUG-5 for κ = 3.6 with ρ ≈ 0.25, and AUG-6 for κ = 10⁵ with ρ ≈ 0 where ρ denotes the correlation coefficient between V and A. For model (M1), we analogously study six IPW estimators, whereas each of models (M2) and (M3) evaluates a single IPW estimator. In model (M2), we consider a dichotomous auxiliary covariate A*, e.g., a near vs. distant post-acquisition virus, or a low vs. high viral load, that is generated in two steps: first, generate Ã following (10) with κ = 0.4, and second, generate A* from Bernoulli(Ã). We include settings with high correlations because of their feasibility in real data sets as between-subject HIV sequence diversity is considerably larger than within-subject HIV sequence diversity (Keele et al, 2008).

In (M1)–(M3), we consider the following values of ψ Z with respective missing mark rates among failures in the placebo and vaccine group:

(M1)
ψ = (−2, 0.4, 0.5, 0.8) resulting in ≈ 87% and 73% of failures with missing marks for κ = 0.4 (minor variations for the other κ values);
(M2)
ψ = (−2, 0.4, 0.5, 0.8) resulting in ≈ 86% and 70% of failures with missing marks;
(M3)
ψ = (−0.8, 0.5) resulting in ≈ 69% and 58% of failures with missing marks.

The AUG estimator additionally requires estimation of the parameter E[U(ϕ, λ)|W]. Rotnitzky and Robins (1995) advocate the use of highly parameterized regression models in this setting. We posit a linear regression model E[U(ϕ, λ)|W] = q(W, ν) of the form

E [U (ϕ, λ) ∣ W] = ν_{0} + ν_{1} Z + ν_{2} A + ν_{3} A^{2} + ν_{4} Z A;

(11)

fitted for subjects with R_i = 1 by the ordinary least squares method. Predicted values Ê [U_i(ϕ, λ)|W_i] used to construct the AUG estimator are specified for subjects with δ_i = 1 as Ê [U_i(ϕ, λ)|W_i] = q(W_i, ν̂). In each simulation scenario, the model (11) is exible but likely to be mis-specified. More recently, van der Laan and Rose (2011) described the targeted learning approach, which (i) utilizes cross-validation for selection of the optimal estimator, and (ii) considers targeted estimation of the data-generating distribution toward the parameter of interest. We conjecture that targeted learning would improve estimation of E[U(ϕ, λ)|W]; however, its computational challenges exceed the scope of this paper.

Figures 1–2 and Supplementary Figures 2–5 present operating characteristics of four estimation procedures – Full, AUG, IPW and CC – under missing mark model (M2); results under (M1) and (M3) are included in Supplement B. Finite-sample bias of the AUG and IPW estimators for V E(v) is small under each of (M1)–(M3) and increases in magnitude as ρ approaches 0. It tends to be smaller for the AUG than the IPW estimator, sometimes markedly so (Figure 1 and Supplementary Figure 12). Bias of the AUG-1 estimator tends to be close to that of the Full estimator representing the unachievable benchmark. Under (M1) and (M2), the CC estimator is substantially biased illustrating the inappropriateness of its use when marks are missing at random (Supplementary Figure 6 and Figure 1). The change in bias along the mark support is similar for each estimation procedure. It is smallest for v in the central region and tends to increase in magnitude in the right tail of the mark distribution.

Fig. 1 — Bias of estimation for *V E*(v) in simulation scenarios (S1)–(S4) using four procedures – Full, IPW, CC and AUG – for the missing mark model (M2). Estimators AUG-1 corresponding to ρ ≈ 0.98, AUG-2 for ρ ≈ 0.92, and AUG-3 for ρ ≈ 0.76 are evaluated here, while estimators AUG-4 for ρ ≈ 0.5, AUG-5 for ρ ≈ 0.25, and AUG-6 for ρ ≈ 0 are evaluated in Supplementary Figure 2.

Fig. 2 — Median asymptotic standard error estimates of $\hat{V E} (v)$ in simulation scenarios (S1)–(S4) using four procedures – Full, IPW, CC and AUG – for the missing mark model (M2). Estimators AUG-1 corresponding to ρ ≈ 0.98, AUG-2 for ρ ≈ 0.92, and AUG-3 for ρ ≈ 0.76 are evaluated here, while estimators AUG-4 for ρ ≈ 0.5, AUG-5 for ρ ≈ 0.25, and AUG-6 for ρ ≈ 0 are evaluated in Supplementary Figure 3.

Median asymptotic standard error estimates of $\hat{V E} (v)$ indicate that substantial efficiency gain of the AUG estimator may be achieved relative to the IPW estimator for ρ > 0.7. The most pronounced efficiency gains are observed under (M2) and (M3) for ρ > 0.7 (Figure 2 and Supplementary Figure 14), with the AUG-1 estimator nearing efficiency of the Full estimator. Under (M1), AUG-k proves to be more efficient than IPW-k for k ≤ 2; for k > 2, the efficiency gain is unobserved. In each setting, the CC estimator is highly inefficient.

Coverage probabilities of 95% pointwise confidence intervals are near the nominal confidence level for scenarios (S1)–(S2) and ρ > 0.7. For scenarios (S3)–(S4) and ρ ≤ 0.5, coverage probabilities uctuate moderately. We verified that, in (S3)–(S4), adequate coverage probabilities were attained for lower missing mark rates if ρ > 0.7. Supplement B additionally summarizes operating characteristics of estimating the mark coefficient β in density ratio model (7).

Table 1 and Supplementary Tables 7–9 report size and power of three Wald tests, each based on four estimation procedures – Full, AUG, IPW and CC: two-sided Wald test and one-sided weighted Wald-type test of $H_{0}^{0} : V E (v) = 0$ for all v ∈ [0, 1] (or, equivalently, ${\tilde{H}}_{0}^{0} : β = 0$ and γ = 0), and two-sided Wald test of H₀ : V E(v) = V E for all v ∈ [0, 1] (or, equivalently, H̃₀ : β = 0). (All three tests are characterized in detail in Juraska and Gilbert (2013) for complete mark data.) Sizes of the Wald tests induced by the AUG and IPW estimators are in accordance with the nominal significance level under each of missing mark models (M1)–(M3). In contrast, for the CC estimator, the size of each Wald test tends to be too large.

Table 1.

Size of Wald tests of { $H_{0}^{0} : V E (v) = 0$ for all v ∈ [0, 1]} and {H₀ : V E(v) = V E for all v ∈ [0, 1]} using four procedures – Full, IPW, CC and AUG – under missing mark models (M1)–(M3). Estimators AUG-1 corresponding to ρ ≈ 0.98, AUG-2 for ρ ≈ 0.92, and AUG-3 for ρ ≈ 0.76 are evaluated here, while estimators AUG-4 for ρ ≈ 0.5, AUG-5 for ρ ≈ 0.25, and AUG-6 for ρ ≈ 0 are evaluated in Supplementary Table 7. For model (M1), the IPW estimator considers the case ρ ≈ 0.98.

Missingness model

Estimator

Full

AUG-1

AUG-2

AUG-3

IPW

Size of Wald test of

H_{0}^{0} (α = 0.05)

(M1)

0.049

0.054

0.055

0.046

0.081

(M2)

0.049

0.050

0.075

(M3)

0.049

0.048

0.055

0.056

0.048

Size of weighted Wald-type test of

H_{0}^{0} (α = 0.025)

(M1)

0.023

0.025

0.028

0.025

0.028

(M2)

0.023

0.026

0.023

0.024

0.029

(M3)

0.023

0.025

0.029

0.024

0.026

Size of Wald test of H₀ (α = 0.05)

(M1)

0.050

0.048

0.054

0.056

0.048

0.102

(M2)

0.050

0.054

0.050

0.054

0.088

(M3)

0.050

0.052

0.054

0.047

0.046

Open in a new tab

6.2 Robustness analysis of the IPW and AUG estimation and testing procedures under mis-specified missing mark models

We investigate robustness of the proposed estimators and hypothesis tests to mis-specification of π(w, ψ) and to violation of the missing at random assumption. To study robustness to mis specification of π(w, ψ), we assume validity of model (M3) while the complete-case indicator R is generated with the conditional probability π(W, ψ) satisfying

(M 4) : logit π (W, ψ) = ψ_{0} + ψ_{1} Z + ψ_{2} X .

We consider ψ = (−0.8, 0.5, −0.5) such that the mark is missing in approximately 82% and 71% of failures in the placebo and vaccine group, respectively.

To examine robustness to violation of the missing at random assumption, we assume validity of model (M3) while the complete-case indicator R depends on V conditionally on W following the model

(M 5) : logit π (W, ψ) = ψ_{0} + ψ_{1} Z + ψ_{2} V .

Here we consider ψ = (−0.1, 0.5, −2) which results in approximately 71% and 63% of failures with missing marks in the placebo and vaccine group, respectively.

The IPW estimator is moderately biased when the missing mark model is mis-specified, while the AUG estimator exhibits smaller bias approaching that of the Full estimator with an increase in ρ (Supplementary Figures 18–19). Violation of the MAR assumption induces severe bias of the IPW estimator (Supplementary Figures 22–23). On the contrary, bias of the AUG estimator is markedly smaller for ρ > 0.7. Under both (M4) and (M5), bias of the CC estimator is severe.

Under (M4), the IPW estimator is visibly less efficient than the AUG estimator for ρ > 0.7, with the AUG-1 estimator retaining efficiency close to that of the Full estimator (Supplementary Figure 20). Under (M5), the AUG estimator appears to be robust to violation of the MAR assumption for the considered scenarios with ρ > 0.7, with the AUG-1 estimator exhibiting minimal efficiency loss compared to the Full estimator (Supplementary Figure 24). In (M4) and (M5), all estimated weights π(W_i, ψ̂) were bounded away from zero. Supplement C additionally reports robustness analysis results in estimation of the density ratio coefficient β.

Size of the Wald tests appears to be robust to model mis-specification (Supplementary Tables 14–15). Under violation of the MAR assumption, size of each Wald test tends to slightly exceed the nominal significance level.

6.3 Assessment of the IPW and AUG estimators versus a simpler complete-case single-imputation estimator when marks are missing completely at random

It is of interest to evaluate whether we lose efficiency by using the missing at random IPW and AUG methods if marks are missing completely at random (MCAR) such that simpler complete-case methods could be used. Relatedly, for applications with auxiliaries A highly predictive of V, it is of interest to compare the IPW and AUG estimators to a simpler complete-case estimator. In particular, we consider an estimator that fills in each missing V based on a parametric regression model linking the mean of V to A and then treats the imputed V’s as known values; this is a regression calibration type estimator that we refer to as the complete-case after imputation (CCIM) estimator.

To compare IPW vs. AUG vs. CC-IM, we add a simulation scenario that assumes validity of the (over-parametrized) model (M1) while the complete-case indicator R satisfies the MCAR model

(M 6) : logit π (W, ψ) = ψ_{0} .

We consider ψ₀ = −0.85 which results in approximately 70% of failures with missing marks in both the placebo and vaccine groups. For each level of correlation between V and A specified in Section 6.1, missing V’s for the CC-IM estimator are filled in with predicted values from a beta regression model (Ferrari and Cribari-Neto, 2004) with the logit link and A as a lone predictor, fitted using complete cases. We compare the Monte-Carlo variance of all estimators as the derivation of the analytic variance of the CC-IM estimator is beyond the scope of this article.

Supplementary Figure 29 indicates that the IPW and AUG estimators do not lose efficiency if marks are MCAR and no auxiliary information about V is available (i.e., ρ ≈ 0). Additionally, as ρ increases, the IPW and AUG estimators gain efficiency over the CC estimator (Supplementary Figure 28). Under MCAR, finite-sample bias of the CC-IM estimator is small for highly correlated V and A and becomes larger for ρ < 0.7, particularly in scenario (S4) (Supplementary Figures 26–27). Overall, the CC-IM estimator has similar performance as the AUG estimator in this simulation study, making it appealing in practice given it is simpler to implement. However, as a regression calibration estimator CC-IM has the theoretical disadvantage of not being a consistent estimator.

7 Application

In the Step Phase IIb HIV vaccine trial, 1836 HIV-negative men were randomized to receive either the Merck adenovirus type 5 (Ad5) vector vaccine (MRKAd5) or placebo (Buchbinder et al, 2008). Of the 1836 subjects, 88 acquired HIV infection prior to unblinding (52 in the vaccine and 36 in the placebo group). Single-genome-amplification HIV sequences were successfully measured in 65 of the 88 infections. It is of particular scientific interest to analyze sequence data collected early upon HIV acquisition, prior to the onset of post-acquisition HIV evolution. We define the early mark as satisfying at least one of the following two conditions: (i) the sample originates from the estimated Fiebig stage I or II with negative ELISA and negative Western Blot (Keele et al, 2008); and (ii) all measured Nef sequences from an infected subject are identical (this lack of detected viral diversification suggests limited time elapsed between acquisition and virus sampling). Of the total 88 infected subjects included in the analysis, 22 (25.0%) have observed early marks (11 vaccine, 11 placebo).

The Step trial showed the estimated overall vaccine-to-placebo hazard ratio (HR) of 1.50 (95% CI, 0.95–2.41, p = 0.06) suggesting that the vaccine elevated the risk of HIV acquisition. Consequently, we sought to study if the elevated risk was greater against some HIV genotypes. To this end, we analyzed distances defined based on four HIV proteins, Gag, Pol, and Nef, each of which was represented in the vaccine, and the complement of the whole HIV genome outside the vaccine (Env-Rev-Tat-Vif-Vpr-Vpu) using the central HXB2 reference sequence. For each genomic region, bivariate distances (V₁, V₂) were computed based on two bioinformatics methods, Epipred (Heckerman et al, 2007) (defining V₁) and NetMHC (Buus et al, 2003) (defining V₂), used to predict, for each infected subject, HIV peptides that could bind to the subject’s HLA molecules, and thus be part of potential T-cell epitopes. These distances, calculated as described in Rolland et al (2011), quantify the amino acid divergence in known and predicted T-cell epitopes present in the reference sequence compared to the subjects’ sequences, with Epipred based on a training set of known T-cell epitopes and putative non-T-cell epitopes, and NetMHC based on predicted binding strength between HLA molecules and HIV peptides.

To predict the probability of observing the early mark, we used a logistic regression model with the number of days from randomization to the first HIV PCR+ test as the lone predictor (this variable is predictive because HIV diagnostic testing was more frequent early in follow-up). In addition, using the AUG estimation method, we specified a linear regression model for the expected profile scores conditional on the number of days from randomization to the first HIV PCR+ test, the vaccine group indicator, and viral load sampled at the date of the first PCR+ test. These three auxiliary covariates were selected from available demographics, host genetics and post-infection biomarker measurements using all-subsets model selection by BIC. Each of these auxiliaries was a weak predictor, which is not surprising given that heuristically none of these auxiliaries is expected to be related to HIV sequence distance. However, due to resource constraints the study did not measure the critical auxiliary information that would be highly predictive (HIV sequence data measured at a later post-acquisition sampling time point), and, given this absent auxiliary, empirical selection of the best predictive model from all input features regardless of their level of biological plausibility was deemed a reasonable approach.

We specified the time from randomization to the first HIV PCR+ test as the failure time variable T in the analysis. Each mark variable was standardized by subtracting its minimum and dividing by its range. Figure 3 shows the distribution of the observed V₁ and V₂ distances by the vaccine/placebo group for each HIV protein. We initially fit density ratio model (7) with the weight function

Fig. 3 — Distribution of the Step early distances to the vaccine or HXB2 reference sequence based on Gag, Pol, Nef, and Env-Rev-Tat-Vif-Vpr-Vpu sites identified using Epipred and NetMHC epitope prediction methods.

g (v, α, β) = exp {α + β_{1} v_{1} + β_{2} v_{2} + β_{12} v_{1} v_{2}}

(12)

and found no significant evidence for an interaction β₁₂ ≠ 0; the Wald tests yielded two-sided interaction p-values of 0.55 for Gag, 0.14 for Pol, 0.18 for Nef, and 0.50 for Env-Rev-Tat-Vif-Vpr-Vpu. Subsequently, model (12) excluding the interaction term was used for reporting inferential results for each HIV protein.

Due to the higher infection rate in the vaccine group, instead of the mark-specific VE, we present the results as the mark-specific hazard ratio HR(v₁, v₂) = exp{α+β₁v₁+β₂v₂+γ} characterizing whether and how the vaccine selectively elevated susceptibility to HIV infection. The estimated HR(v₁, v₂) surfaces for each protein are displayed in Supplementary Figure 30, with point and interval estimates for selected (v₁, v₂) = (0.2, 0.2), (0.5, 0.5) and (0.8, 0.8) reported in Table 2. We observe that the standard error estimates for $\hat{H R} (v_{1}, v_{2})$ are markedly smaller than those obtained in the complete-case proportional hazards model fit by Sun et al (2013). We found some evidence that the vaccine selectively modified the HIV acquisition risk in the Nef reference region (p = 0.01, Table 2), suggesting that vaccine-enhancement of HIV acquisition risk may have been stronger against HIVs with Nef sequences more similar to the vaccine. No such evidence was indicated in the other HIV proteins.

Table 2.

Inference about bivariate mark-specific hazard ratios with marks based on Epipred (V₁) and NetMHC (V₂) epitope prediction methods in 4 HIV genomic regions in the Step trial.

Gene	HR(0.2, 0.2)			HR(0.5, 0.5)			HR(0.8, 0.8)			P-value $H_{0}^{0}$ ^†	P-value H₀^‡
Gene	Est	SE	95% CI	Est	SE	95% CI	Est	SE	95% CI	P-value $H_{0}^{0}$ ^†	P-value H₀^‡
Gag	0.15	0.16	(0.02, 1.24)	0.85	0.44	(0.31, 2.34)	4.94	4.18	(0.94, 25.93)	0.05	0.10
Pol	1.46	0.58	(0.67, 3.19)	1.46	0.61	(0.65, 3.30)	1.46	1.44	(0.21, 10.07)	0.20	0.46
Nef	3.29	2.10	(0.94, 11.52)	2.30	0.89	(1.07, 4.93)	1.61	1.22	(0.36, 7.16)	0.01	0.01
Env^§	2.11	1.58	(0.48, 9.18)	1.47	0.49	(0.77, 2.84)	1.03	0.43	(0.46, 2.34)	0.07	0.15

Open in a new tab

^§

Env-Rev-Tat-Vif-Vpr-Vpu

^†

$H_{0}^{0} : H R (v_{1}, v_{2}) = 1$ for all (v₁, v₂) ∈ [0, 1]²

^‡

H₀: HR(v₁, v₂) = HR for all (v₁, v₂) ∈ [0, 1]²

The assumption T ⫫ (V₁, V₂)|Z was verified to be reasonable by application of the complete-case Kolmogorov–Smirnov-type diagnostic test (Juraska and Gilbert, 2013), which, using 10³ boot-strap iterations, yielded p-values in the 0.15–0.99 range in the vaccine and placebo groups for each HIV protein. The complete-case Qin and Zhang (1997) goodness-of-fit test of the final bivariate-mark density ratio model did not reject (p = 0.46 for Gag, 0.62 for Pol, 0.55 for Nef, and 0.22 for Env-Rev-Tat-Vif-Vpr-Vpu) (an extension of the diagnostic tests for incomplete mark data was beyond the scope of this paper). The proportional marginal hazards assumption was verified using the Grambsch and Therneau (1994) test (p = 0.69).

8 Discussion

Growing complexity of HIV vaccine designs and commonly missing early sequence data have motivated the development of inferential procedures in the multivariate mark-specific hazard ratio model accounting for missing at random marks. Although motivated by this specific scientific application, the article presents general methods of analysis of time-to-event data with continuous, possibly multivariate, marks subject to missingness at random. The presented methods also address the general problem of parameter estimation and hypothesis testing in density ratio/biased sampling models with missing data. Exponential forms of the weight function yield density ratio models that are equivalent to retrospective logistic regression models (Prentice and Pyke, 1979), suggesting an alternative approach to inference based on weighted estimating equations, developed in a general framework by Robins et al (1994) and further discussed in Zhao et al (1996).

We developed an IPW and an augmented IPW (AUG) estimator for V E(v). The AUG estimator is doubly robust (Scharfstein et al, 1999), performs well under correctly specified missing mark models, shows considerable robustness and efficiency under the considered mis-specified missing mark models, and maintains efficiency under MCAR mark models with no auxiliary information about the mark. The AUG estimator is superior to the IPW estimator in each setting and is recommended for use. It is of note that the AUG estimator carries potential to be severely biased if (i) both models in (8) and (9) are mis-specified, or (ii) the estimated weights π(W_i, ψ○) are outliers close to zero for some observations (e.g., Kang and Schafer (2007)). The approach of Tan (2006) partially addresses these shortcomings by using an alternative likelihood-based estimator for π(w). To improve robustness and efficiency of the AUG estimator to (i) and (ii), a modified doubly robust AUG estimator following the approach of Cao et al (2009) could be constructed in this setting.

In future work, it would be of interest to compare the AUG estimators to estimators based on multiple imputation, which would constitute an extension of Lu and Tsiatis (2001) to the case of a continuous mark V. A multiple imputation estimator would potentially improve efficiency given an approximately correctly specified model for predicting V; however, it would lack the double robustness property and hence may have more bias if it is difficult to model V. In future work, it would also be of interest to compare the AUG approach to measurement error approaches, for example based on profile likelihood or the conditional score function, and our initial work on a regression calibration estimator (Section 6.3) showed promising performance. Similar to the multiple imputation approaches, these methods would potentially improve efficiency under sufficiently accurate modeling of V but may have more bias in some contexts due to the lack of double robustness.

To the best of our knowledge, the problem of hypothesis testing for VE(v) [or, equivalently, HR(v)] with missing multivariate v has not been addressed in the existing published literature. For a univariate missing mark, the presented estimation method is an alternative to the Sun and Gilbert (2012) method, which estimates the same VE(v) estimand under the MAR assumption. In this setting, the discussion in Juraska and Gilbert (2013) applies and provides guidance for method selection. The R code implementing the proposed methods is available upon request.

Supplementary Material

10985_2015_9353_MOESM1_ESM

NIHMS734070-supplement-10985_2015_9353_MOESM1_ESM.pdf^{(862KB, pdf)}

Acknowledgments

The authors thank the participants, investigators, and sponsors of the HVTN 502 Step HIV vaccine trial. Research reported in this publication was supported by the National Institute of Allergy and Infectious Diseases of the National Institutes of Health under Award Numbers UM1AI068635 and R37AI054165 and by the Bill and Melinda Gates Foundation (BMGF) Award Number OPP1110049. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH or BMGF.

Footnotes

Ethical statement

Institutional review board approval for the HVTN 502 Step study was obtained at all study sites. The study was undertaken in conformance with applicable local and country requirements, and participants gave written informed consent.

Supporting information

Additional supporting material referenced in this paper is available at the website of Lifetime Data Analysis:

Supplement A: Theorem proofs.

Supplement B: Additional simulation results under correctly specified missing mark models.

Supplement C: Additional robustness analysis results under mis-specified missing mark models.

Supplement D: Additional simulation results when marks are missing completely at random.

Supplement E: Additional Step trial application results.

Contributor Information

Michal Juraska, Email: mjuraska@fredhutch.org, Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Mail Stop M2-C200, Seattle, WA 98109, USA Tel.: +1-206-667-5568.

Peter B. Gilbert, Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, and Department of Biostatistics, University of Washington, 1100 Fairview Avenue North, Mail Stop M2-C200, Seattle, WA 98109, USA.

References

Buchbinder SP, Mehrotra DV, Duerr A, Fitzgerald DW, Mogg R, Li D, Gilbert PB, Lama JR, Marmor M, del Rio C, McElrath MJ, Casimiro DR, Gottesdiener KM, Chodakewitz JA, Corey L, Robertson MN the Step Study Protocol Team. Efficacy assessment of a cell-mediated immunity HIV-1 vaccine (the Step Study): a double-blind, randomised, placebo-controlled, test-of- concept trial. Lancet. 2008;372(9653):1881–1893. doi: 10.1016/S0140-6736(08)61591-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Buus S, Lauemoller SL, Worning P, Kesmir C, Frimurer T, Corbet S, Fomsgaard A, Hilden J, Holm A, Brunak S. Sensitive quantitative predictions of peptide-mhc binding by a ‘query by committee’ artificial neural network approach. Tissue Antigens. 2003;62(5):378–384. doi: 10.1034/j.1399-0039.2003.00112.x. [DOI] [PubMed] [Google Scholar]
Cao W, Tsiatis AA, Davidian M. Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data. Biometrika. 2009;96(3):723–734. doi: 10.1093/biomet/asp033. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cox DR, Oakes D. Chapman and Hall/CRC Monographs on Statistics and Applied Probability Series. Chapman and Hall; 1984. Analysis of Survival Data. [Google Scholar]
Ferrari S, Cribari-Neto F. Beta regression for modelling rates and proportions. Journal of Applied Statistics. 2004;31(7):799–815. [Google Scholar]
Gilbert PB. Large sample theory of maximum likelihood estimates in semiparametric biased sampling models. Annals of Statistics. 2000;28(1):151–194. [Google Scholar]
Gilbert PB, McKeague IW, Sun Y. The 2-sample problem for failure rates depending on a continuous mark: an application to vaccine efficacy. Biostatistics. 2008;9(2):263–276. doi: 10.1093/biostatistics/kxm028. [DOI] [PubMed] [Google Scholar]
Gilbert PB, Lele SR, Vardi Y. Maximum likelihood estimation in semiparametric selection bias models with application to AIDS vaccine trials. Biometrika. 1999;86(1):27–43. [Google Scholar]
Gilbert PB, McKeague IW, Sun Y. Tests for comparing mark-specific hazards and cumulative incidence functions. Lifetime Data Analysis. 2004;10:5–28. doi: 10.1023/b:lida.0000019253.69537.91. [DOI] [PubMed] [Google Scholar]
Goetghebeur E, Ryan L. Analysis of competing risks survival data when some failure types are missing. Biometrika. 1995;82(4):821–833. [Google Scholar]
Grambsch PM, Therneau TM. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika. 1994;81(3):515–526. doi: 10.1093/biomet/81.3.515. [DOI] [Google Scholar]
Halloran ME, Struchiner CJ, Longini IM. Study designs for evaluating different efficacy and effectiveness aspects of vaccines. American Journal of Epidemiology. 1997;146(10):789–803. doi: 10.1093/oxfordjournals.aje.a009196. [DOI] [PubMed] [Google Scholar]
Heckerman D, Kadie C, Listgarten J. Leveraging information across hla alleles/supertypes improves epitope prediction. Journal of Computational Biology. 2007;14:736–746. doi: 10.1089/cmb.2007.R013. [DOI] [PubMed] [Google Scholar]
Horvitz DG, Thompson DJ. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association. 1952;47(260):663–685. [Google Scholar]
Juraska M, Gilbert PB. Mark-specific hazard ratio model with multivariate continuous marks: An application to vaccine efficacy. Biometrics. 2013;69(2):328–337. doi: 10.1111/biom.12016. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kang JDY, Schafer JL. Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science. 2007;22(4):523–539. doi: 10.1214/07-STS227. [DOI] [PMC free article] [PubMed] [Google Scholar]
Keele B, Giorgi E, Salazar-Gonzalez J, Decker J, Pham K, Salazar M, Sun C, Grayson T. Identification and characterization of transmitted and early founder virus envelopes in primary HIV-1 infection. Proceedings of the National Academy of Sciences. 2008;105:7552–7557. doi: 10.1073/pnas.0802203105. [DOI] [PMC free article] [PubMed] [Google Scholar]
van der Laan M, Rose S. Springer Series in Statistics. Springer; 2011. Targeted Learning. [Google Scholar]
Lu K, Tsiatis AA. Multiple imputation methods for estimating regression coefficients in the competing risks model with missing cause of failure. Biometrics. 2001;57:1191–1197. doi: 10.1111/j.0006-341x.2001.01191.x. [DOI] [PubMed] [Google Scholar]
Lu X, Tsiatis AA. Improving the efficiency of the log-rank test using auxiliary covariates. Biometrika. 2008;95(3):679–694. doi: 10.1093/biomet/asn003. [DOI] [Google Scholar]
Prentice RL, Pyke R. Logistic disease incidence models and case-control studies. Biometrika. 1979;66(3):403–411. doi: 10.2307/2335158. [DOI] [Google Scholar]
Prentice RL, Kalbeisch JD, Peterson JAV, Flournoy N, Farewell VT, Breslow NE. The analysis of failure times in the presence of competing risks. Biometrics. 1978;34(4):541–554. [PubMed] [Google Scholar]
Qin J. Inferences for case-control and semiparametric two-sample density ratio models. Biometrika. 1998;85(3):619–630. [Google Scholar]
Qin J, Zhang B. A goodness-of-fit test for logistic regression models based on case-control data. Biometrika. 1997;84(3):609–618. [Google Scholar]
Robins JM, Rotnitzky A, Zhao LP. Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association. 1994;89(427):846–866. [Google Scholar]
Rolland M, Tovanabutra S, deCamp AC, Frahm N, Gilbert PB, Sanders-Buell E, Heath L, Magaret CA, Bose M, Bradfield A, O’Sullivan A, Crossler J, Jones T, Nau M, Wong K, Zhao H, Raugi DN, Sorensen S, Stoddard JN, Maust BS, Deng W, Hural J, Dubey S, Michael NL, Shiver J, Corey L, Li F, Self SG, Kim J, Buchbinder S, Casimiro DR, Robertson MN, Duerr A, McElrath MJ, McCutchan FE, Mullins JI. Genetic impact of vaccination on breakthrough HIV-1 sequences from the STEP trial. Nature Medicine. 2011;17(3):366–U168. doi: 10.1038/nm.2316. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rotnitzky A, Robins JM. Semiparametric regression estimation in the presence of dependent censoring. Biometrika. 1995;82(4):805–820. doi: 10.1093/biomet/82.4.805. [DOI] [Google Scholar]
Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–590. [Google Scholar]
Scharfstein DO, Rotnitzky A, Robins JM. Adjusting for nonignorable drop-out using semi-parametric nonresponse models: Rejoinder. Journal of the American Statistical Association. 1999;94(448):1135–1146. [Google Scholar]
Sun Y, Gilbert PB. Estimation of stratified mark-specific proportional hazards models with missing marks. Scandinavian Journal of Statistics. 2012;39:34–52. doi: 10.1111/j.1467-9469.2011.00746.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sun Y, Gilbert PB, McKeague IW. Proportional hazards models with continuous marks. Ann Stat. 2009;37(1):394–426. doi: 10.1214/07-AOS554. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sun Y, Li M, Gilbert PB. Mark-specific proportional hazards model with multivariate continuous marks and its application to HIV vaccine efficacy trials. Biostatistics. 2013;14(1):60–74. doi: 10.1093/biostatistics/kxs022. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tan Z. A distributional approach for causal inference using propensity scores. Journal of the American Statistical Association. 2006;101(476):1619–1637. [Google Scholar]
Vardi Y. Empirical distributions in selection bias models. The Annals of Statistics. 1985;13(1):178–203. [Google Scholar]
Zhao LP, Lipsitz S, Lew D. Regression analysis with missing covariate data using estimating equations. Biometrics. 1996;52(4):1165–1182. doi: 10.2307/2532833. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

10985_2015_9353_MOESM1_ESM

NIHMS734070-supplement-10985_2015_9353_MOESM1_ESM.pdf^{(862KB, pdf)}

[R1] Buchbinder SP, Mehrotra DV, Duerr A, Fitzgerald DW, Mogg R, Li D, Gilbert PB, Lama JR, Marmor M, del Rio C, McElrath MJ, Casimiro DR, Gottesdiener KM, Chodakewitz JA, Corey L, Robertson MN the Step Study Protocol Team. Efficacy assessment of a cell-mediated immunity HIV-1 vaccine (the Step Study): a double-blind, randomised, placebo-controlled, test-of- concept trial. Lancet. 2008;372(9653):1881–1893. doi: 10.1016/S0140-6736(08)61591-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] Buus S, Lauemoller SL, Worning P, Kesmir C, Frimurer T, Corbet S, Fomsgaard A, Hilden J, Holm A, Brunak S. Sensitive quantitative predictions of peptide-mhc binding by a ‘query by committee’ artificial neural network approach. Tissue Antigens. 2003;62(5):378–384. doi: 10.1034/j.1399-0039.2003.00112.x. [DOI] [PubMed] [Google Scholar]

[R3] Cao W, Tsiatis AA, Davidian M. Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data. Biometrika. 2009;96(3):723–734. doi: 10.1093/biomet/asp033. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] Cox DR, Oakes D. Chapman and Hall/CRC Monographs on Statistics and Applied Probability Series. Chapman and Hall; 1984. Analysis of Survival Data. [Google Scholar]

[R5] Ferrari S, Cribari-Neto F. Beta regression for modelling rates and proportions. Journal of Applied Statistics. 2004;31(7):799–815. [Google Scholar]

[R6] Gilbert PB. Large sample theory of maximum likelihood estimates in semiparametric biased sampling models. Annals of Statistics. 2000;28(1):151–194. [Google Scholar]

[R7] Gilbert PB, McKeague IW, Sun Y. The 2-sample problem for failure rates depending on a continuous mark: an application to vaccine efficacy. Biostatistics. 2008;9(2):263–276. doi: 10.1093/biostatistics/kxm028. [DOI] [PubMed] [Google Scholar]

[R8] Gilbert PB, Lele SR, Vardi Y. Maximum likelihood estimation in semiparametric selection bias models with application to AIDS vaccine trials. Biometrika. 1999;86(1):27–43. [Google Scholar]

[R9] Gilbert PB, McKeague IW, Sun Y. Tests for comparing mark-specific hazards and cumulative incidence functions. Lifetime Data Analysis. 2004;10:5–28. doi: 10.1023/b:lida.0000019253.69537.91. [DOI] [PubMed] [Google Scholar]

[R10] Goetghebeur E, Ryan L. Analysis of competing risks survival data when some failure types are missing. Biometrika. 1995;82(4):821–833. [Google Scholar]

[R11] Grambsch PM, Therneau TM. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika. 1994;81(3):515–526. doi: 10.1093/biomet/81.3.515. [DOI] [Google Scholar]

[R12] Halloran ME, Struchiner CJ, Longini IM. Study designs for evaluating different efficacy and effectiveness aspects of vaccines. American Journal of Epidemiology. 1997;146(10):789–803. doi: 10.1093/oxfordjournals.aje.a009196. [DOI] [PubMed] [Google Scholar]

[R13] Heckerman D, Kadie C, Listgarten J. Leveraging information across hla alleles/supertypes improves epitope prediction. Journal of Computational Biology. 2007;14:736–746. doi: 10.1089/cmb.2007.R013. [DOI] [PubMed] [Google Scholar]

[R14] Horvitz DG, Thompson DJ. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association. 1952;47(260):663–685. [Google Scholar]

[R15] Juraska M, Gilbert PB. Mark-specific hazard ratio model with multivariate continuous marks: An application to vaccine efficacy. Biometrics. 2013;69(2):328–337. doi: 10.1111/biom.12016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Kang JDY, Schafer JL. Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science. 2007;22(4):523–539. doi: 10.1214/07-STS227. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] Keele B, Giorgi E, Salazar-Gonzalez J, Decker J, Pham K, Salazar M, Sun C, Grayson T. Identification and characterization of transmitted and early founder virus envelopes in primary HIV-1 infection. Proceedings of the National Academy of Sciences. 2008;105:7552–7557. doi: 10.1073/pnas.0802203105. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] van der Laan M, Rose S. Springer Series in Statistics. Springer; 2011. Targeted Learning. [Google Scholar]

[R19] Lu K, Tsiatis AA. Multiple imputation methods for estimating regression coefficients in the competing risks model with missing cause of failure. Biometrics. 2001;57:1191–1197. doi: 10.1111/j.0006-341x.2001.01191.x. [DOI] [PubMed] [Google Scholar]

[R20] Lu X, Tsiatis AA. Improving the efficiency of the log-rank test using auxiliary covariates. Biometrika. 2008;95(3):679–694. doi: 10.1093/biomet/asn003. [DOI] [Google Scholar]

[R21] Prentice RL, Pyke R. Logistic disease incidence models and case-control studies. Biometrika. 1979;66(3):403–411. doi: 10.2307/2335158. [DOI] [Google Scholar]

[R22] Prentice RL, Kalbeisch JD, Peterson JAV, Flournoy N, Farewell VT, Breslow NE. The analysis of failure times in the presence of competing risks. Biometrics. 1978;34(4):541–554. [PubMed] [Google Scholar]

[R23] Qin J. Inferences for case-control and semiparametric two-sample density ratio models. Biometrika. 1998;85(3):619–630. [Google Scholar]

[R24] Qin J, Zhang B. A goodness-of-fit test for logistic regression models based on case-control data. Biometrika. 1997;84(3):609–618. [Google Scholar]

[R25] Robins JM, Rotnitzky A, Zhao LP. Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association. 1994;89(427):846–866. [Google Scholar]

[R26] Rolland M, Tovanabutra S, deCamp AC, Frahm N, Gilbert PB, Sanders-Buell E, Heath L, Magaret CA, Bose M, Bradfield A, O’Sullivan A, Crossler J, Jones T, Nau M, Wong K, Zhao H, Raugi DN, Sorensen S, Stoddard JN, Maust BS, Deng W, Hural J, Dubey S, Michael NL, Shiver J, Corey L, Li F, Self SG, Kim J, Buchbinder S, Casimiro DR, Robertson MN, Duerr A, McElrath MJ, McCutchan FE, Mullins JI. Genetic impact of vaccination on breakthrough HIV-1 sequences from the STEP trial. Nature Medicine. 2011;17(3):366–U168. doi: 10.1038/nm.2316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] Rotnitzky A, Robins JM. Semiparametric regression estimation in the presence of dependent censoring. Biometrika. 1995;82(4):805–820. doi: 10.1093/biomet/82.4.805. [DOI] [Google Scholar]

[R28] Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–590. [Google Scholar]

[R29] Scharfstein DO, Rotnitzky A, Robins JM. Adjusting for nonignorable drop-out using semi-parametric nonresponse models: Rejoinder. Journal of the American Statistical Association. 1999;94(448):1135–1146. [Google Scholar]

[R30] Sun Y, Gilbert PB. Estimation of stratified mark-specific proportional hazards models with missing marks. Scandinavian Journal of Statistics. 2012;39:34–52. doi: 10.1111/j.1467-9469.2011.00746.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] Sun Y, Gilbert PB, McKeague IW. Proportional hazards models with continuous marks. Ann Stat. 2009;37(1):394–426. doi: 10.1214/07-AOS554. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] Sun Y, Li M, Gilbert PB. Mark-specific proportional hazards model with multivariate continuous marks and its application to HIV vaccine efficacy trials. Biostatistics. 2013;14(1):60–74. doi: 10.1093/biostatistics/kxs022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] Tan Z. A distributional approach for causal inference using propensity scores. Journal of the American Statistical Association. 2006;101(476):1619–1637. [Google Scholar]

[R34] Vardi Y. Empirical distributions in selection bias models. The Annals of Statistics. 1985;13(1):178–203. [Google Scholar]

[R35] Zhao LP, Lipsitz S, Lew D. Regression analysis with missing covariate data using estimating equations. Biometrics. 1996;52(4):1165–1182. doi: 10.2307/2532833. [DOI] [PubMed] [Google Scholar]

PERMALINK

Mark-Specific Hazard Ratio Model with Missing Multivariate Marks

Michal Juraska

Peter B Gilbert

Abstract

1 Introduction

2 Mark-specific hazard ratio model and missing marks

2.1 Notation

2.2 Assumptions

2.3 Estimand of interest

2.4 Mark-specific hazard ratio model

3 Estimation procedures in the density ratio model

3.1 Inverse probability weighted complete-case estimator

3.2 Augmented inverse probability weighted complete-case estimator

4 Asymptotic properties

4.1 Asymptotic properties of the inverse probability weighted complete-case estimator

Theorem 1

Theorem 2

4.2 Asymptotic properties of the augmented IPW complete-case estimator

Theorem 3

Theorem 4

5 Hypothesis testing

6 Simulation study

6.1 Assessment of the IPW and AUG estimation and testing procedures under correctly specified missing mark models

Fig. 1.

Fig. 2.

Table 1.

6.2 Robustness analysis of the IPW and AUG estimation and testing procedures under mis-specified missing mark models

6.3 Assessment of the IPW and AUG estimators versus a simpler complete-case single-imputation estimator when marks are missing completely at random

7 Application

Fig. 3.

Table 2.

8 Discussion

Supplementary Material

Acknowledgments

Footnotes

Contributor Information

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases