A copula model for bivariate hybrid censored survival data with application to the MACS study

Suhong Zhang; Ying Zhang; Kathryn Chaloner; Jack T Stapleton

doi:10.1007/s10985-009-9139-z

Author manuscript; available in PMC: 2013 Feb 8.

Published in final edited form as: Lifetime Data Anal. 2010 Apr;16(2):231–249. doi: 10.1007/s10985-009-9139-z


PMCID: PMC3567926 NIHMSID: NIHMS430987 PMID: 19921432

Abstract

A copula model for bivariate survival data with hybrid censoring is proposed to study the association between survival time of individuals infected with HIV and persistence time of infection with an additional virus. Survival with HIV is right censored and the persistence time of the additional virus is subject to interval censoring case 1. A pseudo-likelihood method is developed to study the association between the two event times under such hybrid censoring. Asymptotic consistency and normality of the pseudo-likelihood estimator are established based on empirical process theory. Simulation studies indicate good performance of the estimator with moderate sample size. The method is applied to a motivating HIV study which investigates the effect of GB virus type C (GBV-C) co-infection on survival time of HIV infected individuals.

Keywords: Association measure, Bivariate survival model, Copula, Current status data, Kendall's τ, Right censored data, Empirical process

1 Introduction and motivating example

This paper was motivated by the investigation of the association between survival time among HIV-infected subjects and co-infection with an additional apparently harmless virus named GB Virus Type C (or GBV-C). Several recent studies suggest that persistent co-infection of GBV-C is associated with prolonged HIV survival (for example, Xiang et al. 2001; Tillmann et al. 2001; Williams et al. 2004; Zhang et al. 2006), while this beneficial association was not significant in other studies (Toyoda et al. 1998; Birk et al. 2002).

Among all these studies, the Multicenter AIDS Cohort Study (MACS, Williams et al. 2004) is the most comprehensive study to date. It began recruiting subjects at risk for HIV infection in 1984, a time close to the beginning of the AIDS epidemic. For each subject, blood samples were taken and stored every 6 months. When diagnostic testing for HIV subsequently became available, seroconverters were identified through retrospective testing of the stored samples. Later, for a selected subset of seroconverters, two samples of stored blood were tested for GBV-C infection: one sample taken 12–18 months after the subject's first positive HIV test (HIV onset), and a second sample taken 4.5–6 years after seroconversion. The analysis conducted in Williams et al. (2004) treated all HIV survival times as right censored at January 1, 1996 to avoid confounding with the use of highly active HIV therapy that became available in 1996. They found that persistent GBV-C infection was significantly associated with prolonged survival among HIV-positive subjects at the late time (4.5–6 years after HIV onset), but not at the early time (12–18 months after HIV onset).

All previous studies compared the Kaplan-Meier survival curves between HIV-infected subjects with and without GBV-C infection at a specified time using the log-rank test. However, GBV-C viremia may clear over time and GBV-C persistence time varies among subjects. As a consequence, if GBV-C persistence time plays an essential role in its association with HIV survival then time to GBV-C clearance needs to be included in any comparison. This motivated the need to model GBV-C persistence time, rather than the status at a single time.

A Cox regression model with GBV-C status treated as a time-dependent covariate cannot be used with this MACS data set. The Cox model requires that GBV-C status be known throughout the study period (Kalbfleisch and Prentice 2002, p. 200), but GBV-C status in the MACS study is known only at baseline and at one other follow-up time.

In this paper we propose a bivariate survival model to adjust for the GBV-C persistence time since co-infection (time from HIV onset to the clearance of GBV-C). The GBV-C diagnostic test at the time close to HIV seroconversion is treated as the baseline GBV-C status, and the test at the second observation time provides current status data on GBV-C persistence time. Current status data, or interval censoring case 1 data, is a special case of interval censoring when it is only feasible to know whether or not an event (clearance) has occurred at a monitoring time (Groeneboom and Wellner 1992).

Bivariate and multivariate survival data have been studied extensively in the statistical literature. Liang et al. (1995) and Oakes (2000) reviewed some recent developments in the analysis of multivariate failure time data. Copula-based survival models are considered, for example, by Hougaard (1989), Oakes (1989), Shih and Louis (1995) and Wang and Ding (2000). Shih and Louis (1995) examined the association of bivariate data in which both event times are subject to right censoring, through a two-stage semiparametric estimation procedure. At the first stage of their procedure, the marginal survival functions are estimated consistently by nonparametric maximum likelihood estimators. At the second stage, a dependency structure is imposed by a copula model, and the nonparametric maximum likelihood estimators of the two marginal survival functions are substituted into the likelihood function to form a pseudo-likelihood; the association parameter is then estimated by maximizing this pseudo-likelihood. Wang and Ding (2000) proposed a parallel two-stage semiparametric method for bivariate current status data. Both papers showed that the proposed estimators of the association measure converge in distribution to normal random variables at the $n^{1/2}$ rate, but without first demonstrating consistency, which is required in the proof of asymptotic normality.

In this paper, we model the association of bivariate event times using a copula model and estimate the association parameter through the two-stage procedure as well. We focus specifically on the data structure where one of the paired event time data is right-censored and the other is observed as current status data, as observed in the MACS study. Our main goal in this paper is to develop an inference procedure to study the association of bivariate survival data with this type of censoring structure and to apply the proposed method to investigate the association between HIV survival and GBV-C persistence time.

The rest of this paper is organized as follows. Section 2 introduces a theoretical model and describes a two-stage semiparametric estimation procedure for the association parameter. Section 3 states asymptotic properties of the association parameter estimator. Section 4 presents simulation studies. Section 5 applies the proposed estimation method to the MACS GBV-C study. Finally, Section 6 summarizes the method with some remarks. Technical details are provided in the Appendix.

2 Likelihood and estimation method

In what follows the usual cumulative distribution function is defined as F(t) = P(T ≤ t) and the corresponding survival function is defined as S(t) = P(T > t) = 1 – F(t).

Let $T_{1}^{0}$ be the HIV survival time and $T_{2}^{0}$ be the GBV-C persistence time. Assume the distributions of $T_{1}^{0}$ and $T_{2}^{0}$ are continuous. Let S_j and F_j, j = 1, 2, be the survival function and distribution function of $T_{j}^{0}$, respectively. Denote by F(t₁, t₂) and S(t₁, t₂) the joint distribution function and joint survival function of $(T_{1}^{0}, T_{2}^{0})$, respectively. We propose to model the joint survival function S(t₁, t₂) by the one-parameter Archimedean copula C_α:

$$C_{\alpha} : [0, 1]^{2} \to [0, 1] \quad \text{that satisfies} \quad S_{\alpha}(t_{1}, t_{2}) = C_{\alpha}(S_{1}(t_{1}), S_{2}(t_{2})).$$

The joint distribution function F_α(t₁, t₂) can therefore be expressed as F_α(t₁, t₂) = 1 – S₁(t₁) – S₂(t₂) + S_α(t₁, t₂).

Examples of various one-parameter Archimedean copula models are discussed in Nelsen (2006). As Kendall's τ is related to the copula by τ = 4E{C_α(U, V)} – 1, where (U, V) has joint distribution function C_α (Nelsen 2006), the parameter α is naturally linked to the association between the two random variables with the marginal survival functions given by S₁ and S₂, respectively. Therefore, inference for the association between the two event times can be made through inference about α.

We consider bivariate survival data with hybrid censoring in which $T_{1}^{0}$ is right censored by a random variable C₁ and $T_{2}^{0}$ is subject to interval censoring case 1 by a random monitoring time C₂. Suppose we have collected a random sample $(T_{1i}, T_{2i}, \Delta_{1i}, \Delta_{2i})$, i = 1, 2, . . . , n, from a distribution with density function f(t₁, t₂, δ₁, δ₂), where $T_{1i} = T_{1i}^{0} \wedge C_{1i}$, $T_{2i} = C_{2i}$, $\Delta_{1i} = 1_{[T_{1i}^{0} \leq C_{1i}]}$ and $\Delta_{2i} = 1_{[T_{2i}^{0} \leq C_{2i}]}$. We consider the scenario of independent and non-informative censoring, i.e., $(T_{1}^{0}, T_{2}^{0})$ is independent of (C₁, C₂), and the distribution of (C₁, C₂) carries no information about any parameter of the joint distribution of $(T_{1}^{0}, T_{2}^{0})$. We also denote by G_j(t) the marginal distribution function of C_j, with density function g_j(t), for j = 1, 2.

The density function f (t₁, t₂, δ₁, δ₂) can be explicitly written for four distinct cases with respect to Lebesgue measure. Combining the four cases and discarding the parts that are non-informative to the joint distribution of $T_{1}^{0}$ and $T_{2}^{0}$ , we can derive the likelihood for n independently and identically distributed observations.

Let $C_{1 α} (u, v) = \frac{\partial}{\partial u} C_{α} (u, v)$ . Given the marginal survival functions S₁ and S₂, the likelihood for α, omitting parts that are irrelevant in estimating α, is

$$
\begin{aligned}
L(\alpha, S_{1}, S_{2}; \text{data}) = \prod_{i=1}^{n} & \left[1 - C_{1\alpha}(S_{1}(t_{1i}), S_{2}(t_{2i}))\right]^{\delta_{1i}\delta_{2i}} \left[C_{1\alpha}(S_{1}(t_{1i}), S_{2}(t_{2i}))\right]^{\delta_{1i}(1 - \delta_{2i})} \\
& \times \left[S_{1}(t_{1i}) - C_{\alpha}(S_{1}(t_{1i}), S_{2}(t_{2i}))\right]^{(1 - \delta_{1i})\delta_{2i}} \left[C_{\alpha}(S_{1}(t_{1i}), S_{2}(t_{2i}))\right]^{(1 - \delta_{1i})(1 - \delta_{2i})}.
\end{aligned}
\tag{1}
$$

A two-stage maximum pseudo-likelihood estimation approach is developed to estimate α. The first stage involves the estimation of marginal survival functions for censored data. The marginal survival function S₁ is estimated by the Kaplan-Meier estimator Ŝ₁ and S₂ is estimated by the nonparametric maximum likelihood estimator Ŝ₂, using the Convex Minorant Algorithm described by Groeneboom and Wellner (1992).
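As an illustration of the first stage, the following minimal Python sketch computes a Kaplan-Meier estimate for right-censored data and a nonparametric maximum likelihood estimate for current status data. It is not the authors' implementation. The function names `kaplan_meier` and `current_status_npmle` are hypothetical, and the second function uses the pool-adjacent-violators algorithm, which gives the same isotonic solution as the slopes of the greatest convex minorant of the cumulative sum diagram, in place of the Convex Minorant Algorithm named above. Monitoring times are assumed to be distinct.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier estimate; returns (event time, S just after it) pairs."""
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    steps = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Collect everyone whose observed time equals t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths > 0:
            surv *= 1.0 - deaths / at_risk
            steps.append((t, surv))
        at_risk -= leaving
    return steps


def current_status_npmle(monitor_times, deltas):
    """NPMLE of S2 at the monitoring times via pool-adjacent-violators.

    deltas[i] = 1 if the event had already occurred at monitor_times[i].
    Assumes distinct monitoring times (an assumption of this sketch).
    """
    order = sorted(range(len(monitor_times)), key=lambda i: monitor_times[i])
    blocks = []  # each block is [sum of deltas, number of observations]
    for i in order:
        blocks.append([float(deltas[i]), 1])
        # Pool neighbouring blocks while their means violate monotonicity.
        while len(blocks) > 1 and (
            blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]
        ):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
    fitted_cdf = []
    for total, count in blocks:
        fitted_cdf.extend([total / count] * count)
    # Report the survival function, 1 - F, at the sorted monitoring times.
    return [(monitor_times[i], 1.0 - f) for i, f in zip(order, fitted_cdf)]
```

Both functions return step-function values at the observed times; evaluating the estimates at arbitrary times would require an extra lookup step that is omitted here.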

At the second stage, Ŝ₁(t) and Ŝ₂(t) are substituted into the likelihood (1), and the resulting pseudo-likelihood is then maximized with respect to α. The maximum pseudo-likelihood estimator ${\hat{α}}_{n}$ is the solution to the pseudo score equation:

$$U_{\alpha}(\alpha, \hat{S}_{1}, \hat{S}_{2}; \text{data}) = \sum_{i=1}^{n} \frac{\partial}{\partial \alpha} l(\alpha, \hat{S}_{1}(t_{1i}), \hat{S}_{2}(t_{2i}), \delta_{1i}, \delta_{2i}) = 0, \tag{2}$$

where

$$
\begin{aligned}
l(\alpha, \hat{S}_{1}(t_{1}), \hat{S}_{2}(t_{2}), \delta_{1}, \delta_{2}) ={} & \delta_{1}\delta_{2} \log\left(1 - C_{1\alpha}(\hat{S}_{1}(t_{1}), \hat{S}_{2}(t_{2}))\right) + \delta_{1}(1 - \delta_{2}) \log C_{1\alpha}(\hat{S}_{1}(t_{1}), \hat{S}_{2}(t_{2})) \\
& + (1 - \delta_{1})\delta_{2} \log\left(\hat{S}_{1}(t_{1}) - C_{\alpha}(\hat{S}_{1}(t_{1}), \hat{S}_{2}(t_{2}))\right) + (1 - \delta_{1})(1 - \delta_{2}) \log C_{\alpha}(\hat{S}_{1}(t_{1}), \hat{S}_{2}(t_{2})).
\end{aligned}
\tag{3}
$$
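The log pseudo-likelihood (3) is straightforward to evaluate once a copula family is fixed. The Python sketch below is an illustration only, assuming the Gumbel copula that is used later in the simulation study. The function names, the clamping constant `EPS`, the search interval [1, 20] and the golden-section search are all choices made for this example rather than part of the proposed method, and the search assumes the criterion is unimodal on the interval.

```python
import math

EPS = 1e-8  # clamp that keeps the logarithms finite (a choice of this sketch)


def gumbel_c(u, v, alpha):
    """Gumbel copula C_alpha(u, v)."""
    x, y = -math.log(u), -math.log(v)
    return math.exp(-((x ** alpha + y ** alpha) ** (1.0 / alpha)))


def gumbel_c1(u, v, alpha):
    """Partial derivative of C_alpha(u, v) with respect to u."""
    x, y = -math.log(u), -math.log(v)
    a = x ** alpha + y ** alpha
    c = math.exp(-(a ** (1.0 / alpha)))
    return c * a ** (1.0 / alpha - 1.0) * x ** (alpha - 1.0) / u


def log_pl_term(alpha, s1, s2, d1, d2):
    """One observation's contribution to (3), given estimated S1 and S2."""
    u = min(max(s1, EPS), 1.0 - EPS)
    v = min(max(s2, EPS), 1.0 - EPS)
    if d1 == 1 and d2 == 1:
        value = 1.0 - gumbel_c1(u, v, alpha)
    elif d1 == 1:
        value = gumbel_c1(u, v, alpha)
    elif d2 == 1:
        value = u - gumbel_c(u, v, alpha)
    else:
        value = gumbel_c(u, v, alpha)
    return math.log(max(value, EPS))


def log_pseudo_likelihood(alpha, obs):
    """Sum of (3) over observations (s1, s2, d1, d2)."""
    return sum(log_pl_term(alpha, *o) for o in obs)


def golden_section_max(f, lo, hi, tol=1e-6):
    """Maximize a unimodal function f on [lo, hi]."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c, d = b - g * (b - a), a + g * (b - a)
    fc, fd = f(c), f(d)
    while b - a > tol:
        if fc > fd:
            b, d, fd = d, c, fc
            c = b - g * (b - a)
            fc = f(c)
        else:
            a, c, fc = c, d, fd
            d = a + g * (b - a)
            fd = f(d)
    return 0.5 * (a + b)


def maximize_alpha(obs, lo=1.0, hi=20.0):
    """Maximum pseudo-likelihood estimate of alpha on [lo, hi]."""
    return golden_section_max(lambda a: log_pseudo_likelihood(a, obs), lo, hi)
```

Here each element of `obs` is a tuple (Ŝ₁(t₁), Ŝ₂(t₂), δ₁, δ₂) built from the first-stage estimates. Maximizing the criterion directly is used in place of solving the score equation (2), which avoids coding the derivative with respect to α.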

Note that the pseudo likelihood approach was previously adopted by Shih and Louis (1995) in an association study of bivariate right censored data and by Wang and Ding (2000) in a study of association between two event times with both subject to interval censoring case 1 (Groeneboom and Wellner 1992).

3 Asymptotic properties of the maximum pseudo-likelihood estimator ${\hat{α}}_{n}$

Let T₁ and T₂ take values on [0, t₀₁] × [0, t₀₂], where t₀₁ = sup {t : P(T₁ > t, C₁ > t) > 0} and t₀₂ = sup {t : P(C₂ > t) > 0}. Suppose α is in an open set A in the real line. Let D denote a universal constant throughout the rest of the technical development.

For brevity, we define the following notation:

$$
\begin{aligned}
V_{\alpha}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \frac{\partial}{\partial \alpha} l(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) \\
V_{\alpha^{2}}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \frac{\partial^{2}}{\partial \alpha^{2}} l(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) \\
V_{\alpha, 1}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \left. \frac{\partial^{2}}{\partial \alpha \, \partial u} l(\alpha, u, S_{2}(t_{2}), \delta_{1}, \delta_{2}) \right|_{u = S_{1}(t_{1})} \\
V_{\alpha, 2}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \left. \frac{\partial^{2}}{\partial \alpha \, \partial v} l(\alpha, S_{1}(t_{1}), v, \delta_{1}, \delta_{2}) \right|_{v = S_{2}(t_{2})} \\
V_{\alpha^{2}, 1}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \left. \frac{\partial^{3}}{\partial \alpha^{2} \, \partial u} l(\alpha, u, S_{2}(t_{2}), \delta_{1}, \delta_{2}) \right|_{u = S_{1}(t_{1})} \\
V_{\alpha^{2}, 2}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \left. \frac{\partial^{3}}{\partial \alpha^{2} \, \partial v} l(\alpha, S_{1}(t_{1}), v, \delta_{1}, \delta_{2}) \right|_{v = S_{2}(t_{2})} \\
V_{\alpha, 1^{2}}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \left. \frac{\partial^{3}}{\partial \alpha \, \partial u^{2}} l(\alpha, u, S_{2}(t_{2}), \delta_{1}, \delta_{2}) \right|_{u = S_{1}(t_{1})} \\
V_{\alpha, 1, 2}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \left. \frac{\partial^{3}}{\partial \alpha \, \partial u \, \partial v} l(\alpha, u, v, \delta_{1}, \delta_{2}) \right|_{u = S_{1}(t_{1}),\, v = S_{2}(t_{2})} \\
V_{\alpha, 2^{2}}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) &= \left. \frac{\partial^{3}}{\partial \alpha \, \partial v^{2}} l(\alpha, S_{1}(t_{1}), v, \delta_{1}, \delta_{2}) \right|_{v = S_{2}(t_{2})}
\end{aligned}
$$

To study the asymptotic properties of ${\hat{α}}_{n}$ , we need the following regularity conditions. Some of the conditions are related to the smoothness of the copula models and the likelihood.

(A1) l(α, S₁(t₁), S₂(t₂), δ₁, δ₂) is three times differentiable with respect to α on [0, t₀₁] × [0, t₀₂], for each α ∈ A, and all derivatives are continuous and uniformly bounded by some constant D.

(A2) $V_{\alpha, 1}$, $V_{\alpha, 2}$, $V_{\alpha^{2}, 1}$, $V_{\alpha^{2}, 2}$, $V_{\alpha, 1^{2}}$, $V_{\alpha, 1, 2}$ and $V_{\alpha, 2^{2}}$, each evaluated at (α, S₁(t₁), S₂(t₂), δ₁, δ₂), exist and are uniformly bounded by some constant D on [0, t₀₁] × [0, t₀₂], for all α ∈ A and survival functions S₁ and S₂.

(A3) For each α ∈ A, 0 < E_α[V_α(α, S₁(T₁), S₂(T₂), Δ₁, Δ₂)]² < ∞.

(A4) F₂ and G₂ are absolutely continuous with respect to each other.

(A5) $(\psi_{2} / g_{2}) \circ S_{2}^{-1}$ is bounded and Lipschitz on [0, 1], where ψ₂ is the derivative of the influence curve IC₂(t₂), defined by

$$IC_{2}(t_{2}) = - \int_{0}^{t_{2}} \int_{0}^{t_{01}} V_{\alpha, 2}(\alpha_{0}, S_{1}(\tau_{1}), S_{2}(\tau_{2}), \delta_{1}, \delta_{2}) \, dP(\tau_{1}, \tau_{2}, \delta_{1}, \delta_{2}).$$

(A6) S₂, g₂ and ψ₂ satisfy

$$\int_{0}^{t_{02}} \frac{S_{2}(t_{2})(1 - S_{2}(t_{2}))}{g_{2}(t_{2})} \psi_{2}(t_{2}) \, dt_{2} < \infty.$$

Remarks

Conditions (A1) and (A2) require the log likelihood to be differentiable with respect to the unknown parameters. These conditions can be easily but tediously verified for Archimedean copulas. Condition (A3) indicates that the log likelihood has finite nonzero information about α when the marginal survival functions are known, which is usually required in parametric maximum likelihood theory. Conditions (A4)–(A6) are the regularity conditions given by Huang and Wellner (1995) in studying the asymptotic normality of linear functionals of the nonparametric maximum likelihood estimator of S₂ with current status data. These regularity conditions are generally mild for applications.

The following two lemmas are important for studying the asymptotic properties of ${\hat{α}}_{n}$.

Lemma 1 Let $F_{j} = \{f : f \text{ is a survival function on } [0, t_{0j}]\}$, j = 1, 2, and let $G_{F} = \{V_{\alpha, 1}(\alpha, f_{1}(t_{1}), f_{2}(t_{2}), \delta_{1}, \delta_{2}) : f_{j} \in F_{j}, j = 1, 2\}$. Let P denote the probability measure of (T₁, T₂, Δ₁, Δ₂). Then under conditions (A1)–(A2), $G_{F}$ is a P-Glivenko-Cantelli class for all α ∈ A.

Lemma 2 Let $F_{j} = \{f : f \text{ is a survival function on } [0, t_{0j}]\}$, j = 1, 2, and let $H_{F} = \{V_{\alpha}(\alpha, f_{1}(t_{1}), f_{2}(t_{2}), \delta_{1}, \delta_{2}) - V_{\alpha}(\alpha, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2}) : f_{j} \in F_{j}, j = 1, 2\}$. Let P denote the probability measure of (T₁, T₂, Δ₁, Δ₂). Then under conditions (A1)–(A2), $H_{F}$ is a P-Donsker class for all α ∈ A.

Based on these two lemmas, the maximum pseudo-likelihood estimator ${\hat{α}}_{n}$ can be shown to be consistent and asymptotically normally distributed. The results are summarized in the following two theorems.

Theorem 1 Assume that the joint distribution of $(T_{1}^{0}, T_{2}^{0})$ follows an Archimedean copula model with the true association parameter α = α₀. Under the regularity conditions (A1)–(A2), ${\hat{α}}_{n} \overset{p}{\to} α_{0}$ as n → ∞.

Theorem 2 Under the regularity conditions (A1)–(A6), $\sqrt{n} ({\hat{α}}_{n} - α_{0}) \overset{d}{\to} N (0, σ^{2})$ , where

$$\sigma^{2} = \frac{\operatorname{Var}(Q(T_{1}, T_{2}, \Delta_{1}, \Delta_{2}; \alpha_{0}, S_{1}, S_{2}))}{W^{2}(\alpha_{0}, S_{1}, S_{2})}$$

with

$$
\begin{aligned}
W(\alpha_{0}, S_{1}, S_{2}) &= - \int \left[V_{\alpha}(\alpha_{0}, S_{1}(t_{1}), S_{2}(t_{2}), \delta_{1}, \delta_{2})\right]^{2} dP(t_{1}, t_{2}, \delta_{1}, \delta_{2}) \\
Q(T_{1}, T_{2}, \Delta_{1}, \Delta_{2}; \alpha_{0}, S_{1}, S_{2}) &= V_{\alpha}(\alpha_{0}, S_{1}(T_{1}), S_{2}(T_{2}), \Delta_{1}, \Delta_{2}) + I_{1}(T_{1}, \Delta_{1}; \alpha_{0}) - \tilde{l}(T_{2}, \Delta_{2}; S_{2}, G_{2}, \psi_{2}),
\end{aligned}
$$

in which

$$I_{1}(T_{1}, \Delta_{1}; \alpha_{0}) = \int_{0}^{t_{01}} \int_{0}^{t_{02}} M_{\alpha, 1}(\alpha_{0}, S_{1}(\tau_{1}), S_{2}(\tau_{2})) f(\tau_{1}, \tau_{2}) I_{1}^{0}(T_{1}, \Delta_{1})(\tau_{1}) \, d\tau_{1} \, d\tau_{2}$$

and

$$\tilde{l}(T_{2}, \Delta_{2}; S_{2}, G_{2}, \psi_{2}) = - \left[\Delta_{2} - (1 - S_{2}(T_{2}))\right] \frac{\psi_{2}(T_{2})}{g_{2}(T_{2})} I[g_{2}(T_{2}) > 0],$$

where

$$M_{\alpha, 1}(\alpha_{0}, S_{1}(t_{1}), S_{2}(t_{2})) = - E\left\{V_{\alpha, 1}(\alpha_{0}, S_{1}(T_{1}), S_{2}(T_{2}), \Delta_{1}, \Delta_{2}) \mid T_{1} = t_{1}, T_{2} = t_{2}\right\}$$

and

$$I_{1}^{0}(T_{1}, \Delta_{1})(t_{1}) = - S_{1}(t_{1}) \left\{\int_{0}^{t_{1}} \frac{1}{P(T_{1} \geq u)} \, dN_{1}(u) - \int_{0}^{t_{1}} \frac{I[T_{1} \geq u]}{P(T_{1} \geq u)} \, d\Lambda_{1}(u)\right\}.$$

Here N₁(u) is defined as I[T₁ ≤ u, Δ₁ = 1] and Λ₁ is the cumulative hazard function of $T_{1}^{0}$.

The proofs of these lemmas and theorems are provided in the Appendix.

4 Simulation studies

Simulation studies are conducted to evaluate the finite sample performance of the proposed method. A Gumbel copula, a special case of Archimedean copulas, defined by

$$C_{\alpha}(u, v) = \exp\left\{- \left[(- \log u)^{\alpha} + (- \log v)^{\alpha}\right]^{1/\alpha}\right\}, \quad \alpha \geq 1, \quad 0 \leq u, v \leq 1$$

is used to generate the bivariate event time data, in which the two marginal distributions are both assumed to be exponential with unit rate. For the Gumbel copula, a larger α corresponds to a stronger positive association and α = 1 corresponds to the case in which the two event times are independent.

A sample of bivariate copula random variables is generated based on their conditional distribution function. Suppose that the joint distribution of the bivariate data $(T_{1}^{0}, T_{2}^{0})$ is C_α(1 – exp(–t₁), 1 – exp(–t₂)). We generate $(T_{1}^{0}, T_{2}^{0})$ through the following steps:

– Generate two independent uniform (0, 1) random variables u, w.
– Set w = P(V ≤ v | U = u) = ∂C_α(u, v)/∂u and solve for v.
– Set $T_{1}^{0} = - \log (1 - u)$, $T_{2}^{0} = - \log (1 - v)$.
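The three steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' simulation code: the function names are hypothetical, the equation in the second step is solved by bisection (one of several possible root finders), and the uniforms are clamped slightly away from 0 and 1 so that the logarithms stay finite.

```python
import math
import random


def gumbel_cond_cdf(u, v, alpha):
    """P(V <= v | U = u) = dC_alpha(u, v)/du for the Gumbel copula."""
    x, y = -math.log(u), -math.log(v)
    a = x ** alpha + y ** alpha
    c = math.exp(-(a ** (1.0 / alpha)))
    return c * a ** (1.0 / alpha - 1.0) * x ** (alpha - 1.0) / u


def solve_v(u, w, alpha, tol=1e-10):
    """Solve w = dC_alpha(u, v)/du for v by bisection (increasing in v)."""
    lo, hi = 1e-12, 1.0 - 1e-12
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gumbel_cond_cdf(u, mid, alpha) < w:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def simulate_event_times(n, alpha, rng):
    """Draw n pairs of event times with unit exponential margins."""
    pairs = []
    for _ in range(n):
        # Step 1: two independent uniforms (u clamped for the logarithms).
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        w = rng.random()
        # Step 2: invert the conditional distribution of V given U = u.
        v = solve_v(u, w, alpha)
        # Step 3: transform to exponential event times.
        pairs.append((-math.log(1.0 - u), -math.log(1.0 - v)))
    return pairs


sample = simulate_event_times(5, alpha=2.0, rng=random.Random(2010))
```

With α = 1 the conditional distribution reduces to w = v, so the two event times are generated independently, which is a convenient sanity check.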

Meanwhile, a sample of bivariate censoring times (C₁ and C₂) are each independently drawn from a uniform distribution on [0, 2.3]. In this setting, about 50% of $T_{1}^{0}$ is right censored by C₁, and about 50% of $T_{2}^{0}$ is subject to interval censoring case 1 by C₂ as well.
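The hybrid censoring applied to each simulated pair follows the definitions in Section 2: the first event time is right censored and the second is seen only through its status at the monitoring time. A short Python sketch, with the hypothetical helper names `hybrid_censor` and `censor_sample`, could look like this.

```python
import random


def hybrid_censor(t1, t2, c1, c2):
    """Return the observed tuple (T1, T2, Delta1, Delta2)."""
    observed_t1 = min(t1, c1)          # right censoring of the first time
    delta1 = 1 if t1 <= c1 else 0      # 1 if the first event was observed
    delta2 = 1 if t2 <= c2 else 0      # 1 if the second event occurred by c2
    return (observed_t1, c2, delta1, delta2)


def censor_sample(event_times, rng, upper=2.3):
    """Apply independent uniform [0, upper] censoring times to each pair."""
    observed = []
    for t1, t2 in event_times:
        c1 = rng.uniform(0.0, upper)
        c2 = rng.uniform(0.0, upper)
        observed.append(hybrid_censor(t1, t2, c1, c2))
    return observed
```

Note that the second observed time is always the monitoring time itself, so the exact value of the second event time is never recorded.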

Kendall's τ is chosen as a global association measure. For the Gumbel copula, τ = 1 – 1/α. Three different values of α are set such that the corresponding Kendall's τ is 0.25, 0.5, and 0.75. For each value of α, we conduct Monte-Carlo simulations with 1,000 replications for sample size n = 50, 100, 200 and 400, respectively.
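For reference, the mapping between Kendall's τ and the Gumbel parameter stated above inverts to α = 1/(1 – τ). The small Python sketch below, with hypothetical function names, recovers the three values of α used in the simulations.

```python
def gumbel_tau_from_alpha(alpha):
    """Kendall's tau for the Gumbel copula, tau = 1 - 1/alpha."""
    if alpha < 1.0:
        raise ValueError("the Gumbel copula requires alpha >= 1")
    return 1.0 - 1.0 / alpha


def gumbel_alpha_from_tau(tau):
    """Inverse mapping, alpha = 1 / (1 - tau), for 0 <= tau < 1."""
    if not 0.0 <= tau < 1.0:
        raise ValueError("tau must lie in [0, 1) for the Gumbel copula")
    return 1.0 / (1.0 - tau)


alphas = [gumbel_alpha_from_tau(t) for t in (0.25, 0.5, 0.75)]
```

The resulting values are 4/3, 2 and 4.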

For each of the 1,000 replications, a Wald confidence interval is constructed based on the asymptotic normality, with the standard error of ${\hat{α}}_{n}$ computed from 200 bootstrap resamples. The empirical coverage probability is the proportion of the 1,000 Wald confidence intervals that contain the true parameter value.
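The bootstrap standard error and the Wald interval can be sketched generically in Python. The code below is an assumption-laden illustration, not the authors' code: `estimator` stands for any function that maps a data set to a scalar estimate (for example, a full two-stage fit), and the names `bootstrap_se` and `wald_ci` are hypothetical.

```python
import math
import random


def bootstrap_se(data, estimator, n_boot=200, rng=None):
    """Standard deviation of the estimator over nonparametric resamples."""
    rng = rng or random.Random()
    n = len(data)
    estimates = []
    for _ in range(n_boot):
        # Resample n observations with replacement and re-estimate.
        resample = [data[rng.randrange(n)] for _ in range(n)]
        estimates.append(estimator(resample))
    mean = sum(estimates) / n_boot
    variance = sum((e - mean) ** 2 for e in estimates) / (n_boot - 1)
    return math.sqrt(variance)


def wald_ci(estimate, se, z=1.96):
    """Two-sided Wald interval, estimate +/- z * se (z = 1.96 for 95%)."""
    return (estimate - z * se, estimate + z * se)
```

In the two-stage setting, each resample would be passed through both stages again, so that the variability of the estimated marginal survival functions is reflected in the standard error.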

Table 1 summarizes the simulation results for the two-stage pseudo-likelihood estimator. It provides results for estimation bias, Monte-Carlo standard deviation of 1,000 replicates as the empirical standard error (ese), mean of bootstrap standard error (bse), and 95% empirical coverage probability (ecp).

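Table 1. Simulation results of the two-stage maximum pseudo-likelihood estimator based on 1,000 Monte-Carlo samples with sample size ranged from 50 to 400 for α = 4/3, 2, 4

| Setting | Statistic | n = 50, $\hat{\alpha}_{n}$ | n = 50, $\hat{\tau}_{n}$ | n = 100, $\hat{\alpha}_{n}$ | n = 100, $\hat{\tau}_{n}$ | n = 200, $\hat{\alpha}_{n}$ | n = 200, $\hat{\tau}_{n}$ | n = 400, $\hat{\alpha}_{n}$ | n = 400, $\hat{\tau}_{n}$ |
|---|---|---|---|---|---|---|---|---|---|
| τ = 0.25, α = 1.333 | Bias | 0.219 | 0.043 | 0.059 | 0.013 | 0.023 | 0.005 | -0.005 | -0.002 |
| | ese | 0.845 | 0.172 | 0.233 | 0.113 | 0.142 | 0.076 | 0.098 | 0.055 |
| | bse | 8.799 | 0.161 | 0.334 | 0.109 | 0.154 | 0.077 | 0.099 | 0.054 |
| | 95% ecp | 0.968 | | 0.966 | | 0.963 | | 0.954 | |
| τ = 0.50, α = 2.0 | Bias | 1.120 | 0.051 | 0.194 | 0.021 | 0.102 | 0.014 | 0.032 | 0.003 |
| | ese | 8.090 | 0.158 | 0.563 | 0.098 | 0.320 | 0.070 | 0.208 | 0.050 |
| | bse | 26.276 | 0.156 | 2.523 | 0.101 | 0.359 | 0.069 | 0.213 | 0.048 |
| | 95% ecp | 0.985 | | 0.976 | | 0.966 | | 0.957 | |
| τ = 0.75, α = 4.0 | Bias | 9.460 | 0.176 | 0.695 | 0.037 | 0.189 | 0.017 | 0.058 | 0.004 |
| | ese | 51.64 | 0.117 | 4.305 | 0.081 | 1.002 | 0.054 | 0.646 | 0.038 |
| | bse | 60.48 | 0.124 | 15.726 | 0.079 | 1.597 | 0.054 | 0.696 | 0.038 |
| | 95% ecp | 0.991 | | 0.980 | | 0.974 | | 0.959 | |

The 95% ecp is reported once per sample size and is shown in the first column of each pair.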

As the sample size increases, for a wide range of α, the biases of both ${\hat{α}}_{n}$ and ${\hat{τ}}_{n}$ decrease considerably, as do the Monte-Carlo standard deviation and the bootstrap standard error. In addition, the empirical coverage probability approaches the nominal level, and the Monte-Carlo standard deviation and the mean of the bootstrap standard error get closer to each other.

With the same sample size, the stronger the dependency, the bigger the bias and the standard error of the estimator ${\hat{α}}_{n}$, as greater variation is usually expected for larger values. Therefore, a larger sample size is needed to achieve reasonable performance of ${\hat{α}}_{n}$ when a strong association exists. Interestingly, we observe that the standard deviation of ${\hat{τ}}_{n}$ decreases as the association becomes stronger. This may be explained by the standard delta method: since τ = 1 – 1/α, we have dτ/dα = 1/α², which implies that $\sigma_{\hat{\tau}_{n}} \approx \sigma_{\hat{\alpha}_{n}} / \alpha^{2}$ when the sample size is large. The simulations demonstrate that this relationship approximately holds when n ≥ 200. We also note that the average bootstrap standard error of the estimated Kendall's τ is very close to the Monte-Carlo standard deviation when n ≥ 100, particularly if the association is not strong. This suggests that inference about Kendall's τ will be reasonably good when n ≥ 100.
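The delta-method relationship can be checked against the reported simulation results. The Python sketch below, with a hypothetical function name, divides the empirical standard error of the α estimator at n = 400 (taken from Table 1) by α² for each of the three settings.

```python
def tau_se_from_alpha_se(alpha, alpha_se):
    """Delta-method standard error of tau = 1 - 1/alpha (Gumbel copula)."""
    return alpha_se / alpha ** 2


# Empirical standard errors (ese) of the alpha estimator at n = 400, Table 1.
table1_n400 = {4.0 / 3.0: 0.098, 2.0: 0.208, 4.0: 0.646}
approx = {a: tau_se_from_alpha_se(a, se) for a, se in table1_n400.items()}
```

The approximations come out at roughly 0.055, 0.052 and 0.040, against the reported empirical standard errors of 0.055, 0.050 and 0.038 for the τ estimator.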

In addition to computing the proposed two-stage maximum pseudo-likelihood estimator ${\hat{α}}_{n}$, we also compute ${\tilde{α}}_{n}$, the maximum likelihood estimator when the two marginal distributions are completely specified. The latter estimator serves as a benchmark for evaluating the performance of the maximum pseudo-likelihood estimator. Table 2 gives the results for ${\tilde{α}}_{n}$ and ${\tilde{τ}}_{n}$, the maximum likelihood estimators of α and τ, respectively, when the two marginal survival functions are known. The maximum likelihood estimators perform better than the proposed maximum pseudo-likelihood estimators, as expected, but the differences are substantially reduced as the sample size increases, say for n ≥ 200. The small difference between the two estimators supports the use of the two-stage pseudo-likelihood estimation procedure, which gains flexibility by not modeling the marginal distributions while maintaining high estimation efficiency at a reasonable sample size.

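Table 2. Simulation results of maximum likelihood analysis (S₁ and S₂ are known) based on 1,000 Monte-Carlo samples with sample size ranged from 50 to 400 for α = 4/3, 2, 4

| Setting | Statistic | n = 50, $\tilde{\alpha}_{n}$ | n = 50, $\tilde{\tau}_{n}$ | n = 100, $\tilde{\alpha}_{n}$ | n = 100, $\tilde{\tau}_{n}$ | n = 200, $\tilde{\alpha}_{n}$ | n = 200, $\tilde{\tau}_{n}$ | n = 400, $\tilde{\alpha}_{n}$ | n = 400, $\tilde{\tau}_{n}$ |
|---|---|---|---|---|---|---|---|---|---|
| τ = 0.25, α = 1.333 | Bias | 0.065 | 0.031 | 0.019 | 0.011 | 0.016 | 0.004 | -0.004 | -0.001 |
| | ese | 0.334 | 0.145 | 0.192 | 0.102 | 0.136 | 0.076 | 0.097 | 0.053 |
| | bse | 1.253 | 0.136 | 0.218 | 0.101 | 0.136 | 0.073 | 0.094 | 0.053 |
| | 95% ecp | 0.940 | | 0.942 | | 0.954 | | 0.949 | |
| τ = 0.50, α = 2.0 | Bias | 0.302 | 0.036 | 0.069 | 0.018 | 0.022 | 0.005 | -0.009 | -0.002 |
| | ese | 1.360 | 0.141 | 0.455 | 0.096 | 0.288 | 0.068 | 0.190 | 0.047 |
| | bse | 8.808 | 0.132 | 0.811 | 0.093 | 0.288 | 0.068 | 0.196 | 0.047 |
| | 95% ecp | 0.965 | | 0.951 | | 0.939 | | 0.952 | |
| τ = 0.75, α = 4.0 | Bias | 7.136 | 0.160 | 0.539 | 0.030 | 0.164 | 0.010 | 0.014 | 0.001 |
| | ese | 41.24 | 0.104 | 3.830 | 0.073 | 0.975 | 0.050 | 0.635 | 0.038 |
| | bse | 44.55 | 0.089 | 12.353 | 0.067 | 1.590 | 0.049 | 0.659 | 0.036 |
| | 95% ecp | 0.971 | | 0.963 | | 0.960 | | 0.940 | |

The 95% ecp is reported once per sample size and is shown in the first column of each pair.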

5 Application to the motivating example

We apply the proposed method to the sub-cohort of MACS from Williams et al. (2004) to study the association of GBV-C persistence time and HIV survival. MACS consists of gay men who were enrolled between 1984 and 1990 and whose blood samples were obtained every 6 months and tested retrospectively when a test for HIV became available. The sub-cohort includes 271 subjects from MACS who were HIV negative when they entered the study but became HIV positive during follow-up. Since the visits were scheduled every 6 months, the seroconversion time is known to be within a six-month window. Seroconversion time is imputed as the midpoint between the last seronegative visit and the first seropositive visit. All 271 subjects were evaluated at 12–18 months after HIV seroconversion for evidence of GBV-C infection, and a subgroup of 138 patients was re-examined 4.5–6 years after HIV seroconversion. The study only included data collected before Jan 1, 1996 to avoid the impact of the use of highly active antiretroviral therapy.

Williams et al. (2004) compared the Kaplan-Meier curves for the survival time of the HIV subjects with and without GBV-C co-infection at disease onset and found no significant difference at level 0.05. Here we consider the association between GBV-C persistence time and HIV survival among people who were co-infected with both HIV and GBV-C at HIV onset. HIV survival is defined as the time from seroconversion to death, and GBV-C persistence time is defined as the time from HIV seroconversion to GBV-C clearance for subjects who were GBV-C positive at HIV onset. Previous clinical and laboratory studies suggest that re-infection with GBV-C is very rare among people who are already infected with HIV. We therefore assume that HIV subjects who were co-infected with GBV-C would not be re-infected once they clear it.

In our analysis, we treat the GBV-C status evaluated at 12–18 months as the baseline GBV-C information to select a subsample of HIV patients who are assumed to be co-infected with GBV-C at HIV onset. The GBV-C status evaluated at the second time after HIV seroconversion provides the current status data for GBV-C persistence time. The Gumbel copula is used for the bivariate distribution of HIV survival and GBV-C persistence times. The bootstrap standard error based on 1,000 resamples with replacement was used to estimate the standard error of the estimated association parameter and to construct the Wald confidence interval. There are 61 subjects who were GBV-C positive at the first visit and whose GBV-C status at the late visit was known and evaluated before January 1, 1996. In order to use as much data as possible, we define the current status of GBV-C co-infection for the subjects whose late observations on GBV-C were unavailable before January 1, 1996 as follows: (i) for those whose second GBV-C test was negative and evaluated after January 1, 1996, GBV-C persistence times were right censored at the first visit (n = 2); (ii) for those whose second GBV-C test was positive and evaluated after January 1, 1996, GBV-C persistence times were right censored at January 1, 1996 (n = 7); and (iii) for those whose second GBV-C test was missing, GBV-C persistence times were right censored at the first visit (n = 37). Therefore, we have a total of 107 subjects for analysis. Table 3 presents the results when all the subjects who were GBV-C positive at the first visit are included. The maximum pseudo-likelihood estimate of Kendall's τ is ${\hat{τ}}_{n} = 0.3685$, with a 95% confidence interval of [0.1988, 0.5383] using the asymptotic normality or [0.2114, 0.5533] using the bootstrap method. The result indicates that GBV-C persistence time is moderately associated with increased survival among HIV and GBV-C co-infected individuals.

Table 3. The analysis of association between HIV survival time and GBV-C persistence time: include all subjects whose GBV-C at early visit are positive (N = 107)

| | Estimate | Bootstrap SE | 95% Wald CI | 95% Bootstrap CI |
|---|---|---|---|---|
| $\hat{\alpha}_{n}$ | 1.5836 | 0.2037 | [1.1843, 1.9829] | [1.2043, 2.0598] |
| $\hat{\tau}_{n}$ | 0.3685 | 0.0866 | [0.1988, 0.5383] | [0.2114, 0.5533] |

6 Final remarks

This manuscript proposes a method for assessing the association between two random variables which are subject to different censoring schemes: one is right censored and the other is observed as current status data. The asymptotic properties of the estimator of association parameter, including consistency and asymptotic normality, are established under mild technical assumptions. Although the asymptotic variance of the estimator has a complicated form and is difficult to estimate directly, the ordinary bootstrap method provides a practical and efficient way to estimate the standard error.

Our simulation results suggest that the proposed method works well for moderate sample size and has the advantage of allowing for flexibility in the marginal distributions. Moreover, our numerical study shows that the proposed method is quite efficient compared to the full maximum likelihood approach in which the marginal distributions are given. It suggests that the efficiency loss from the pseudo-likelihood approach is not substantial.

Some copula functions, such as the Gumbel copula, are equivalent to the independence copula only when the association parameter lies on the boundary of the parameter space. This may cause some regularity conditions to fail, so that the likelihood theory cannot be easily developed, which makes testing the independence of bivariate event times with copula models problematic. Several nonparametric tests of dependence have been developed for bivariate censored data (Oakes 1982; Shih and Louis 1996; Hsu and Prentice 1996; Ding and Wang 2004). A nonparametric procedure for testing the dependence between HIV survival and GBV-C persistence time needs to be developed under the hybrid censoring considered in this paper.

A new study testing additional stored samples from the MACS cohort is being planned. Using these additional time points in the analysis will provide considerably more power and precision. With more GBV-C screening in the new study, GBV-C persistence time will be interval censored, and the method presented here will be extended to model bivariate event data with one margin subject to right censoring and the other subject to interval censoring case 2. Other applications with time-varying covariates subject to interval censoring case 2 are readily available.

Acknowledgements

The authors wish to thank the Multicenter AIDS Cohort Study (MACS) for providing data. The MACS has centers located at: The Johns Hopkins Bloomberg School of Public Health (Joseph Margolick); Howard Brown Health Center and Northwestern University Medical School (John Phair); University of California, Los Angeles (Roger Detels); University of Pittsburgh (Charles Rinaldo); and Data Analysis Center (Lisa Jacobson). The authors are also thankful to the editors and two anonymous referees, whose insightful comments and suggestions greatly helped improve this manuscript from an early version.

Appendix

This section provides proofs for the lemmas and theorems stated in Sect. 3. We use modern empirical process theory to justify our proofs. We denote ∫ fdP by Pf and $\frac{1}{n} \sum_{i = 1}^{n} f (X_{i})$ by $P_{n} f$ .

Proof of Lemma 1 Since $F_{j}$ consists of uniformly bounded monotone functions on the real line, by Theorem 2.7.5 of van der Vaart and Wellner (1996), for any ε > 0 and for j = 1, 2, there exists a set of brackets:

[f_{j 1}^{L}, f_{j 1}^{U}], [f_{j 2}^{L}, f_{j 2}^{U}], \dots, [f_{j N_{j}}^{L}, f_{j N_{j}}^{U}],

with N_j ≤ exp (D/ε) and ${(\int {∣ f_{j i}^{U} - f_{j i}^{L} ∣}^{r} d P)}^{1 ∕ r} \leq ∊$ for any 1 ≤ i ≤ N_j and r > 0, such that for any $f_{j} \in F_{j}$ and any $t_{j} \in [0, t_{0 j}], f_{j q_{j}}^{L} (t_{j}) \leq f_{j} (t_{j}) \leq f_{j q_{j}}^{U} (t_{j})$ for some 1 ≤ q_j ≤ N_j.

By (A2), V_α,1(α, f₁(t₁), f₂(t₂), δ₁, δ₂) is continuous. We can then construct a set of brackets as follows: for any i = 1, 2, . . . , N₁, s = 1, 2, . . . , N₂ and for any t_j ∈ [0, t_0j], we can find the unique maximum and minimum of V_α,1(α, f₁(t₁), f₂(t₂), δ₁, δ₂) on the product set $[f_{1 i}^{L}, f_{1 i}^{U}] \times [f_{2 s}^{L}, f_{2 s}^{U}]$ . Let

\begin{matrix} (f_{1}^{L, (i, s)} (t_{1}), f_{2}^{L, (i, s)} (t_{2})) & = \underset{\begin{matrix} f_{1} \in [f_{1 i}^{L}, f_{1 i}^{U}] \\ f_{2} \in [f_{2 s}^{L}, f_{2 s}^{U}] \end{matrix}}{argmin} V_{α, 1} (α, f_{1} (t_{1}), f_{2} (t_{2}), δ_{1}, δ_{2}) \\ (f_{1}^{U, (i, s)} (t_{1}), f_{2}^{U, (i, s)} (t_{2})) & = \underset{\begin{matrix} f_{1} \in [f_{1 i}^{L}, f_{1 i}^{U}] \\ f_{2} \in [f_{2 s}^{L}, f_{2 s}^{U}] \end{matrix}}{argmax} V_{α, 1} (α, f_{1} (t_{1}), f_{2} (t_{2}), δ_{1}, δ_{2}) \end{matrix}

and let

\begin{matrix} V_{α, 1}^{L, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}) & = V_{α, 1} (α, f_{1}^{L, (i, s)} (t_{1}), f_{2}^{L, (i, s)} (t_{2}), δ_{1}, δ_{2}) \\ V_{α, 1}^{U, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}) & = V_{α, 1} (α, f_{1}^{U, (i, s)} (t_{1}), f_{2}^{U, (i, s)} (t_{2}), δ_{1}, δ_{2}) . \end{matrix}

The class $G_{F}$ is then covered by a set of N₁ × N₂ brackets:

\{ [V_{α, 1}^{L, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}), V_{α, 1}^{U, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2})] : i = 1, 2, \dots, N_{1}, s = 1, 2, \dots, N_{2} \} .

By (A2), V_α,1²(α, u, v, δ₁, δ₂) and V_α,1,2(α, u, v, δ₁, δ₂) are bounded by some constant D, so V_α,1(α, u, v, δ₁, δ₂) satisfies the Lipschitz condition with respect to u and v. It follows that:

\begin{matrix} \int ∣ V_{α, 1}^{U, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}) - V_{α, 1}^{L, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}) ∣ d P & = \int ∣ V_{α, 1} (α, f_{1}^{U, (i, s)} (t_{1}), f_{2}^{U, (i, s)} (t_{2}), δ_{1}, δ_{2}) - V_{α, 1} (α, f_{1}^{L, (i, s)} (t_{1}), f_{2}^{L, (i, s)} (t_{2}), δ_{1}, δ_{2}) ∣ d P \\ \leq \int [D ∣ f_{1}^{U, (i, s)} (t_{1}) - f_{1}^{L, (i, s)} (t_{1}) ∣ + D ∣ f_{2}^{U, (i, s)} (t_{2}) - f_{2}^{L, (i, s)} (t_{2}) ∣] d P \\ \leq D ∊ . \end{matrix}

This indicates that the preceding N₁ × N₂ brackets are Dε-brackets for $G_{F}$ . It follows that, for any ε > 0, the bracketing number of the class $G_{F}$ associated with the L₁(P) norm is finite. By Theorem 2.4.1 of van der Vaart and Wellner (1996), $G_{F}$ is a P-Glivenko-Cantelli class.

Proof of Lemma 2 Using a technique similar to that in the proof of Lemma 1, we can construct a set of N₁ × N₂ brackets:

\{ [V_{α}^{L, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}) - V_{α} (α, S_{1} (t_{1}), S_{2} (t_{2}), δ_{1}, δ_{2}), V_{α}^{U, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}) - V_{α} (α, S_{1} (t_{1}), S_{2} (t_{2}), δ_{1}, δ_{2})] : i = 1, 2, \dots, N_{1}, s = 1, 2, \dots, N_{2} \},

which covers $H_{F}$ .

By (A2), V_α,1(α, u, v, δ₁, δ₂) and V_α,2(α, u, v, δ₁, δ₂) are bounded by some constant D, so V_α(α, u, v, δ₁, δ₂) satisfies the Lipschitz condition with respect to u and v. Since (x + y)² = x² + y² + 2xy ≤ 2x² + 2y², it follows that

\begin{matrix} \int {(V_{α}^{U, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}) - V_{α}^{L, (i, s)} (t_{1}, t_{2}, δ_{1}, δ_{2}))}^{2} d P & = \int {∣ V_{α} (α, f_{1}^{U, (i, s)} (t_{1}), f_{2}^{U, (i, s)} (t_{2}), δ_{1}, δ_{2}) - V_{α} (α, f_{1}^{L, (i, s)} (t_{1}), f_{2}^{L, (i, s)} (t_{2}), δ_{1}, δ_{2}) ∣}^{2} d P \\ \leq \int {[D ∣ f_{1}^{U, (i, s)} (t_{1}) - f_{1}^{L, (i, s)} (t_{1}) ∣ + D ∣ f_{2}^{U, (i, s)} (t_{2}) - f_{2}^{L, (i, s)} (t_{2}) ∣]}^{2} d P \\ \leq 2 D^{2} \int {∣ f_{1}^{U, (i, s)} (t_{1}) - f_{1}^{L, (i, s)} (t_{1}) ∣}^{2} d P + 2 D^{2} \int {∣ f_{2}^{U, (i, s)} (t_{2}) - f_{2}^{L, (i, s)} (t_{2}) ∣}^{2} d P \\ \leq D ∊^{2} . \end{matrix}

This indicates that the bracketing number of $H_{F}$ associated with L₂(P) norm, denoted by $N_{[]} (∊, H_{F}, L_{2} (P))$ , is bounded by N₁ × N₂. It follows that $log (N_{[]} (∊, H_{F}, L_{2} (P))) \leq log (N_{1} \times N_{2}) \leq D ∕ ∊$ for some constant D. Hence,

\int_{0}^{1} \sqrt{log N_{[]} (∊, H_{F}, L_{2} (P))} d ∊ \leq \int_{0}^{1} D ∊^{- 1 ∕ 2} d ∊ < \infty .

By Theorem 19.5 of van der Vaart (1998, p. 270), $H_{F}$ is a P-Donsker class.

Proof of Theorem 1 Let

\begin{matrix} {\overset{‒}{L}}_{n} (α, {\hat{S}}_{1}, {\hat{S}}_{2}; X) & = \frac{1}{n} \sum_{i = 1}^{n} l (α, {\hat{S}}_{1} (t_{1 i}), {\hat{S}}_{2} (t_{2 i}), δ_{1 i}, δ_{2 i}) \\ {\overset{‒}{L}}_{n} (α, S_{1}, S_{2}; X) & = \frac{1}{n} \sum_{i = 1}^{n} l (α, S_{1} (t_{1 i}), S_{2} (t_{2 i}), δ_{1 i}, δ_{2 i}), \end{matrix}

where l(α, Ŝ₁(t_1i), Ŝ₂(t_2i), δ_1i, δ_2i) is defined in (3). First we show that ${\overset{‒}{L}}_{n} (α, {\hat{S}}_{1}, {\hat{S}}_{2}; X) \overset{p}{\to} E_{α_{0}} l (α, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2})$ for any α ∈ A.

By Taylor series expansion, we have

{\overset{‒}{L}}_{n} (α, {\hat{S}}_{1}, {\hat{S}}_{2}; X) = {\overset{‒}{L}}_{n} (α, S_{1}, S_{2}; X) + \frac{1}{n} \sum_{i = 1}^{n} ({\hat{S}}_{1} (t_{1 i}) - S_{1} (t_{1 i})) V_{α, 1} (α, {\tilde{S}}_{1} (t_{1 i}), {\hat{S}}_{2} (t_{2 i}), δ_{1 i}, δ_{2 i}) + \frac{1}{n} \sum_{i = 1}^{n} ({\hat{S}}_{2} (t_{2 i}) - S_{2} (t_{2 i})) V_{α, 2} (α, {\hat{S}}_{1} (t_{1 i}), {\tilde{S}}_{2} (t_{2 i}), δ_{1 i}, δ_{2 i})

where ${sup}_{t_{1} \in [0, t_{01}]} ∣ {\tilde{S}}_{1} (t_{1}) - S_{1} (t_{1}) ∣ \leq {sup}_{t_{1} \in [0, t_{01}]} ∣ {\hat{S}}_{1} (t_{1}) - S_{1} (t_{1}) ∣ \overset{p}{\to} 0$ (Fleming and Harrington 1991) and ${sup}_{t_{2} \in [0, t_{02}]} ∣ {\tilde{S}}_{2} (t_{2}) - S_{2} (t_{2}) ∣ \leq {sup}_{t_{2} \in [0, t_{02}]} ∣ {\hat{S}}_{2} (t_{2}) - S_{2} (t_{2}) ∣ \overset{p}{\to} 0$ (Groeneboom and Wellner 1992), respectively.

By the weak law of large numbers,

{\overset{‒}{L}}_{n} (α, S_{1}, S_{2}; X) \overset{p}{\to} E_{α_{0}} l (α, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) .

Note that

∣ \frac{1}{n} \sum_{i = 1}^{n} ({\hat{S}}_{1} (t_{1 i}) - S_{1} (t_{1 i})) V_{α, 1} (α, {\tilde{S}}_{1} (t_{1 i}), {\hat{S}}_{2} (t_{2 i}), δ_{1 i}, δ_{2 i}) ∣ \leq sup_{t_{1} \in [0, t_{01}]} ∣ {\hat{S}}_{1} (t_{1}) - S_{1} (t_{1}) ∣ P_{n} ∣ V_{α, 1} (α, {\tilde{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) ∣

Denote $∣ G_{F} ∣ = \{ ∣ V_{α, 1} (α, f_{1} (t_{1}), f_{2} (t_{2}), δ_{1}, δ_{2}) ∣ ; f_{j} \in F_{j}, j = 1, 2 \}$ . Since $G_{F}$ is a P-Glivenko-Cantelli class by Lemma 1, straightforward algebra shows that the ε-bracketing number of $∣ G_{F} ∣$ is the same as the ε-bracketing number of $G_{F}$ , so $∣ G_{F} ∣$ is a P-Glivenko-Cantelli class as well. Hence

\begin{matrix} P_{n} ∣ V_{α, 1} (α, {\tilde{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) ∣ & = P ∣ V_{α, 1} (α, {\tilde{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) ∣ + o_{p} (1) \\ = P ∣ V_{α, 1} (α, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) ∣ + o_{p} (1), \end{matrix}

due to the uniform consistency of S̃₁(·) and Ŝ₂(·), the continuous mapping theorem, assumption (A2), and the dominated convergence theorem. This implies that

∣ \frac{1}{n} \sum_{i = 1}^{n} ({\hat{S}}_{1} (t_{1 i}) - S_{1} (t_{1 i})) V_{α, 1} (α, {\tilde{S}}_{1} (t_{1 i}), {\hat{S}}_{2} (t_{2 i}), δ_{1 i}, δ_{2 i}) ∣ \overset{p}{\to} 0 .

A similar argument shows that

∣ \frac{1}{n} \sum_{i = 1}^{n} ({\hat{S}}_{2} (t_{2 i}) - S_{2} (t_{2 i})) V_{α, 2} (α, {\hat{S}}_{1} (t_{1 i}), {\tilde{S}}_{2} (t_{2 i}), δ_{1 i}, δ_{2 i}) ∣ \overset{p}{\to} 0 .

This establishes that ${\overset{‒}{L}}_{n} (α, {\hat{S}}_{1}, {\hat{S}}_{2}; X) \overset{p}{\to} E_{α_{0}} l (α, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2})$ . Now, for any $α \in A$ with $α \neq α_{0}$ , Jensen's inequality gives

{\overset{‒}{L}}_{n} (α, {\hat{S}}_{1}, {\hat{S}}_{2}; X) - {\overset{‒}{L}}_{n} (α_{0}, {\hat{S}}_{1}, {\hat{S}}_{2}; X) \overset{p}{\to} E_{α_{0}} l (α, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) - E_{α_{0}} l (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) = E_{α_{0}} log \frac{h (α, T_{1}, T_{2}, Δ_{1}, Δ_{2})}{h (α_{0}, T_{1}, T_{2}, Δ_{1}, Δ_{2})} < log E_{α_{0}} \frac{h (α, T_{1}, T_{2}, Δ_{1}, Δ_{2})}{h (α_{0}, T_{1}, T_{2}, Δ_{1}, Δ_{2})} = 0 .

Due to the convergence demonstrated above, for any ε, δ > 0 for which (α₀ – ε, α₀ + ε) ⊂ A, we may find an integer N = N (ε, δ) such that, if n > N, for α = α₀ ± ε,

P ({\overset{‒}{L}}_{n} (α, {\hat{S}}_{1}, {\hat{S}}_{2}; X) < {\overset{‒}{L}}_{n} (α_{0}, {\hat{S}}_{1}, {\hat{S}}_{2}; X)) > 1 - δ .

Thus for n > N,

$P ({\overset{‒}{L}}_{n} (α, {\hat{S}}_{1}, {\hat{S}}_{2}; X) \text{ has a local maximum } {\hat{α}}_{n} \in (α_{0} - ∊, α_{0} + ∊)) > 1 - 2 δ$ , because of (A1). This immediately shows that the sequence of random variables ${\hat{α}}_{n}$ converges in probability to α₀ as n → ∞.

Proof of Theorem 2 Under (A1), Taylor expansion of the pseudo score function gives

0 = P_{n} V_{α} ({\hat{α}}_{n}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) = P_{n} V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) + ({\hat{α}}_{n} - α_{0}) P_{n} V_{α^{2}} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) + O_{p} ({∣ {\hat{α}}_{n} - α_{0} ∣}^{2}),

then we get

\sqrt{n} ({\hat{α}}_{n} - α_{0}) = \frac{\sqrt{n} P_{n} V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2})}{- P_{n} V_{α^{2}} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) - O_{p} (∣ {\hat{α}}_{n} - α_{0} ∣)} .

First, we show that

P_{n} V_{α^{2}} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) \overset{p}{\to} W (α_{0}, S_{1}, S_{2}),

where

\begin{matrix} W (α_{0}, S_{1}, S_{2}) & = P V_{α^{2}} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) \\ = - P {[V_{α} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2})]}^{2} . \end{matrix}

We can rewrite $P_{n} V_{α^{2}} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) = P_{n} V_{α^{2}} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) + R_{n}$ . By the uniform consistency of Ŝ₁ and Ŝ₂ and the fact that V_α² (α₀, S₁(t₁), S₂(t₂), δ₁, δ₂) satisfies the Lipschitz condition due to (A2), it follows that

P_{n} V_{α^{2}} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) = P_{n} V_{α^{2}} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) + o_{p} (1) .

This results in

P_{n} V_{α^{2}} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) \overset{p}{\to} P V_{α^{2}} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2})

by the weak law of large numbers.

Second, we derive the asymptotic distribution of $\sqrt{n} P_{n} V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2})$ . Note that

\begin{matrix} P_{n} V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) & = (P_{n} - P) (V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) - V_{α} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2})) + P_{n} V_{α} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) + P (V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) - V_{α} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2})) \\ = u_{1 n} + u_{2 n} + u_{3 n} . \end{matrix}

Lemma 2 indicates that under (A1) and (A2), $H_{F}$ is a P-Donsker class. Furthermore, since ${sup}_{0 \leq t_{j} \leq t_{0 j}} ∣ {\hat{S}}_{j} (t_{j}) - S_{j} (t_{j}) ∣ \overset{p}{\to} 0, j = 1, 2$ , by the Dominated Convergence Theorem,

\int {({\hat{S}}_{j} (t_{j}) - S_{j} (t_{j}))}^{2} d P (t_{1}, t_{2}, δ_{1}, δ_{2}) \overset{p}{\to} 0, j = 1, 2 .

Therefore, $\sqrt{n} u_{1 n} = o_{p} (1)$ by Lemma 19.24 of van der Vaart (1998).

Note that u_2n is a sum of independent and identically distributed quantities, where each quantity has mean

\int V_{α} (α_{0}, S_{1} (t_{1}), S_{2} (t_{2}), δ_{1}, δ_{2}) d P (t_{1}, t_{2}, δ_{1}, δ_{2}) = 0

and variance

\int {[V_{α} (α_{0}, S_{1} (t_{1}), S_{2} (t_{2}), δ_{1}, δ_{2})]}^{2} d P (t_{1}, t_{2}, δ_{1}, δ_{2}) = - W (α_{0}, S_{1}, S_{2}) .

By the Central Limit Theorem, $\sqrt{n} u_{2 n}$ converges in distribution to a normal random variable with mean 0 and variance – W (α₀, S₁, S₂).

Applying the von Mises expansion (von Mises 1947) to u_3n around S₁, S₂, we get

u_{3 n} \overset{d}{=} \int_{0}^{t_{01}} I C_{1} (t_{1}) d ({\hat{S}}_{1} - S_{1}) (t_{1}) + \int_{0}^{t_{02}} I C_{2} (t_{2}) d ({\hat{S}}_{2} - S_{2}) (t_{2}),

(4)

where IC_j (t), j = 1, 2 are the influence curves of the functional PV_α(α₀, S₁(T₁), S₂(T₂), Δ₁, Δ₂) which are defined by

\begin{matrix} I C_{1} (t_{1}) & = - \int_{0}^{t_{1}} \int_{0}^{t_{02}} V_{α, 1} (α_{0}, S_{1} (τ_{1}), S_{2} (τ_{2}), δ_{1}, δ_{2}) d P (τ_{1}, τ_{2}, δ_{1}, δ_{2}) \\ = \int_{0}^{t_{1}} \int_{0}^{t_{02}} M_{α, 1} (α_{0}, S_{1} (τ_{1}), S_{2} (τ_{2})) f (τ_{1}, τ_{2}) d τ_{1} d τ_{2} \end{matrix}

and

\begin{matrix} I C_{2} (t_{2}) & = - \int_{0}^{t_{2}} \int_{0}^{t_{01}} V_{α, 2} (α_{0}, S_{1} (τ_{1}), S_{2} (τ_{2}), δ_{1}, δ_{2}) d P (τ_{1}, τ_{2}, δ_{1}, δ_{2}) \\ = \int_{0}^{t_{2}} \int_{0}^{t_{01}} M_{α, 2} (α_{0}, S_{1} (τ_{1}), S_{2} (τ_{2})) f (τ_{1}, τ_{2}) d τ_{1} d τ_{2}, \end{matrix}

(5)

respectively. Here

\begin{matrix} M_{α, 1} (α_{0}, S_{1} (τ_{1}), S_{2} (τ_{2})) & = - E \{ V_{α, 1} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) ∣ T_{1} = τ_{1}, T_{2} = τ_{2} \} \\ M_{α, 2} (α_{0}, S_{1} (τ_{1}), S_{2} (τ_{2})) & = - E \{ V_{α, 2} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) ∣ T_{1} = τ_{1}, T_{2} = τ_{2} \} . \end{matrix}

Using the martingale theory for counting processes, Pepe (1991) showed that, for t₁ ∈ [0, t₀₁], (Ŝ₁ (t₁) – S₁ (t₁)) is asymptotically equivalent to a sum of n i.i.d. random variables $\sum_{i} I_{1}^{0} (T_{1 i}, Δ_{1 i}) (t_{1}) ∕ n$ . It follows that

\int_{0}^{t_{01}} I C_{1} (t_{1}) d ({\hat{S}}_{1} - S_{1}) (t_{1}) = \frac{1}{n} \sum_{i = 1}^{n} I_{1} (T_{1 i}, Δ_{1 i}; α_{0}) + o_{p} (1),

(6)

where

I_{1} (T_{1 i}, Δ_{1 i}; α_{0}) = \int_{0}^{t_{01}} \int_{0}^{t_{02}} M_{α, 1} (α_{0}, S_{1} (τ_{1}), S_{2} (τ_{2})) f (τ_{1}, τ_{2}) I_{1}^{0} (T_{1 i}, Δ_{1 i}) (τ_{1}) d τ_{1} d τ_{2}

and $I_{1}^{0}$ is a martingale given by

I_{1}^{0} (T_{1}, Δ_{1}) (t_{1}) = - S_{1} (t_{1}) \{ \int_{0}^{t_{1}} \frac{1}{P (T_{1} \geq u)} d N_{1} (u) - \int_{0}^{t_{1}} \frac{I [T_{1} \geq u]}{P (T_{1} \geq u)} d Λ_{1} (u) \},

in which N₁(u) is defined as I[T₁ ≤ u, Δ₁ = 1] and Λ₁ is the cumulative hazard function of $T_{1}^{0}$ .
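For a concrete view of Ŝ₁, the following is a minimal sketch of the product-limit (Kaplan-Meier) estimator for right censored data. It assumes that Ŝ₁ is the Kaplan-Meier estimator, as the martingale representation above suggests; the function name and the toy data are hypothetical.

```python
import numpy as np

def kaplan_meier(times, events):
    """Product-limit estimate of the survival function.

    times  : observed times (event or censoring time, whichever came first)
    events : 1 if the event was observed, 0 if the time is right censored
    Returns the distinct event times and the estimate just after each of them.
    """
    times = np.asarray(times, dtype=float)
    events = np.asarray(events, dtype=int)
    event_times = np.unique(times[events == 1])
    surv = np.empty(len(event_times))
    s = 1.0
    for k, t in enumerate(event_times):
        at_risk = np.sum(times >= t)                  # subjects still under observation at t
        deaths = np.sum((times == t) & (events == 1)) # events observed exactly at t
        s *= 1.0 - deaths / at_risk
        surv[k] = s
    return event_times, surv

# Toy data: the second subject is censored at time 2.
t, s = kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1])
print(t, s)  # event times 1, 3, 4 with estimates 0.75, 0.375, 0.0
```

The censored subject contributes to the risk set at time 1 but not afterwards, which is why the second factor is 1/2 rather than 2/3.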

On the other hand, although (Ŝ₂ – S₂)(t₂) cannot be written as a sum of i.i.d. random quantities, a smooth functional of the nonparametric maximum likelihood estimator Ŝ₂ can still be shown to be asymptotically normal (Huang and Wellner 1995). Using this property and the regularity conditions (A3)–(A6), Wang and Ding (2000) showed that

\int_{0}^{t_{02}} I C_{2} (t_{2}) d ({\hat{S}}_{2} - S_{2}) (t_{2}) = - P_{n} \tilde{l} (T_{2}, Δ_{2}; S_{2}, G_{2}, ψ_{2}) + o_{p} (1)

(7)

with $\tilde{l} (T_{2}, Δ_{2}; S_{2}, G_{2}, ψ_{2}) = - [Δ_{2} - (1 - S_{2} (T_{2}))] \frac{ψ_{2} (T_{2})}{g_{2} (T_{2})} I [g_{2} (T_{2}) > 0]$ and thus $\sqrt{n} \int_{0}^{t_{02}} I C_{2} (t_{2}) d ({\hat{S}}_{2} - S_{2}) (t_{2})$ converges in distribution to a normal random variable with mean 0.
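The nonparametric maximum likelihood estimator for current status data is commonly computed by isotonic regression of the status indicators on the ordered monitoring times. The sketch below uses the pool-adjacent-violators algorithm for that purpose. It is an illustration under stated assumptions, not the authors' code: the function name is hypothetical, the indicator is taken to be 1 when the event has occurred by the monitoring time, and monitoring times are assumed distinct.

```python
import numpy as np

def npmle_current_status(monitor_times, deltas):
    """NPMLE of the distribution function F from current status data.

    monitor_times : monitoring times (assumed distinct)
    deltas        : 1 if the event had occurred by the monitoring time, else 0
    Returns the sorted monitoring times and the estimate of F at each of them;
    the survival estimate is 1 minus the returned values.
    """
    order = np.argsort(monitor_times, kind="stable")
    c = np.asarray(monitor_times, dtype=float)[order]
    d = np.asarray(deltas, dtype=float)[order]
    blocks = []  # each block holds [sum of indicators, number of observations]
    for y in d:
        blocks.append([y, 1])
        # pool adjacent blocks while their means violate monotonicity
        while len(blocks) > 1 and (
            blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]
        ):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
    f_hat = np.concatenate([np.full(m, total / m) for total, m in blocks])
    return c, f_hat

c, f_hat = npmle_current_status([1, 2, 3, 4], [0, 1, 0, 1])
print(c, f_hat)  # F estimate 0, 0.5, 0.5, 1 at times 1, 2, 3, 4
```

The estimate is a step function that converges more slowly than the Kaplan-Meier estimator, which is why the argument in the text works with smooth functionals of Ŝ₂ rather than with Ŝ₂ itself.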

In summary, we obtain

\begin{matrix} P_{n} V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2}) & = P_{n} [V_{α} (α_{0}, S_{1} (T_{1}), S_{2} (T_{2}), Δ_{1}, Δ_{2}) + I_{1} (T_{1}, Δ_{1}; α_{0}) - \tilde{l} (T_{2}, Δ_{2}; S_{2}, G_{2}, ψ_{2})] + o_{p} (n^{- 1 ∕ 2}) \\ = P_{n} Q (T_{1}, T_{2}, Δ_{1}, Δ_{2}; α_{0}, S_{1}, S_{2}) + o_{p} (n^{- 1 ∕ 2}) . \end{matrix}

Therefore, $\sqrt{n} P_{n} V_{α} (α_{0}, {\hat{S}}_{1} (T_{1}), {\hat{S}}_{2} (T_{2}), Δ_{1}, Δ_{2})$ is asymptotically normal with mean zero and variance Var(Q(T₁, T₂, Δ₁, Δ₂; α₀, S₁, S₂)). Hence,

\sqrt{n} ({\hat{α}}_{n} - α_{0}) \overset{d}{\to} N (0, σ^{2}),

where

σ^{2} = \frac{Var (Q (T_{1}, T_{2}, Δ_{1}, Δ_{2}; α_{0}, S_{1}, S_{2}))}{W^{2} (α_{0}, S_{1}, S_{2})} .
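The structure of σ² can be illustrated with a plug-in calculation. The sketch below assumes that per-subject values of Q and of the score V_α evaluated at the estimates are already available; obtaining the Q values requires estimating the two influence terms, which is the difficult step and the reason the bootstrap is recommended in practice. The function and argument names are hypothetical.

```python
import numpy as np

def plug_in_variance(q_values, score_values):
    """Plug-in version of sigma^2 = Var(Q) / W^2, with W = -P[V_alpha^2].

    q_values     : hypothetical per-subject estimates of Q
    score_values : per-subject values of the score V_alpha at the estimates
    """
    q = np.asarray(q_values, dtype=float)
    v = np.asarray(score_values, dtype=float)
    w = -np.mean(v ** 2)          # empirical analogue of W
    return np.var(q, ddof=1) / w ** 2

# Toy numbers only, to show the arithmetic: Var(Q) = 4/3 and W = -4.
print(plug_in_variance([1, -1, 1, -1], [2, -2, 2, -2]))
```

If the margins were known, Q would reduce to V_α and σ² would reduce to −1/W, the usual inverse information; the extra terms in Q account for estimating S₁ and S₂.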

Contributor Information

Suhong Zhang, Division of Biostatistics, Edwards Lifesciences, One Edwards Way, Irvine, CA 92612, USA suhong.zhang@edwards.com.

Ying Zhang, Department of Biostatistics, University of Iowa, C22 GH, 200 Hawkins Drive, Iowa City, IA 52242, USA ying-j-zhang@uiowa.edu.

Kathryn Chaloner, Department of Biostatistics, University of Iowa, C22 GH, 200 Hawkins Drive, Iowa City, IA 52242, USA kathryn-chaloner@uiowa.edu.

Jack T. Stapleton, Department of Internal Medicine, University of Iowa and Iowa City VA Medical Center, SW54-15 GH, 200 Hawkins Drive, Iowa City, IA 52242, USA jack-stapleton@uiowa.edu

References

Birk M, Lindback S, Lidman C. No influence of GB virus C replication on the prognosis in a cohort of HIV-1-infected patients. AIDS. 2002;16:2482–2485. doi: 10.1097/00002030-200212060-00017.
Ding AA, Wang W. Testing independence for bivariate current status data. J Am Stat Assoc. 2004;99:145–155.
Fleming TR, Harrington DP. Counting processes and survival analysis. John Wiley & Sons; New York: 1991.
Groeneboom P, Wellner JA. Information bounds and nonparametric maximum likelihood estimation. Birkhauser; Boston: 1992.
Hougaard P. Fitting a multivariate failure time distribution. IEEE Trans Reliab. 1989;38:444–448.
Hsu L, Prentice RL. A generalisation of the Mantel-Haenszel test to bivariate failure time data. Biometrika. 1996;83:905–911.
Huang J, Wellner JA. Asymptotic normality of the NPMLE of linear functionals for interval censored data, case 1. Stat Neerl. 1995;49:153–163.
Kalbfleisch JD, Prentice RL. The statistical analysis of failure time data. 2nd edn. Wiley-Interscience; New York: 2002.
Liang KY, Self SG, Bandeen-Roche K, Zeger S. Some recent developments for regression analysis of multivariate failure time data. Lifetime Data Anal. 1995;1:403–415. doi: 10.1007/BF00985452.
Nelsen RB. An introduction to copulas. 2nd edn. Springer-Verlag; New York: 2006.
Oakes D. A concordance test for independence in the presence of censoring. Biometrics. 1982;38:451–455.
Oakes D. Bivariate survival models induced by frailties. J Am Stat Assoc. 1989;84:487–493.
Oakes D. Survival analysis. J Am Stat Assoc. 2000;95:282–285.
Pepe MS. Inference for events with dependent risks in multiple endpoint studies. J Am Stat Assoc. 1991;86:770–778.
Shih JH, Louis TA. Inference on the association parameter in copula models for bivariate survival data. Biometrics. 1995;51:1384–1399.
Shih JH, Louis TA. Tests of independence for bivariate survival data. Biometrics. 1996;52:1440–1449.
Tillmann H, Heiken H, Knapik-Botor A, Heringlake S, Ockenga J, et al. Infection with GB virus C and reduced mortality among HIV-infected patients. N Engl J Med. 2001;345:715–724. doi: 10.1056/NEJMoa010398.
Toyoda H, Fukuda Y, et al. Effect of GB virus C/hepatitis G virus coinfection on the course of HIV infection in hemophilia patients in Japan. J Acquir Immune Defic Syndr Hum Retrovirol. 1998;17:209–213. doi: 10.1097/00042560-199803010-00004.
van der Vaart AW. Asymptotic statistics. Cambridge Univ. Press; Cambridge: 1998.
van der Vaart AW, Wellner JA. Weak convergence and empirical processes with applications to statistics. Springer-Verlag; New York: 1996.
von Mises R. On the asymptotic distribution of differentiable statistical functions. Ann Math Statist. 1947;18:309–348.
Wang W, Ding AA. On assessing the association for bivariate current status data. Biometrika. 2000;87:879–893.
Williams C, Klinzman D, Yamashita T, Xiang J, et al. Persistent GB virus C infection and survival in HIV-infected men. N Engl J Med. 2004;350:981–990. doi: 10.1056/NEJMoa030107.
Xiang J, Wunschmann W, Diekema D, Klinzman D, Patrick K, et al. Effect of coinfection with GB virus C on survival among patients with HIV infection. N Engl J Med. 2001;345:707–714. doi: 10.1056/NEJMoa003364.
Zhang W, Chaloner K, Tillmann HS, Williams CF, Stapleton JT. Effect of early and late GBV-C viremia on survival of HIV-infected individuals: a meta-analysis. HIV Med. 2006;7:173–180. doi: 10.1111/j.1468-1293.2006.00366.x.
