Asymptotic results for fitting marginal hazards models from stratified case-cohort studies with multiple disease outcomes

Sangwook Kang; Jianwen Cai

doi:10.1016/j.jkss.2010.03.005

Author manuscript; available in PMC: 2012 Mar 20.

Published in final edited form as: J Korean Stat Soc. 2010 Sep;39(3):371–385. doi: 10.1016/j.jkss.2010.03.005

PMCID: PMC3308729 NIHMSID: NIHMS195411 PMID: 22442642

Abstract

In stratified case-cohort designs, the case-cohort sample is drawn by stratified random sampling based on covariate information available for all cohort members. In this paper, we extend the work of Kang & Cai (2009) to a generalized stratified case-cohort study design for failure time data with multiple disease outcomes. Under this study design, we develop weighted estimating procedures for model parameters in marginal multiplicative intensity models and for the cumulative baseline hazard function. The asymptotic properties of the estimators are studied using martingales, modern empirical process theory, and results for finite population sampling.

Keywords: marginal hazards model, multivariate failure times, stratified case-cohort design, survival analysis, weighted estimating equations

1 Introduction

The case-cohort study design (Prentice, 1986) has been proposed to reduce the cost and effort arising in large cohort studies with time-to-event data. The reduction can be substantial, especially if the main disease of interest is rare and the main covariate of interest (exposure) is expensive to measure, since the case-cohort design requires measurement of the exposure only on a subset of the whole cohort. Specifically, the sampling in the case-cohort design comprises the following two steps. First, a subset of the entire cohort, called the subcohort, is sampled randomly regardless of failure status. Second, the remaining cases outside the subcohort are sampled. Information on the exposure is obtained only on these sampled subjects.

While the exposure will be available only for the case-cohort sample, some less expensive covariates such as age, gender, or a correlate of the exposure might be easily obtained for all the cohort members. In such cases, the subcohort could be sampled via a stratified simple random sampling based on strata defined by some of these covariates. This stratified case-cohort design could lead to a large efficiency gain compared to the unstratified counterpart since the latter ignores the available information.

For a single disease outcome assuming independent failure times among subjects in Cox models, many statistical methods have been proposed and studied for data from unstratified case-cohort studies (Prentice, 1986; Self & Prentice, 1988; Barlow, 1994; Chen & Lo, 1999; Chen, 2001) and stratified case-cohort designs (Borgan et al., 2000; Kulich & Lin, 2004).

The case-cohort design, among several study designs proposed for a similar purpose, is known to have the advantage that the same subcohort can be utilized for different disease outcomes (Langholz & Thomas, 1990; Wacholder et al., 1991). When more than one disease outcome per subject is of interest, the failure times from the same subject constitute multivariate failure time data, and correlations among the failure times within the same subject should be accounted for. Such multivariate failure time data are frequently encountered in biomedical studies. One interesting example is a study of the relationship between serum ferritin and coronary heart disease and stroke events in the Busselton Health Study (Cullen, 1972). In order to reduce costs and preserve stored serum, case-cohort sampling was used. It is of scientific interest to compare the effects of serum ferritin on coronary heart disease and on stroke. A subject can experience both coronary heart disease and stroke, and the times to coronary heart disease and stroke events observed from the same subject cannot be assumed to be independent. In this case, methods developed for a single disease outcome assuming univariate failure time data can no longer be directly applied.

Statistical methods which address this problem have been somewhat limited. Recently, we proposed weighted estimating equation methods for failure time data with multiple disease outcomes from case-cohort studies assuming marginal hazards models (Kang & Cai, 2009). In that paper, the generalized case-cohort design, which allows the cases outside the subcohort to be sampled rather than included in full, was considered. This design is more realistic and flexible when considering multiple disease outcomes, since not every disease outcome needs to be rare and the number of cases need not be small (Breslow & Wellner, 2007).

The main purpose of this article is to extend the study design considered in Kang & Cai (2009) to a stratified case-cohort design, to propose estimation procedures under such study designs, and to provide a detailed derivation of the asymptotic properties of the proposed estimators. The model and the estimating procedures for the regression coefficients and the cumulative baseline hazard function are presented in Section 2. The corresponding asymptotic properties are stated and proven in Section 3. A brief summary and discussion is provided in Section 4.

2 Model, study design, and estimating procedure

2.1 Model

Suppose a cohort with n subjects can be divided into L mutually exclusive strata using some information available for all the cohort members. Let T_lik be the failure time for the kth type of disease outcome (k = 1, …, K) of the ith subject (i = 1, …, n_l) within the lth stratum (l = 1, …, L). Due to right censoring, what one actually observes for the kth type of disease outcome within the lth stratum is X_lik = min(T_lik, C_lik), where C_lik is the potential censoring time. Given a p-vector of covariates Z_lik(t), T_lik and C_lik are assumed to be independent. We assume that all the time-dependent covariates in Z_lik(t) are “external”, i.e., they are not affected by the disease processes, as described by Kalbfleisch & Prentice (2002). Let Δ_lik = I(T_lik ≤ C_lik), N_lik(t) = Δ_lik I(X_lik ≤ t), and Y_lik(t) = I(X_lik ≥ t), where I(·) is an indicator function. Let λ_lik(t) denote the corresponding marginal hazards function and let τ denote the study end time.

For the kth type of disease outcome of the ith subject within the lth stratum, the marginal hazards function λ_lik(t) is assumed to be associated with the covariate Z_lik(t) by

λ_{lik} {t ∣ Z_{lik} (t)} = Y_{lik} (t) λ_{0 k} (t) e^{β_{0}^{T} Z_{lik} (t)},

(1)

where $λ_{0 k} (t)$ is an unspecified baseline hazard function for the kth disease outcome and $β_{0} = (β_{01}, \dots, β_{0 p})^{T}$ is a p × 1 vector of regression parameters.

2.2 Study designs

Let V denote the discrete random variable for indicating the stratum. We consider sampling procedures depending on V. We assume that T_lik is independent of V_ik given Z_lik(·), i.e., V_ik affects the failure time only through the covariates (Kulich & Lin, 2004).

First, we consider a direct extension of the stratified case-cohort design for a single disease outcome to multiple disease outcomes and refer to this design as the “original” stratified case-cohort design. Under the original stratified case-cohort design for multiple disease outcomes, the subcohort is selected by stratified random sampling. Specifically, for the lth stratum, we select a fixed number ñ_l of subjects from the n_l subjects in the entire cohort via simple random sampling without replacement. Thus, each subject in the lth stratum has the same probability pr(ξ_li = 1) = α̃_l = ñ_l/n_l of being selected into the subcohort, where ξ_li is the subcohort sampling indicator for the ith subject in the lth stratum. We obtain covariate measurements only on the subcohort members and all the remaining cases outside the subcohort. For the kth type of disease outcome, complete data, {X_lik, Δ_lik, Z_lik(t), 0 ≤ t ≤ X_lik, V_ik}, are available for the subcohort members (ξ_li = 1) or cases (Δ_lik = 1). Note that, for cases, information on V_ik does not need to be available. For the non-subcohort controls (ξ_li = 0 and Δ_lik = 0), only partial data, {X_lik, Δ_lik, V_ik}, are available.
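The subcohort selection step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `sample_subcohort`, the stratum labels, and the dictionary `subcohort_sizes` (holding the fixed sizes ñ_l) are hypothetical and not part of the original design description.

```python
import random

def sample_subcohort(strata, subcohort_sizes, seed=0):
    """Return the 0/1 subcohort indicators xi for every cohort member.

    strata          -- stratum label of each subject (known for the whole cohort)
    subcohort_sizes -- hypothetical dict: stratum label -> fixed subcohort size
    """
    rng = random.Random(seed)
    xi = [0] * len(strata)
    for label, size in subcohort_sizes.items():
        members = [i for i, s in enumerate(strata) if s == label]
        # Simple random sampling without replacement within the stratum.
        for i in rng.sample(members, size):
            xi[i] = 1
    return xi

# Toy cohort: 6 subjects in one stratum, 4 in the other.
strata = ["young"] * 6 + ["old"] * 4
xi = sample_subcohort(strata, {"young": 3, "old": 2})

# Realized sampling fractions (subcohort size / stratum size) per stratum.
alpha_tilde = {
    s: sum(x for x, t in zip(xi, strata) if t == s) / strata.count(s)
    for s in set(strata)
}
```

Because the subcohort size is fixed within each stratum, the indicators ξ_li are exchangeable but not independent within a stratum, which is why finite population sampling results are needed later.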

Since we consider more than one disease outcome, it may be more realistic that some of the disease outcomes are not rare or that the numbers of cases are not small. In this situation, obtaining covariate information on all the cases outside the subcohort might not be feasible. Thus, we consider a stratified case-cohort design which allows the sampling of cases outside the subcohort to differ across strata, and refer to this design as the “generalized” stratified case-cohort design.

Under the generalized stratified case-cohort design for multiple disease outcomes, sampling of the subcohort members follows the same routine as before: for the lth stratum, sampling a fixed size ñ_l subjects from n_l subjects in the entire cohort via simple random sampling without replacement. After the sampling of a subcohort, instead of sampling all the cases outside the subcohort, we allow sampling a fraction of cases for each of the disease outcomes. Specifically, for the kth type of disease (k = 1, …, K) within the lth stratum (l = 1, …, L), we select a fixed number of $m_{l}^{k}$ cases who are outside the subcohort via simple random sampling without replacement. Then, each case outside the subcohort has the same probability ${\tilde{q}}_{l k} = p r (η_{lik} = 1 ∣ Δ_{lik} = 1, ξ_{l i} = 0) = m_{l}^{k} / (n_{l}^{k} - {\tilde{n}}_{l}^{k})$ of being sampled where η_lik is the case sampling indicator, $n_{l}^{k}$ is the number of the kth type of disease cases within the lth stratum in the cohort and ${\tilde{n}}_{l}^{k}$ is the number of kth disease cases within the lth stratum in the subcohort.

Note that, due to the sampling scheme, $(η_{l 1 k}, \dots, η_{l n_{l} k})$ are correlated; however, $(η_{l 1 k}, \dots, η_{l n_{l} k})$ and $(η_{l' 1 k'}, \dots, η_{l' n_{l'} k'})$ are not correlated for k ≠ k′ or l ≠ l′. We obtain covariate measurements only on the sampled subjects. Thus complete data, {X_lik, Δ_lik, Z_lik(t), 0 ≤ t ≤ X_lik, V_ik}, are available for the subcohort members (ξ_li = 1) or sampled cases outside the subcohort (η_lik = 1). Only partial data, {X_lik, Δ_lik, V_ik}, are available for all others (ξ_li = 0 and η_lik = 0). Note that the generalized stratified case-cohort design includes the original stratified case-cohort design as a special case: if q̃_lk = 1 for all k and l, it reduces to the original stratified case-cohort design, which samples all the cases outside the subcohort. Also, if we do not consider the strata for the cohort, i.e., L = 1, then it reduces to the generalized case-cohort design considered by Kang & Cai (2009).
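The second sampling stage of the generalized design can be sketched as follows for a single disease outcome k. The function name `sample_outside_cases`, the toy indicator vectors, and the dictionary `cases_to_sample` (holding the fixed numbers of non-subcohort cases to draw per stratum) are assumptions made for illustration only.

```python
import random

def sample_outside_cases(strata, xi, delta_k, cases_to_sample, seed=1):
    """Return case-sampling indicators eta for one disease outcome k.

    Only cases (delta_k == 1) outside the subcohort (xi == 0) are eligible.
    cases_to_sample -- hypothetical dict: stratum label -> number of eligible
                       cases to draw; it must not exceed the number eligible.
    """
    rng = random.Random(seed)
    eta = [0] * len(strata)
    for label, m in cases_to_sample.items():
        pool = [i for i in range(len(strata))
                if strata[i] == label and delta_k[i] == 1 and xi[i] == 0]
        # Simple random sampling without replacement among eligible cases.
        for i in rng.sample(pool, m):
            eta[i] = 1
    return eta

strata  = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
xi      = [1, 1, 0, 0, 0, 0, 1, 0, 0, 0]   # subcohort indicators, taken as given
delta_k = [1, 0, 1, 1, 1, 0, 0, 1, 1, 0]   # case indicators for outcome k
eta = sample_outside_cases(strata, xi, delta_k, {0: 2, 1: 1})
```

In this toy cohort, stratum 0 has three eligible cases and stratum 1 has two, so the sampling fractions are 2/3 and 1/2. Setting each entry of `cases_to_sample` to the full number of eligible cases recovers the original stratified design.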

2.3 Estimation of regression parameters under the original stratified case-cohort design

For full cohort data, Wei et al. (1989) proposed the following pseudo-likelihood score equations for the estimation of the hazards regression parameter β₀:

U (β) = \sum_{i = 1}^{n} \sum_{k = 1}^{K} \int_{0}^{τ} \left\{ Z_{i k} (u) - \frac{S_{k}^{(1)} (β, u)}{S_{k}^{(0)} (β, u)} \right\} {d N}_{i k} (u),

(2)

where $S_{k}^{(d)} (β, t) = n^{- 1} \sum_{i = 1}^{n} Y_{i k} (t) Z_{i k} {(t)}^{\otimes d} e^{β^{T} Z_{i k} (t)}$ for d = 0 and 1, and a^⊗2 = aa^T, a^⊗1 = a, a^⊗0 = 1 for a vector a. Since these estimating equations do not have analytical solutions, they need to be solved iteratively, for example, by the Newton-Raphson method (Thisted, 1988).

For data from the original stratified case-cohort studies, $S_{k}^{(d)} (β, t) (d = 0, 1)$ in (2) cannot be calculated due to the incompleteness of the data. In order to handle this problem, we consider the idea of weighting the incomplete data by the inverse selection probability (Horvitz & Thompson, 1951). Specifically, we consider ${\hat{S}}_{k}^{(d)} (β, t) = n^{- 1} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ρ_{lik} (t) Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)}$ in place of $S_{k}^{(d)} (β, t)$ for d = 0 and 1 where ρ_lik(t) is a possibly time-varying weight function, incorporated to account for the sampling scheme and has the following form:

ρ_{lik} (t) = Δ_{lik} + (1 - Δ_{lik}) ξ_{l i} / {\hat{α}}_{l k} (t) where {\hat{α}}_{l k} (t) = \frac{\sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) ξ_{l i} Y_{lik} (t)}{\sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) Y_{lik} (t)} .
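To make the time-varying weight concrete, the following sketch evaluates α̂_lk(t) and ρ_lik(t) at a single time point for one stratum and one disease outcome. The function names and the toy data are hypothetical, and the sketch assumes that at least one subcohort control is still at risk at t so that the ratio is well defined.

```python
def alpha_hat(t, X, delta, xi):
    """Fraction of controls at risk at time t that belong to the subcohort."""
    at_risk = [x >= t for x in X]
    num = sum((1 - d) * s for d, s, y in zip(delta, xi, at_risk) if y)
    den = sum((1 - d) for d, y in zip(delta, at_risk) if y)
    return num / den  # assumes some control is still at risk at t

def rho(t, X, delta, xi):
    """Weights: 1 for cases, 1/alpha_hat(t) for subcohort controls, 0 otherwise."""
    a = alpha_hat(t, X, delta, xi)
    return [d + (1 - d) * s / a for d, s in zip(delta, xi)]

# Toy stratum: observed times, case indicators, subcohort indicators.
X     = [2.0, 5.0, 3.0, 7.0, 4.0, 6.0]
delta = [1, 0, 0, 0, 1, 0]
xi    = [0, 1, 0, 1, 0, 0]

a3 = alpha_hat(3.0, X, delta, xi)   # 2 subcohort controls among 4 controls at risk
w3 = rho(3.0, X, delta, xi)
```

At t = 3 four controls remain at risk and two of them are in the subcohort, so each subcohort control stands in for two controls, while every case keeps weight one.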

Then, for the estimation of β₀, we propose the following pseudo-partial-likelihood score equations $\hat{U} (β) = 0_{p \times 1}$, where

\hat{U} (β) = \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} \int_{0}^{τ} \left\{ Z_{lik} (u) - \frac{{\hat{S}}_{k}^{(1)} (β, u)}{{\hat{S}}_{k}^{(0)} (β, u)} \right\} {d N}_{lik} (u),

(3)

A time-invariant version of the weight function which uses α̃_l in place of α̂_lk(t) may also be considered.

The solution to $\hat{U} (β) = 0_{p \times 1}$ is defined to be the estimator of the hazards regression parameter β₀. We will denote the estimator which uses time-invariant weight functions as β̂_I and the one which uses time-varying weight functions as β̂_II. The corresponding pseudo-partial-likelihood score functions will be denoted by Û_I(β) and Û_II(β), respectively.

2.4 Estimation of regression parameters under the generalized stratified case-cohort design

For the generalized stratified case-cohort design, the weight function needs to be modified to appropriately account for the sampling of cases outside the subcohort. Specifically, cases outside the subcohort who are sampled are weighted by ${\hat{q}}_{l k}^{- 1} (t)$ where q̂_lk(t) denotes the number of sampled non-subcohort cases with the kth type of disease outcome in the lth stratum divided by the number of non-subcohort cases with the kth type of disease outcome in the lth stratum remaining in the risk set at time t. Then, the weight function ω_lik(t) has the following form:

ω_{lik} (t) = (1 - Δ_{lik}) \frac{ξ_{l i}}{{\hat{α}}_{l k} (t)} + Δ_{lik} ξ_{l i} + Δ_{lik} (1 - ξ_{l i}) \frac{η_{lik}}{{\hat{q}}_{l k} (t)} where {\hat{q}}_{l k} (t) = \frac{\sum_{i = 1}^{n_{l}} Δ_{lik} (1 - ξ_{l i}) η_{lik} Y_{lik} (t)}{\sum_{i = 1}^{n_{l}} Δ_{lik} (1 - ξ_{l i}) Y_{lik} (t)} .

Note that the proposed weight functions reduce to the ones for the original stratified case-cohort design if all cases outside the subcohort are sampled since q̃_lk = 1 for all k and l.
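A small numerical sketch of the generalized weight ω_lik(t) is given below. The function name `omega` and the toy data are hypothetical, and the code assumes that at time t at least one subcohort control and at least one sampled non-subcohort case remain at risk.

```python
def omega(t, X, delta, xi, eta):
    """Evaluate the generalized case-cohort weights at a single time t."""
    Y = [1 if x >= t else 0 for x in X]
    # alpha_hat(t): subcohort fraction among controls at risk at t.
    a_hat = (sum((1 - d) * s * y for d, s, y in zip(delta, xi, Y))
             / sum((1 - d) * y for d, y in zip(delta, Y)))
    # q_hat(t): sampled fraction among non-subcohort cases at risk at t.
    q_hat = (sum(d * (1 - s) * e * y for d, s, e, y in zip(delta, xi, eta, Y))
             / sum(d * (1 - s) * y for d, s, y in zip(delta, xi, Y)))
    return [(1 - d) * s / a_hat + d * s + d * (1 - s) * e / q_hat
            for d, s, e in zip(delta, xi, eta)]

X     = [2.0, 5.0, 3.0, 7.0, 4.0, 6.0, 8.0, 9.0]
delta = [1, 0, 0, 0, 1, 0, 1, 1]      # case indicators
xi    = [0, 1, 0, 1, 0, 0, 1, 0]      # subcohort indicators
eta   = [1, 0, 0, 0, 0, 0, 0, 1]      # sampled non-subcohort cases

w = omega(1.0, X, delta, xi, eta)     # everyone is at risk at t = 1
```

With the time-varying versions of α̂ and q̂, the weights of the subjects at risk add up to the number of cohort members at risk (8 in this toy example), which is one way to see that the weighted sums mimic their full-cohort counterparts.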

For the estimation of β₀ under the generalized stratified case-cohort design, the following weighted estimating function with the weight function ω_lik(t) is considered:

\tilde{U} (β) = \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} \int_{0}^{τ} ω_{lik} (u) \left\{ Z_{lik} (u) - \frac{{\tilde{S}}_{k}^{(1)} (β, u)}{{\tilde{S}}_{k}^{(0)} (β, u)} \right\} {d N}_{lik} (u),

(4)

where ${\tilde{S}}_{k}^{(d)} (β, t) = n^{- 1} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ω_{lik} (t) Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)}$ for d = 0 and 1.

The solution to the equations $\tilde{U} (β) = 0_{p \times 1}$ is the estimator of the hazards regression parameter β₀. We will denote the estimator which uses time-invariant weight functions as β̃_I and the one which uses time-varying weight functions as β̃_II. The corresponding weighted estimating functions are Ũ_I(β) and Ũ_II(β), respectively.
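As an illustration of how a weighted estimating equation of this type can be solved by Newton-Raphson, the sketch below handles the simplest setting: a single time-fixed scalar covariate, time-invariant weights, and a common β across outcomes. The record layout `(k, x, delta, z, w)`, the function names, and the toy data are assumptions for this example and are not the authors' implementation.

```python
import math

def score_and_info(beta, records):
    """Weighted score U(beta) and minus its derivative for scalar beta.

    records -- list of (k, x, delta, z, w): outcome label, observed time,
               case indicator, covariate, time-invariant sampling weight.
    """
    U, info = 0.0, 0.0
    for (k, x, d, z, w) in records:
        if d == 0 or w == 0:
            continue                      # only sampled cases contribute jumps
        s0 = s1 = s2 = 0.0
        for (k2, x2, _, z2, w2) in records:
            if k2 == k and x2 >= x:       # same outcome, still at risk
                r = w2 * math.exp(beta * z2)
                s0 += r
                s1 += r * z2
                s2 += r * z2 * z2
        zbar = s1 / s0
        U += w * (z - zbar)
        info += w * (s2 / s0 - zbar * zbar)
    return U, info

def solve_beta(records, beta=0.0, tol=1e-12, max_iter=50):
    """Newton-Raphson iteration; assumes info stays positive along the path."""
    for _ in range(max_iter):
        U, info = score_and_info(beta, records)
        step = U / info
        beta += step
        if abs(step) < tol:
            break
    return beta

# Toy data: weight 1 for cases, 2 for sampled controls, 0 for unsampled ones.
records = [
    (1, 1.0, 1, 1, 1), (1, 2.0, 1, 0, 1), (1, 3.0, 0, 1, 2), (1, 4.0, 1, 1, 1),
    (1, 5.0, 1, 0, 1), (1, 6.0, 0, 0, 0), (1, 7.0, 1, 1, 1), (1, 8.0, 1, 0, 1),
    (2, 2.0, 1, 0, 1), (2, 3.0, 0, 1, 2), (2, 5.0, 1, 1, 1), (2, 6.0, 1, 0, 1),
]
beta_hat = solve_beta(records)
```

Risk sets are formed separately for each outcome while the score is pooled over outcomes, which mirrors the marginal model with a shared regression parameter. Time-varying weights would require recomputing the weights at every event time inside the inner loop.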

2.5 Estimation of the cumulative baseline hazard function

Let $Λ_{0 k} (t)$ denote the cumulative baseline hazard function for the kth type of disease outcome at time t, where $Λ_{0 k} (t) = \int_{0}^{t} λ_{0 k} (u) d u$. Then, to estimate $Λ_{0 k} (t)$, we consider the following Breslow-Aalen type estimators: ${\hat{Λ}}_{0 k} (\hat{β}, t)$ for the original stratified case-cohort design and ${\tilde{Λ}}_{0 k} (\tilde{β}, t)$ for the generalized stratified case-cohort design, where

{\hat{Λ}}_{0 k} (β, t) = \int_{0}^{t} \frac{\sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ρ_{lik} (u) {d N}_{lik} (u)}{n {\hat{S}}_{k}^{(0)} (β, u)} and

(5)

{\tilde{Λ}}_{0 k} (β, t) = \int_{0}^{t} \frac{\sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ω_{lik} (u) {d N}_{lik} (u)}{n {\tilde{S}}_{k}^{(0)} (β, u)},

(6)

respectively.

We, again, use the superscript $I ({\hat{Λ}}_{0 k}^{I} (\cdot, t), {\tilde{Λ}}_{0 k}^{I} (\cdot, t))$ and $II ({\hat{Λ}}_{0 k}^{I I} (\cdot, t), {\tilde{Λ}}_{0 k}^{I I} (\cdot, t))$ to denote the estimator using time-invariant and time-varying weight functions, respectively.
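The weighted Breslow-Aalen type estimator can be sketched as follows for one disease outcome with a scalar time-fixed covariate and time-invariant weights. The function name `breslow` and the toy data are hypothetical. The denominator is the weighted sum of risk scores over subjects at risk, which corresponds to n times the weighted S^(0) in (5) and (6).

```python
import math

def breslow(t, beta, X, delta, Z, w):
    """Weighted Breslow-Aalen type estimate of the cumulative baseline hazard."""
    total = 0.0
    for x_i, d_i, w_i in zip(X, delta, w):
        if d_i == 1 and x_i <= t and w_i > 0:
            # Weighted sum of risk scores over subjects at risk at x_i.
            denom = sum(w_j * math.exp(beta * z_j)
                        for x_j, z_j, w_j in zip(X, Z, w) if x_j >= x_i)
            total += w_i / denom
    return total

# Sanity check: with beta = 0 and unit weights this is the Nelson-Aalen estimator.
X, delta, Z, w = [1.0, 2.0, 3.0], [1, 1, 1], [0.0, 1.0, 0.0], [1.0, 1.0, 1.0]
na = breslow(3.0, 0.0, X, delta, Z, w)   # 1/3 + 1/2 + 1
```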

3 Asymptotic properties

We will focus on the asymptotic properties of the estimators under the generalized stratified case-cohort design, β̃_I, β̃_II, ${\tilde{Λ}}_{0 k}^{I} (\cdot, t)$ , and ${\tilde{Λ}}_{0 k}^{I I} (\cdot, t)$ . This is because the estimators under the original stratified case-cohort studies, β̂_I, β̂_II, ${\hat{Λ}}_{0 k}^{I} (\cdot, t)$ , and ${\hat{Λ}}_{0 k}^{I I} (\cdot, t)$ , are special cases of those under the generalized stratified case-cohort studies. Thus, their asymptotic properties follow directly from those under the generalized stratified case-cohort studies.

3.1 Conditions

In order to establish the consistency and asymptotic normality of the estimators for the generalized stratified case-cohort studies, the following sets of conditions are needed:

(A) (T_li, C_li, Z_li), i = 1, …, n_l and l = 1, …, L are independent and identically distributed where T_li = (T_li₁, …, T_liK)^T, C_li = (C_li₁, …, C_liK)^T and Z_li = (Z_li₁, …, Z_liK)^T.
(B) pr{Y_lik(τ) > 0} > 0 for all i = 1, …, n_l, k = 1, …, K and l = 1, …, L.
(C) $∣ Z_{likj} (0) ∣ + \int_{0}^{τ} ∣ {d Z}_{likj} (u) ∣ < C_{z} < \infty$ almost surely for all i = 1, …, n_l, k = 1, …, K, and l = 1, …, L where Z_likj is the jth component of Z_lik and C_z is some constant.
(D) The matrix $A_{k} (β_{0}) = \int_{0}^{τ} v_{k} (β_{0}, u) s_{k}^{(0)} (β_{0}, u) λ_{0 k} (u) d u$ is positive definite.
(E) $\int_{0}^{τ} λ_{0 k} (u) d u < \infty$ , for all k = 1, …, K.
(F) There exists a neighborhood $\mathcal{B}$ of β₀ that satisfies the following conditions, as n → ∞: for all k = 1, …, K, and d = 0, 1, 2, $\sup_{t \in [0, τ],\, β \in \mathcal{B}} \| S_{k}^{(d)} (β, t) - s_{k}^{(d)} (β, t) \| \overset{p}{\to} 0$, where $E \{S_{k}^{(d)} (β, t)\} = s_{k}^{(d)} (β, t)$ are continuous functions of $β \in \mathcal{B}$, uniformly in t ∈ [0, τ], and are bounded on $\mathcal{B} \times [0, τ]$, and $s_{k}^{(0)}$ is bounded away from zero on $\mathcal{B} \times [0, τ]$.

The following additional conditions are also needed to ensure the desired asymptotic convergence of case-cohort samples:

(G) As n → ∞,
1. For all l = 1, …, L, α̃_l converges to a constant α_l ∈ (0, 1];
2. For all k = 1, …, K, and l = 1, …, L, q̃_lk converges to a constant q_lk in (0, 1].
(H) n_l/n converges to a constant p_l ∈ [0, 1] for all l = 1, …, L as n → ∞.

Here and hereafter the norms for the vector a, matrix A, and function f are defined as the following:

| | a | | = max_{i} ∣ a_{i} ∣, | | A | | = max_{i, j} ∣ A_{i j} ∣, | | f | | = sup_{t} ∣ f (t) ∣

3.2 Asymptotic properties of β̃_I and ${\tilde{Λ}}_{0 k}^{I} ({\tilde{β}}_{I}, t)$

The key component of the derivation of the asymptotic results involves a decomposition of the proposed estimating function into three asymptotically uncorrelated pieces and some negligible terms. These three components represent, respectively, the whole cohort counterpart, one arising from sampling of a subcohort, and one arising from sampling of cases outside the subcohort.

Let us provide some lemmas which will be frequently used in proving the theorems.

Lemma 1

Let W_n(t) and G_n(t) be two sequences of bounded processes. For some constant τ, assume that the following conditions (a) – (c) hold:

(a) $\sup_{0 \leq t \leq τ} \| W_{n} (t) - W (t) \| \overset{p}{\to} 0$ for some bounded process W(t),
(b) W_n(t) is monotone on [0, τ], and
(c) G_n(t) converges to a zero-mean process with continuous sample paths.

Under Conditions (a) – (c),

$\sup_{0 \leq t \leq τ} \left\| \int_{0}^{t} \{W_{n} (u) - W (u)\} {d G}_{n} (u) \right\| \overset{p}{\to} 0, \quad \sup_{0 \leq t \leq τ} \left\| \int_{0}^{t} G_{n} (u) d \{W_{n} (u) - W (u)\} \right\| \overset{p}{\to} 0.$

PROOF

Lemma 1 is given as a lemma in Lin (2000). Its proof follows from the strong embedding theorem (Shorack & Wellner, 1986, pp. 47–48), Lemma 1 of Lin et al. (2000), and the triangle inequality for the norm.

Lemma 2 is an extension of the proposition in Kulich & Lin (2000).

Lemma 2

Let ξ = (ξ₁, …, ξ_n) be a random vector containing ñ ones and n–ñ zeros, with each permutation equally likely. Let B_i(t), i = 1, …, n, be i.i.d. real-valued random processes on [0, τ] with E{B_i(t)} = μ_B (t), Var{B_i(0)} < ∞ and Var{B_i(τ)} < ∞. Let B(t) = {B₁(t), …, B_n(t)} and ξ be independent. Suppose that almost all paths of B_i(t) have finite variation. Then,

n^{- 1 / 2} \sum_{i = 1}^{n} ξ_{i} \{B_{i} (t) - μ_{B} (t)\}

converges weakly in ℓ^∞ [0, τ] to a zero-mean Gaussian process and therefore

n^{- 1} \sum_{i = 1}^{n} ξ_{i} \{B_{i} (t) - μ_{B} (t)\}

converges in probability to 0 uniformly in t.

PROOF

This lemma is an extension of the proposition in Kulich & Lin (2000). The proof follows from Hájek (1960)’s central limit theorem for finite population sampling and Example 3.6.14 of van der Vaart & Wellner (1996). Specifically, suppose first that the B_i(t)’s have nondecreasing sample paths; then the finite-dimensional convergence follows from Hájek (1960)’s central limit theorem for finite population sampling, while the tightness follows from Example 3.6.14 of van der Vaart & Wellner (1996). In the general case, since almost every path b(t) of B(t) has finite variation, b(t) can be written as $b_{1}^{*} (t) - b_{2}^{*} (t)$ , where $b_{1}^{*} (t)$ and $b_{2}^{*} (t)$ are nonnegative and nondecreasing in t. Hence $B_{i} (t) = B_{i 1}^{*} (t) - B_{i 2}^{*} (t)$ , where $B_{i 1}^{*} (t)$ and $B_{i 2}^{*} (t)$ are marginally tight since they meet the condition of Example 3.6.14 of van der Vaart & Wellner (1996). This implies that they are jointly tight. The joint finite-dimensional convergence of the normalized $n^{- 1 / 2} \sum_{i = 1}^{n} ξ_{i} \{B_{i 1}^{*} (t) - μ_{B_{i 1}^{*}} (t)\}$ and $n^{- 1 / 2} \sum_{i = 1}^{n} ξ_{i} \{B_{i 2}^{*} (t) - μ_{B_{i 2}^{*}} (t)\}$ follows again from Hájek (1960)’s central limit theorem for finite population sampling. Therefore, $n^{- 1 / 2} \sum_{i = 1}^{n} ξ_{i} \{B_{i} (t) - μ_{B_{i}} (t)\}$ converges weakly in ℓ^∞ [0, τ] to a zero-mean Gaussian process. It then follows that $n^{- 1} \sum_{i = 1}^{n} ξ_{i} \{B_{i} (t) - μ_{B_{i}} (t)\}$ converges to 0 in probability uniformly in t.

Note that ξ_li is the subcohort membership indicator and η_lik is the sampling indicator for the ith subject with the kth disease within the lth stratum outside the subcohort where both the sampling of the subcohort and the cases outside the subcohort were conducted by simple random sampling without replacement. Thus, it is clear that our ξ_li’s and η_lik’s satisfy the conditions in lemma 2.
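The uniform convergence in Lemma 2 can be checked informally by simulation. The sketch below is only a numerical illustration under assumed settings: B_i(t) = I(X_i ≥ t) with X_i drawn from a unit exponential distribution (so that μ_B(t) = e^{-t}), a fixed-size sample drawn without replacement independently of the X_i, and a finite grid of time points in place of the supremum over [0, τ]. The function name and all parameter values are hypothetical.

```python
import math
import random

def sup_weighted_deviation(n, n_tilde, grid, seed=0):
    """Max over the grid of |n^{-1} sum_i xi_i {B_i(t) - mu_B(t)}|."""
    rng = random.Random(seed)
    X = [rng.expovariate(1.0) for _ in range(n)]
    # Indices with xi_i = 1: fixed-size sample without replacement.
    sampled = rng.sample(range(n), n_tilde)
    worst = 0.0
    for t in grid:
        mu = math.exp(-t)                 # E{B_i(t)} = pr(X_i >= t)
        total = sum((1.0 if X[i] >= t else 0.0) - mu for i in sampled)
        worst = max(worst, abs(total) / n)
    return worst

grid = [0.1 * j for j in range(31)]       # time points 0.0, 0.1, ..., 3.0
small = sup_weighted_deviation(200, 50, grid)
large = sup_weighted_deviation(20000, 5000, grid)
```

With the sampling fraction held at 1/4, the deviation for the larger cohort is expected to be of order n^{-1/2}, in line with the weak convergence of the n^{-1/2}-normalized sum.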

Theorem 1

Under Conditions (A) – (H), β̃_I solving Ũ_I(β) = 0 is a consistent estimator of β₀. Also, $n^{1 / 2} ({\tilde{β}}_{I} - β_{0})$ is asymptotically normally distributed with mean zero and with variance matrix Σ_I(β₀) of the following form

A {(β_{0})}^{- 1} \sum_{l = 1}^{L} p_{l} \left\{ Q_{l} (β_{0}) + \frac{1 - α_{l}}{α_{l}} V_{l}^{I, (1)} (β_{0}) + (1 - α_{l}) \sum_{k = 1}^{K} \mathrm{pr} (Δ_{l 1 k} = 1) \left( \frac{1 - q_{l k}}{q_{l k}} \right) V_{l k}^{I, (2)} (β_{0}) \right\} A {(β_{0})}^{- 1}

where

\begin{array}{l}
A (β) = \sum_{k = 1}^{K} A_{k} (β), \quad Q_{l} (β) = E_{l} \left\{ \sum_{k = 1}^{K} M_{\tilde{z}, l 1 k} (β, τ) \right\}^{\otimes 2}, \\
V_{l}^{I, (1)} (β) = \mathrm{Var}_{l} \left\{ \sum_{k = 1}^{K} \int_{0}^{τ} (1 - Δ_{l 1 k}) R_{l 1 k} (β, u) d Λ_{0 k} (u) \right\}, \\
V_{l k}^{I, (2)} (β) = \mathrm{Var}_{l} \{ M_{\tilde{z}, l 1 k} (β, τ) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0 \}, \quad R_{lik} (β, t) = Y_{lik} (t) {\tilde{Z}}_{lik} (β, t) e^{β^{T} Z_{lik} (t)}, \\
e_{k} (β, t) = \frac{s_{k}^{(1)} (β, t)}{s_{k}^{(0)} (β, t)}, \quad v_{k} (β, t) = \frac{s_{k}^{(2)} (β, t) s_{k}^{(0)} (β, t) - s_{k}^{(1)} {(β, t)}^{\otimes 2}}{s_{k}^{(0)} {(β, t)}^{2}}, \\
A_{k} (β) = \int_{0}^{τ} v_{k} (β, u) s_{k}^{(0)} (β, u) λ_{0 k} (u) d u, \\
{\tilde{Z}}_{lik} (β, t) = Z_{lik} (t) - e_{k} (β, t), \quad M_{\tilde{z}, lik} (β, t) = \int_{0}^{t} {\tilde{Z}}_{lik} (β, u) {d M}_{lik} (β, u), \\
\text{and } M_{lik} (β, t) = N_{lik} (t) - \int_{0}^{t} Y_{lik} (u) λ_{0 k} (u) e^{β^{T} Z_{lik} (u)} d u .
\end{array}

Note that E_l, Var_l, and Cov_l denote the expectation, the variance and the covariance within the lth stratum, respectively.

We now study the asymptotic properties of ${\tilde{Λ}}_{0 k}^{I} ({\tilde{β}}_{I}, t) (k = 1, \dots, K)$ . Let $W^{I} (t) = n^{1 / 2} {[\{{\tilde{Λ}}_{01}^{I} ({\tilde{β}}_{I}, t) - Λ_{01} (t)\}, \dots, \{{\tilde{Λ}}_{0 K}^{I} ({\tilde{β}}_{I}, t) - Λ_{0 K} (t)\}]}^{T}$ and $\mathcal{W}^{I} (t) = \{\mathcal{W}_{1}^{I} (t), \dots, \mathcal{W}_{K}^{I} (t)\}^{T}$, where $\mathcal{W}^{I} (t)$ is a zero-mean Gaussian process whose covariance function between $\mathcal{W}_{j}^{I} (t_{1})$ and $\mathcal{W}_{k}^{I} (t_{2})$ (1 ≤ j, k ≤ K and 0 ≤ t₁, t₂ ≤ τ) is $φ_{j k}^{I} (t_{1}, t_{2}) (β_{0})$, where

\begin{array}{l}
φ_{j k}^{I} (t_{1}, t_{2}) (β_{0}) = \sum_{l = 1}^{L} p_{l} \Big( E_{l} \{ν_{l 1 j} (β_{0}, t_{1}) ν_{l 1 k} (β_{0}, t_{2})\} + \frac{1 - α_{l}}{α_{l}} E_{l} \{ψ_{l 1 j}^{I} (β_{0}, t_{1}) ψ_{l 1 k}^{I} (β_{0}, t_{2})\} + (1 - α_{l}) \\
\times \Big[ I (j = k) \mathrm{pr} (Δ_{l 1 k} = 1) \left( \frac{1 - q_{l k}}{q_{l k}} \right) \mathrm{Cov}_{l} \left\{ \int_{0}^{t_{1}} \frac{{d M}_{l 1 k} (u)}{s_{k}^{(0)} (β_{0}, u)}, \int_{0}^{t_{2}} \frac{{d M}_{l 1 k} (u)}{s_{k}^{(0)} (β_{0}, u)} \,\Big|\, Δ_{l 1 k} = 1, ξ_{l 1} = 0 \right\} \\
+ \mathrm{pr} (Δ_{l 1 j} = 1) \left( \frac{1 - q_{l j}}{q_{l j}} \right) \mathrm{Cov}_{l} \left\{ \int_{0}^{t_{1}} \frac{{d M}_{l 1 j} (u)}{s_{j}^{(0)} (β_{0}, u)}, r_{k} {(β_{0}, t_{2})}^{T} A {(β_{0})}^{- 1} M_{\tilde{z}, l 1 j} (β_{0}) \,\Big|\, Δ_{l 1 j} = 1, ξ_{l 1} = 0 \right\} \\
+ \mathrm{pr} (Δ_{l 1 k} = 1) \left( \frac{1 - q_{l k}}{q_{l k}} \right) \mathrm{Cov}_{l} \left\{ \int_{0}^{t_{2}} \frac{{d M}_{l 1 k} (u)}{s_{k}^{(0)} (β_{0}, u)}, r_{j} {(β_{0}, t_{1})}^{T} A {(β_{0})}^{- 1} M_{\tilde{z}, l 1 k} (β_{0}) \,\Big|\, Δ_{l 1 k} = 1, ξ_{l 1} = 0 \right\} \\
+ \sum_{m = 1}^{K} \mathrm{pr} (Δ_{l 1 m} = 1) \left( \frac{1 - q_{l m}}{q_{l m}} \right) r_{j} {(β_{0}, t_{1})}^{T} A {(β_{0})}^{- 1} \mathrm{Var}_{l} \{M_{\tilde{z}, l 1 m} (β_{0}) ∣ Δ_{l 1 m} = 1, ξ_{l 1} = 0\} \\
\times A {(β_{0})}^{- 1} r_{k} (β_{0}, t_{2}) \Big] \Big), \\
ν_{lik} (β, t) = r_{k} {(β, t)}^{T} A {(β)}^{- 1} \sum_{m = 1}^{K} M_{\tilde{z}, l i m} (β) + \int_{0}^{t} \{s_{k}^{(0)} (β, u)\}^{- 1} {d M}_{lik} (u), \\
ψ_{lik}^{I} (β, t) = r_{k} {(β, t)}^{T} A {(β)}^{- 1} \sum_{m = 1}^{K} (1 - Δ_{l i m}) \int_{0}^{τ} R_{l i m} (β, u) d Λ_{0 m} (u) \\
+ (1 - Δ_{lik}) \int_{0}^{t} \frac{Y_{lik} (u) e^{β^{T} Z_{lik} (u)}}{s_{k}^{(0)} (β, u)} d Λ_{0 k} (u), \\
\text{and } r_{k} (β, t) = - \int_{0}^{t} e_{k} (β, u) d Λ_{0 k} (u) .
\end{array}

Also, let D[0, τ]^K be a metric space consisting of right-continuous functions f(t) with left-hand limits, where f(t) = {f₁(t), …, f_K(t)}^T and f_k(t): [0, τ] → ℝ. This metric space is equipped with the uniform metric $d_{k} (f, g) = \sup_{t \in [0, τ]} \{ |f_{k} (t) - g_{k} (t)| : 1 \leq k \leq K \}$ for f, g ∈ D[0, τ]^K.

Theorem 2

Under Conditions (A) – (H), for each k = 1, …, K, ${\tilde{Λ}}_{0 k}^{I} ({\tilde{β}}_{I}, t)$ converges in probability to $Λ_{0 k} (t)$ uniformly in t ∈ [0, τ]. Also, W^I(t) converges weakly to $\mathcal{W}^{I} (t)$ in D[0, τ]^K.

Proofs of Theorems 1 and 2 can be derived from those of their time-varying counterparts, which are provided in the next subsection. A more detailed explanation is deferred to Section 3.4.

3.3 Asymptotic properties of β̃_II and ${\tilde{Λ}}_{0 k}^{I I} ({\tilde{β}}_{I I}, t)$

In order to establish the asymptotic properties of the proposed estimators with time-varying weight functions, we need the following lemma on the asymptotic properties of the time-varying sampling probability estimators α̂_lk(t) and q̂_lk(t).

Lemma 3

For all l = 1, …, L and k = 1, …, K,

(i) α̂_lk(t) and α̃_l converge to the same limit uniformly in t and
$n^{1 / 2} \{{\hat{α}}_{l k} {(t)}^{- 1} - {\tilde{α}}_{l}^{- 1}\} = \frac{1}{p_{l} {\tilde{α}}_{l} E_{l} \{(1 - Δ_{l 1 k}) Y_{l 1 k} (t)\}} n^{- 1 / 2} \left\{ \sum_{i = 1}^{n_{l}} \left( 1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}} \right) (1 - Δ_{lik}) Y_{lik} (t) \right\} + o_{p} (1) .$
(ii) q̂_lk(t) and q̃_lk converge to the same limit uniformly in t and
$n^{1 / 2} \{{\hat{q}}_{l k} {(t)}^{- 1} - {\tilde{q}}_{l k}^{- 1}\} = \frac{1}{{\tilde{q}}_{l k} p_{l} (1 - {\tilde{α}}_{l}) E_{l} \{Δ_{l 1 k} Y_{l 1 k} (t)\}} n^{- 1 / 2} \left\{ \sum_{i = 1}^{n_{l}} \left( 1 - \frac{η_{lik}}{{\tilde{q}}_{l k}} \right) Δ_{lik} (1 - ξ_{l i}) Y_{lik} (t) \right\} + o_{p} (1) .$

PROOF

For each l and k, it follows from the Taylor expansion of α̂_lk(t)⁻¹ around α̃_l,

\begin{array}{l} {\hat{α}}_{l k} {(t)}^{- 1} - {\tilde{α}}_{l}^{- 1} = - \frac{1}{α_{*} {(t)}^{2}} {{\hat{α}}_{l k} (t) - {\tilde{α}}_{l}} \\ = \frac{{\tilde{α}}_{l}}{α_{*} {(t)}^{2}} \cdot \frac{1}{\sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) Y_{lik} (t)} {\sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) Y_{lik} (t)} \end{array}

where α_* (t) is on the line segment between α̂_lk(t) and α̃_l. Then,

n^{1 / 2} {{\hat{α}}_{l k} {(t)}^{- 1} - {\tilde{α}}_{l}^{- 1}} = \frac{{\tilde{α}}_{l}}{α_{*} {(t)}^{2}} \cdot \frac{n}{n_{l}} \cdot \frac{n_{l}}{\sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) Y_{lik} (t)} n^{- 1 / 2} {\sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) Y_{lik} (t)}

By the Glivenko-Cantelli lemma, $n_{l}^{- 1} \sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) Y_{lik} (t)$ converges to $E_{l} \{(1 - Δ_{l 1 k}) Y_{l 1 k} (t)\}$ in probability uniformly in t. In view of Lemma 2, $n^{- 1 / 2} \sum_{i = 1}^{n_{l}} (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) (1 - Δ_{lik}) Y_{lik} (t)$ converges to a zero-mean Gaussian process since (1 − Δ_lik)Y_lik(t) is a bounded and monotone function of t. This implies that $n^{- 1} \sum_{i = 1}^{n_{l}} (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) (1 - Δ_{lik}) Y_{lik} (t)$ converges to 0 in probability uniformly in t and, consequently, that α̂_lk(t) and α̃_l converge to the same limit uniformly in t. This ensures that α_*(t) also converges to the same limit as α̃_l. Combining these results, it follows from Slutsky’s theorem and Condition (H) that

\begin{array}{l} n^{1 / 2} {{\hat{α}}_{l k} {(t)}^{- 1} - {\tilde{α}}_{l}^{- 1}} = \frac{1}{{\tilde{α}}_{l} p_{l} E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (t)}} n^{- 1 / 2} {\sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) Y_{lik} (t)} \\ + [\frac{{\tilde{α}}_{l}}{α_{*} {(t)}^{2}} \cdot \frac{n}{n_{l}} \cdot \frac{n_{l}}{\sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) Y_{lik} (t)} - \frac{1}{{\tilde{α}}_{l} p_{l} E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (t)}}] n^{- 1 / 2} \sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) Y_{lik} (t) \\ = \frac{1}{{\tilde{α}}_{l} p_{l} E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (t)}} n^{- 1 / 2} {\sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) Y_{lik} (t)} + o_{p} (1) . \end{array}

(ii) can be shown via similar arguments.
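For the reader's convenience, the first steps of the argument for (ii) can be sketched as follows. This display is a reconstruction from the definitions of q̂_lk(t) and q̃_lk given earlier, not a formula taken from the original proof, and q_*(t) is an assumed name for a point on the line segment between q̂_lk(t) and q̃_lk.

```latex
\begin{aligned}
{\hat{q}}_{lk}(t)^{-1} - {\tilde{q}}_{lk}^{-1}
  &= -\frac{1}{q_{*}(t)^{2}} \{{\hat{q}}_{lk}(t) - {\tilde{q}}_{lk}\} \\
  &= \frac{{\tilde{q}}_{lk}}{q_{*}(t)^{2}} \cdot
     \frac{1}{\sum_{i=1}^{n_l} Δ_{lik} (1 - ξ_{li}) Y_{lik}(t)}
     \left\{ \sum_{i=1}^{n_l} \left( 1 - \frac{η_{lik}}{{\tilde{q}}_{lk}} \right)
     Δ_{lik} (1 - ξ_{li}) Y_{lik}(t) \right\} .
\end{aligned}
```

Since the ξ_li are sampled independently of (Δ_lik, Y_lik), Lemma 2 suggests that $n_{l}^{-1} \sum_{i=1}^{n_l} Δ_{lik} (1 - ξ_{li}) Y_{lik}(t)$ converges to $(1 - α_{l}) E_{l} \{Δ_{l1k} Y_{l1k}(t)\}$ uniformly in t, which accounts for the factor $(1 - {\tilde{α}}_{l})$ in the denominator of the expansion in (ii). The remaining steps mirror those for (i).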

Now, we state the asymptotic behavior of the regression parameter estimator β̃_II in the following theorem:

Theorem 3

Under Conditions (A) – (H), β̃_II solving Ũ_II(β) = 0 is a consistent estimator of β₀. Also, $n^{1 / 2} ({\tilde{β}}_{I I} - β_{0})$ is asymptotically normally distributed with mean zero and with variance matrix Σ_II(β₀) of the following form

A {(β_{0})}^{- 1} \sum_{l = 1}^{L} p_{l} \left\{ Q_{l} (β_{0}) + \frac{1 - α_{l}}{α_{l}} V_{l}^{I I, (1)} (β_{0}) + (1 - α_{l}) \sum_{k = 1}^{K} \mathrm{pr} (Δ_{l 1 k} = 1) \left( \frac{1 - q_{l k}}{q_{l k}} \right) V_{l k}^{I I, (2)} (β_{0}) \right\} A {(β_{0})}^{- 1}

where

\begin{array}{l}
V_{l}^{I I, (1)} (β) = \mathrm{Var}_{l} \left( \sum_{k = 1}^{K} (1 - Δ_{l 1 k}) \int_{0}^{τ} \left[ R_{l 1 k} (β, u) - \frac{Y_{l 1 k} (u) E_{l} \{(1 - Δ_{l 1 k}) R_{l 1 k} (β, u)\}}{E_{l} \{(1 - Δ_{l 1 k}) Y_{l 1 k} (u)\}} \right] d Λ_{0 k} (u) \right) \text{ and} \\
V_{l k}^{I I, (2)} (β) = \mathrm{Var}_{l} \left[ M_{\tilde{z}, l 1 k} (β, τ) - \int_{0}^{τ} Y_{l 1 k} (u) \frac{E_{l} \{{d M}_{\tilde{z}, l 1 k} (β, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0\}}{E_{l} \{Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1\}} \,\Big|\, Δ_{l 1 k} = 1, ξ_{l 1} = 0 \right] .
\end{array}

PROOF

The proof of the consistency of β̃_II is based on an application of the Inverse Function Theorem in Foutz (1977). One can show β̃_II to be consistent for β₀ provided:

(i) $\frac{\partial \{n^{- 1} {\tilde{U}}_{I I} (β)\}}{\partial β^{T}}$ exists and is continuous in an open neighborhood of β₀,
(ii) $\frac{\partial \{n^{- 1} {\tilde{U}}_{I I} (β_{0})\}}{\partial β^{T}}$ is negative definite with probability going to one as n → ∞,
(iii) $- \frac{\partial \{n^{- 1} {\tilde{U}}_{I I} (β)\}}{\partial β^{T}}$ converges to A(β₀) in probability uniformly for β in an open neighborhood about β₀,
(iv) $n^{- 1} {\tilde{U}}_{I I} (β_{0}) \to 0$ in probability.

One can write

\begin{matrix} \frac{\partial {n^{- 1} {\tilde{U}}_{I I} (β)}}{\partial β^{T}} = - n^{- 1} \sum_{k = 1}^{K} \int_{0}^{τ} {\tilde{V}}_{k} (β, u) d {\sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ω_{lik}^{I I} (u) N_{lik} (u)} where \\ {\tilde{V}}_{k} (β, t) = \frac{{\tilde{S}}_{I I, k}^{(2)} (β, t) {\tilde{S}}_{I I, k}^{(0)} (β, t) - {\tilde{S}}_{I I, k}^{(1)} {(β, t)}^{\otimes 2}}{{\tilde{S}}_{I I, k}^{(0)} {(β, t)}^{2}} \end{matrix}

(7)
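The matrix Ṽ_k(β, t) in (7) is a weighted covariance of the covariates over the risk set at time t. A minimal Python sketch for a scalar covariate follows. It assumes a single failure type, a time-independent covariate, and time-fixed weights `w`; the function names `weighted_s` and `v_tilde` are hypothetical.

```python
import math


def weighted_s(beta, t, times, z, w, d):
    """n^{-1} * sum_i w_i * Y_i(t) * z_i**d * exp(beta * z_i) for scalar z.

    times : observed (possibly censored) times; Y_i(t) = 1 if times[i] >= t
    z     : scalar covariates
    w     : sampling weights (assumed time-fixed in this sketch)
    d     : order 0, 1 or 2
    """
    n = len(times)
    total = 0.0
    for time_i, z_i, w_i in zip(times, z, w):
        at_risk = 1.0 if time_i >= t else 0.0
        total += w_i * at_risk * (z_i ** d) * math.exp(beta * z_i)
    return total / n


def v_tilde(beta, t, times, z, w):
    """(S2 * S0 - S1**2) / S0**2, assuming the weighted risk set is non-empty."""
    s0 = weighted_s(beta, t, times, z, w, 0)
    s1 = weighted_s(beta, t, times, z, w, 1)
    s2 = weighted_s(beta, t, times, z, w, 2)
    return (s2 * s0 - s1 * s1) / (s0 * s0)
```

With nonnegative weights the returned value is a weighted variance and therefore nonnegative, which is the scalar counterpart of the definiteness required in condition (ii).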

Then, (i) is clearly satisfied on the basis of (7) and Condition (F). Now, following Andersen & Gill (1982),

\begin{array}{l} ‖ [- \frac{\partial {n^{- 1} {\tilde{U}}_{I I} (β)}}{\partial β^{T}}] - A (β) ‖ \leq ‖ \sum_{k = 1}^{K} \int_{0}^{τ} {{\tilde{V}}_{k} (β, u) - v_{k} (β, u)} n^{- 1} d {\sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} N_{lik} (u)} ‖ \\ + ‖ \sum_{k = 1}^{K} \int_{0}^{τ} {{\tilde{V}}_{k} (β, u) - v_{k} (β, u)} d [n^{- 1} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} {ω_{lik}^{I I} (u) - 1} N_{lik} (u)] ‖ \\ + ‖ \sum_{k = 1}^{K} \int_{0}^{τ} v_{k} (β, u) n^{- 1} d {\sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β, u)} ‖ + ‖ \sum_{k = 1}^{K} n^{- 1} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \int_{0}^{τ} {ω_{lik}^{I I} (u) - 1} v_{k} (β, u) {d M}_{lik} (β, u) ‖ \\ + ‖ \sum_{k = 1}^{K} \int_{0}^{τ} v_{k} (β, u) {{\tilde{S}}_{I I, k}^{(0)} (β, u) - s_{k}^{(0)} (β, u)} λ_{0 k} (u) d u ‖ \end{array}

(8)

Each of the terms on the right side of the above inequality can be shown to converge to zero in probability, uniformly in β ∈ ℬ. To show this for the first term on the right side of (8), we first show that

sup_{\begin{matrix} t \in [0, τ] \\ β \in B \end{matrix}} ‖ {\tilde{V}}_{k} (β, t) - v_{k} (β, t) ‖ \overset{p}{\to} 0 as n \to \infty for k = 1, \dots, K .

It suffices to show that ${sup}_{t \in [0, τ], β \in B} | | {\tilde{S}}_{I I, k}^{(d)} (β, t) - S_{k}^{(d)} (β, t) | | \overset{p}{\to} 0$ as n → ∞ for d = 0, 1 and 2. One can write

\begin{array}{l} n^{1 / 2} {{\tilde{S}}_{I I, k}^{(d)} (β, t) - S_{k}^{(d)} (β, t)} = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) (1 - Δ_{lik}) Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) Δ_{lik} (1 - ξ_{l i}) Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} \\ + n^{- 1 / 2} \sum_{l = 1}^{L} {{\hat{α}}_{l k} {(t)}^{- 1} - {\tilde{α}}_{l}^{- 1}} \sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) ξ_{l i} Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} \\ + n^{- 1 / 2} \sum_{l = 1}^{L} {{\hat{q}}_{l k} {(t)}^{- 1} - {\tilde{q}}_{l k}^{- 1}} \sum_{i = 1}^{n_{l}} Δ_{lik} (1 - ξ_{l i}) η_{lik} Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} . \end{array}

(9)

Then by lemma 3,

\begin{array}{l} (9) = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) (1 - Δ_{lik}) Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) Δ_{lik} (1 - ξ_{l i}) Y_{lik} (t) Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) \frac{Y_{lik} (t)}{p_{l} E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (t)}} \\ \times {n^{- 1} \sum_{j = 1}^{n_{l}} (1 - Δ_{ljk}) \frac{ξ_{l j}}{{\tilde{α}}_{l}} Y_{ljk} (t) Z_{ljk} {(t)}^{\otimes d} e^{β^{T} Z_{ljk} (t)}} \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (1 - \frac{η_{lik}}{{\tilde{q}}_{l k}}) (1 - ξ_{l i}) Δ_{lik} \frac{Y_{lik} (t)}{p_{l} E_{l} {Δ_{l 1 k} Y_{l 1 k} (t)}} \\ \times {n^{- 1} \sum_{j = 1}^{n_{l}} Δ_{ljk} \frac{(1 - ξ_{l j})}{(1 - {\tilde{α}}_{l})} \frac{η_{ljk}}{{\tilde{q}}_{l k}} Y_{ljk} (t) Z_{ljk} {(t)}^{\otimes d} e^{β^{T} Z_{ljk} (t)}} + o_{p} (1) \end{array}

(10)

It follows from lemma 2 that, for d = 0, 1 and 2, $n^{- 1} \sum_{j = 1}^{n_{l}} (1 - Δ_{ljk}) \frac{ξ_{l j}}{{\tilde{α}}_{l}} Y_{ljk} (t) Z_{ljk} {(t)}^{\otimes d} e^{β^{T} Z_{ljk} (t)}$ and $n^{- 1} \sum_{j = 1}^{n_{l}} Δ_{ljk} \frac{(1 - ξ_{l j})}{(1 - {\tilde{α}}_{l})} \frac{η_{ljk}}{{\tilde{q}}_{l k}} Y_{ljk} (t) Z_{ljk} {(t)}^{\otimes d} e^{β^{T} Z_{ljk} (t)}$ converge to $p_{l} E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (t) Z_{l 1 k} {(t)}^{\otimes d} e^{β^{T} Z_{l 1 k} (t)}}$ and $p_{l} p r (Δ_{l 1 k} = 1) E_{l} {Y_{l 1 k} (t) Z_{l 1 k} {(t)}^{\otimes d} e^{β^{T} Z_{l 1 k} (t)} ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}$, respectively, in probability uniformly in t under Condition (G). Thus, from (10)

\begin{array}{l} n^{1 / 2} {{\tilde{S}}_{I I, k}^{(d)} (β, t) - S_{k}^{(d)} (β, t)} \\ = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (1 - Δ_{lik}) (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) Y_{lik} (t) \\ \times [Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} - \frac{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (t) Z_{l 1 k} {(t)}^{\otimes d} e^{β^{T} Z_{l 1 k} (t)}}}{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (t)}}] \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (1 - ξ_{l i}) Δ_{lik} (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) Y_{lik} (t) [Z_{lik} {(t)}^{\otimes d} e^{β^{T} Z_{lik} (t)} \\ - \frac{E_{l} {Y_{l 1 k} (t) Z_{l 1 k} {(t)}^{\otimes d} e^{β^{T} Z_{l 1 k} (t)} ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (t) ∣ Δ_{l 1 k} = 1}}] + o_{p} (1) \end{array}

(11)

It then follows from lemma 2 that, for d = 0, 1 and 2, $n^{1 / 2} {{\tilde{S}}_{I I, k}^{(d)} (β, t) - S_{k}^{(d)} (β, t)}$ converges weakly to zero-mean Gaussian processes under Condition (G). Consequently, together with condition (F),

sup_{\begin{matrix} t \in [0, τ] \\ β \in B \end{matrix}} ‖ {\tilde{S}}_{I I, k}^{(d)} (β, t) - s_{k}^{(d)} (β, t) ‖ \overset{p}{\to} 0 as n \to \infty for d = 0, 1, and 2.

(12)

Since $s_{k}^{(0)} (β, t)$ is bounded away from zero on ℬ × [0, τ] by condition (F), it follows from the above convergence results that, for k = 1, …, K, Ṽ_k(β, t) converges to v_k(β, t) in probability uniformly in t and β.

By combining these results with the Lenglart inequality for $n^{- 1} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} N_{lik} (τ)$ (Andersen & Gill, 1982, p. 1115), it follows that the first term on the right side of (8) converges to zero in probability, uniformly in β ∈ ℬ, as n → ∞.

The second and fourth terms on the right side of (8) can be shown to converge to zero by applying lemma 2. The third term can be shown to converge to zero by the Lenglart inequality for $\sum_{k = 1}^{K} \int_{0}^{τ} v_{k} (β, t) n^{- 1} d \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (t)$ (Andersen & Gill, 1982, p. 1115).

Conditions (D), (E) and (F) ensure the boundedness of $sup_{t, β} {v_{k} (β, t)}_{j j'}$ and $Λ_{0 k} (τ)$ for k = 1, …, K and j, j′ = 1, …, p. Thus, together with the uniform convergence of ${\tilde{S}}_{I I, k}^{(0)} (β, t)$ to $s_{k}^{(0)} (β, t)$ in probability, the last term on the right side of (8) converges to zero in probability, uniformly in β ∈ ℬ, as n → ∞. Hence,

- \frac{\partial {n^{- 1} {\tilde{U}}_{I I} (β)}}{\partial β^{T}} \overset{p}{\to} A (β) as n \to \infty uniformly in β \in B

and, thus, (ii) and (iii) are satisfied.

For (iv), we will show that n^{−1/2}Ũ_II(β₀) is asymptotically equivalent to

\begin{array}{l} n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} M_{\tilde{z}, lik} (β_{0}, τ) + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} (1 - Δ_{lik}) (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) \int_{0}^{τ} [R_{lik} (β_{0}, u) - Y_{lik} (u) \\ \times \frac{E_{l} {(1 - Δ_{l 1 k}) R_{l 1 k} (β_{0}, u)}}{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u)}}] d Λ_{0 k} (u) + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} Δ_{lik} (1 - ξ_{l i}) (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) \\ \times [M_{\tilde{z}, lik} (β_{0}, τ) - \int_{0}^{τ} Y_{lik} (u) \frac{E_{l} {d M_{\tilde{z}, l 1 k} (β_{0}, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}}] . \end{array}

(13)

Specifically, one can decompose n^{−1/2}Ũ_II(β₀) into the following four parts:

\begin{array}{l} n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} M_{\tilde{z}, lik} (β_{0}, τ) + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} \int_{0}^{τ} {e_{k} (β_{0}, u) - \frac{{\tilde{S}}_{I I, k}^{(1)} (β_{0}, u)}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)}} {d M}_{lik} (β_{0}, u) \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} \int_{0}^{τ} {ω_{lik}^{I I} (u) - 1} {\tilde{Z}}_{lik} (β_{0}, u) {d M}_{lik} (β_{0}, u) \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} \int_{0}^{τ} {ω_{lik}^{I I} (u) - 1} {e_{k} (β_{0}, u) - \frac{{\tilde{S}}_{I I, k}^{(1)} (β_{0}, u)}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)}} {d M}_{lik} (β_{0}, u) \end{array}

(14)

The second term on the right-hand side of (14) can be shown to converge to zero uniformly in t. Note that, for fixed t, $n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, t)$ is a sum of i.i.d. zero-mean random variables. Based on Conditions (C) and (E), M_lik(β₀, t) is of bounded variation and can therefore be written as a difference of two monotone functions in t. It then follows from Example 2.11.16 of van der Vaart & Wellner (1996, p. 215) that $n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, t)$ converges weakly to a zero-mean Gaussian process, say $W_{M k} (t)$. It can be shown that $E [{W_{M k} (t) - W_{M k} (s)}^{4}] \leq C {Λ_{0 k} (t) - Λ_{0 k} (s)}^{2}$ for some constant C > 0. Specifically, $E [{W_{M k} (t) - W_{M k} (s)}^{4}] = 3 {(E [{W_{M k} (t) - W_{M k} (s)}^{2}])}^{2}$ since $W_{M k} (t) - W_{M k} (s)$ is a zero-mean normal random variable for fixed s and t. Then $E [{W_{M k} (t) - W_{M k} (s)}^{2}] = E {W_{M k} {(t)}^{2}} + E {W_{M k} {(s)}^{2}} - 2 E {W_{M k} (t) W_{M k} (s)} = E {W_{M k} {(t)}^{2}} - E {W_{M k} {(s)}^{2}}$ for s ≤ t. Since $E {W_{M k} {(t)}^{2}} = E {M_{lik} {(β_{0}, t)}^{2}} = E {\int_{0}^{t} Y_{lik} (u) e^{β_{0}^{T} Z_{lik} (u)} λ_{0 k} (u) d u}$, we have $E [{W_{M k} (t) - W_{M k} (s)}^{2}] = E {\int_{s}^{t} Y_{lik} (u) e^{β_{0}^{T} Z_{lik} (u)} λ_{0 k} (u) d u} \leq e^{C_{z}} E {\int_{s}^{t} λ_{0 k} (u) d u} = {\tilde{C}}_{z} {Λ_{0 k} (t) - Λ_{0 k} (s)}$ by the boundedness condition (C). Since $Λ_{0 k} (·)$ is differentiable and $λ_{0 k} (·)$ is bounded on [0, τ], there exists a constant M such that $Λ_{0 k} (t) - Λ_{0 k} (s) \leq M (t - s)$ for s ≤ t. Therefore, $E [{W_{M k} (t) - W_{M k} (s)}^{2}] \leq C_{z}^{*} (t - s)$ and $E [{W_{M k} (t) - W_{M k} (s)}^{4}] = 3 {(E [{W_{M k} (t) - W_{M k} (s)}^{2}])}^{2} \leq {\tilde{C}}_{z}^{*} {(t - s)}^{2}$ for some constants $C_{z}^{*}$ and ${\tilde{C}}_{z}^{*}$. Then, by the Kolmogorov-Čentsov Theorem (Karatzas & Shreve, 1988, p. 53), $W_{M k} (t)$ has continuous sample paths.
In addition, since ${\tilde{S}}_{I I, k}^{(1)} (β_{0}, t) / {\tilde{S}}_{I I, k}^{(0)} (β_{0}, t)$ is of bounded variation based on (12) and Conditions (C) and (F), we can write ${\tilde{S}}_{I I, k}^{(1)} (β_{0}, t) / {\tilde{S}}_{I I, k}^{(0)} (β_{0}, t) = Z_{k 1}^{*} (t) - Z_{k 2}^{*} (t)$ where both $Z_{k 1}^{*} (t)$ and $Z_{k 2}^{*} (t)$ are nonnegative, monotone in t and bounded. Therefore, ${\tilde{S}}_{I I, k}^{(1)} (β_{0}, t) / {\tilde{S}}_{I I, k}^{(0)} (β_{0}, t)$ is a sum of two monotone functions. Hence, it follows from lemma 1 that the second term on the right-hand side of (14) converges to 0 uniformly in t.
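The continuity argument above relies on the fact that a zero-mean normal variable X satisfies E(X⁴) = 3{E(X²)}², applied to the increment of the limiting Gaussian process. The following Python sketch checks this identity by simulation. It is only an illustration: the function name `gaussian_moments`, the number of draws, and the seed are arbitrary choices and not part of the proof.

```python
import random


def gaussian_moments(sigma, n_draws=200_000, seed=0):
    """Monte Carlo estimates of E(X^2) and E(X^4) for X ~ N(0, sigma^2)."""
    rng = random.Random(seed)
    m2 = 0.0
    m4 = 0.0
    for _ in range(n_draws):
        x = rng.gauss(0.0, sigma)
        m2 += x * x
        m4 += x ** 4
    return m2 / n_draws, m4 / n_draws


if __name__ == "__main__":
    second, fourth = gaussian_moments(0.7)
    # The ratio should be close to 1 if E(X^4) = 3 * E(X^2)^2 holds.
    print(fourth / (3.0 * second * second))
```

Because the second moment of the increment is bounded by a constant times (t − s), this identity is what turns a bound on the variance into the fourth-moment bound of order (t − s)² needed for the Kolmogorov-Čentsov criterion.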

By similar arguments, the fourth term on the right-hand side of (14) can be shown to converge to 0 uniformly in t.

The third term on the right-hand side of (14) can be further decomposed as

\begin{array}{l} n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} (1 - Δ_{lik}) (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) M_{\tilde{z}, lik} (β_{0}, τ) \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} (1 - Δ_{lik}) ξ_{l i} \int_{0}^{τ} {{\hat{α}}_{l k}^{- 1} (u) - {\tilde{α}}_{l}^{- 1}} {\tilde{Z}}_{lik} (β_{0}, u) {d M}_{lik} (β_{0}, u) \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} Δ_{lik} (1 - ξ_{l i}) (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) M_{\tilde{z}, lik} (β_{0}, τ) \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} Δ_{lik} (1 - ξ_{l i}) η_{lik} \int_{0}^{τ} {{\hat{q}}_{l k}^{- 1} (u) - {\tilde{q}}_{l k}^{- 1}} {\tilde{Z}}_{lik} (β_{0}, u) {d M}_{lik} (β_{0}, u) \end{array}

(15)

The second term on the right side of (15) is asymptotically equivalent to

n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} (1 - Δ_{lik}) (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) \int_{0}^{τ} Y_{lik} (u) \frac{E_{l} {(1 - Δ_{l 1 k}) R_{l 1 k} (β_{0}, u)}}{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u)}} d Λ_{0 k} (u)

(16)

by (i) in lemma 3 and applying lemma 2. Likewise, by (ii) in lemma 3 and applying lemma 2, the last term on the right side of (15) is asymptotically equivalent to

n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} Δ_{lik} (1 - ξ_{l i}) (1 - \frac{η_{lik}}{{\tilde{q}}_{l k}}) \int_{0}^{τ} Y_{lik} (u) \frac{E_{l} {d M_{\tilde{z}, l 1 k} (β_{0}, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}}

(17)

By combining (16) and (17), (15) is asymptotically equivalent to

\begin{array}{l} n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} (1 - Δ_{lik}) (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) \int_{0}^{τ} [R_{lik} (β_{0}, u) - Y_{lik} (u) \frac{E_{l} {(1 - Δ_{l 1 k}) R_{l 1 k} (β_{0}, u)}}{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u)}}] d Λ_{0 k} (u) \\ + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} Δ_{lik} (1 - ξ_{l i}) (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) \\ \times [M_{\tilde{z}, lik} (β_{0}, τ) - \int_{0}^{τ} Y_{lik} (u) \frac{E_{l} {d M_{\tilde{z}, l 1 k} (β_{0}, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}}] \end{array}

(18)

Combining the above results, we have shown that n^{−1/2}Ũ_II(β₀) is asymptotically equivalent to (13). Under the regularity conditions, the first term on the right-hand side of (13) is asymptotically zero-mean normal with covariance matrix $\sum_{l = 1}^{L} p_{l} Q_{l} (β_{0})$ where $Q_{l} (β_{0}) = E {\sum_{k = 1}^{K} M_{\tilde{z}, l 1 k} (β_{0}, τ)}^{\otimes 2}$ by Spiekerman & Lin (1998). By lemma 2, the second and third terms on the right-hand side of (13) can be shown to be asymptotically zero-mean normal with covariance matrices $\sum_{l = 1}^{L} p_{l} \frac{1 - α_{l}}{α_{l}} V_{l}^{I I, (1)} (β_{0})$ and $\sum_{l = 1}^{L} p_{l} (1 - α_{l}) \sum_{k = 1}^{K} p r (Δ_{l 1 k} = 1) (\frac{1 - q_{l k}}{q_{l k}}) V_{l k}^{I I, (2)} (β_{0})$, respectively. It follows from conditional expectation arguments that these three terms are mutually independent. Therefore, n^{−1/2}Ũ_II(β₀) is asymptotically normally distributed with mean zero and finite variance

\sum_{l = 1}^{L} p_{l} {Q_{l} (β_{0}) + \frac{1 - α_{l}}{α_{l}} V_{l}^{I I, (1)} (β_{0}) + (1 - α_{l}) \sum_{k = 1}^{K} p r (Δ_{l 1 k} = 1) (\frac{1 - q_{l k}}{q_{l k}}) V_{l k}^{I I, (2)} (β_{0})} .

Hence n⁻¹Ũ_II(β₀) converges to zero in probability. Thus, (iv) is satisfied.

By (i), (ii), (iii) and (iv), it follows from Theorem 2 of Foutz (1977) that there exists a unique sequence β̃_II such that n⁻¹Ũ_II(β̃_II) = 0 with probability converging to one as n → ∞, and that β̃_II converges in probability to β₀.

The asymptotic normality of β̃_II follows from the consistency of β̃_II and a Taylor series expansion of Ũ_II (β). This completes the proof.
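The Taylor expansion used for asymptotic normality is the same linearization that drives a Newton-Raphson solver for a weighted score equation. The sketch below is a simplified illustration and not the authors' implementation: it assumes one failure type, a scalar time-independent covariate, and time-fixed weights, and the function names are hypothetical.

```python
import math


def score_and_information(beta, times, status, z, w):
    """Weighted score U(beta) and minus its derivative for scalar beta."""
    u = 0.0
    info = 0.0
    for t_i, d_i, z_i, w_i in zip(times, status, z, w):
        if d_i == 0 or w_i == 0:
            continue  # only sampled failures contribute
        s0 = s1 = s2 = 0.0
        for t_j, z_j, w_j in zip(times, z, w):
            if t_j >= t_i:  # subject j is at risk at time t_i
                r = w_j * math.exp(beta * z_j)
                s0 += r
                s1 += r * z_j
                s2 += r * z_j * z_j
        zbar = s1 / s0
        u += w_i * (z_i - zbar)
        info += w_i * (s2 / s0 - zbar * zbar)
    return u, info


def solve_weighted_score(times, status, z, w, beta=0.0, tol=1e-10, max_iter=50):
    """Newton-Raphson iterations; assumes the information stays positive."""
    for _ in range(max_iter):
        u, info = score_and_information(beta, times, status, z, w)
        if abs(u) < tol:
            break
        beta += u / info  # one-term Taylor expansion of U around beta
    return beta
```

No step-halving or other safeguard is included, so the sketch presumes a well-behaved score with a finite root.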

The asymptotic properties of ${\tilde{Λ}}_{0 k}^{I I} ({\tilde{β}}_{I I}, t)$ (k = 1, …, K) are summarized in the following theorem.

Theorem 4

Under Conditions (A) – (H), for each k = 1, …, K, ${\tilde{Λ}}_{0 k}^{I I} ({\tilde{β}}_{I I}, t)$ converges in probability to $Λ_{0 k} (t)$ uniformly in t ∈ [0, τ]. Also, $W^{I I} (t) = n^{1 / 2} {[{{\tilde{Λ}}_{01}^{I I} ({\tilde{β}}_{I I}, t) - Λ_{01} (t)}, \dots, {{\tilde{Λ}}_{0 K}^{I I} ({\tilde{β}}_{I I}, t) - Λ_{0 K} (t)}]}^{T}$ converges weakly to a zero-mean Gaussian process $\mathcal{W}^{I I} (t) = {\mathcal{W}_{1}^{I I} (t), \dots, \mathcal{W}_{K}^{I I} (t)}^{T}$ in D[0, τ]^K. The covariance function between $\mathcal{W}_{j}^{I I} (t_{1})$ and $\mathcal{W}_{k}^{I I} (t_{2})$ is

\begin{array}{l} φ_{j k}^{I I} (t_{1}, t_{2}) (β_{0}) = \sum_{l = 1}^{L} p_{l} (E_{l} {ν_{l 1 j} (β_{0}, t_{1}) ν_{l 1 k} (β_{0}, t_{2})} + \frac{1 - α_{l}}{α_{l}} E_{l} {ψ_{l 1 j}^{I I} (β_{0}, t_{1}) ψ_{l 1 k}^{I I} (β_{0}, t_{2})} + (1 - α_{l}) \\ \times [I (j = k) p r (Δ_{l 1 k} = 1) (\frac{1 - q_{l k}}{q_{l k}}) {Cov}_{l} {ζ_{l 1 k}^{(1)} (β_{0}, t_{1}), ζ_{l 1 k}^{(1)} (β_{0}, t_{2}) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0} \\ + p r (Δ_{l 1 j} = 1) (\frac{1 - q_{l j}}{q_{l j}}) {Cov}_{l} {ζ_{l 1 j}^{(1)} (β_{0}, t_{1}), r_{k} {(β_{0}, t_{2})}^{T} A {(β_{0})}^{- 1} ζ_{l 1 j}^{(2)} (β_{0}, t_{2}) ∣ Δ_{l 1 j} = 1, ξ_{l 1} = 0} \\ + p r (Δ_{l 1 k} = 1) (\frac{1 - q_{l k}}{q_{l k}}) {Cov}_{l} {ζ_{l 1 k}^{(1)} (β_{0}, t_{2}), r_{j} {(β_{0}, t_{1})}^{T} A {(β_{0})}^{- 1} ζ_{l 1 k}^{(2)} (β_{0}, t_{1}) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0} \\ + \sum_{m = 1}^{K} p r (Δ_{l 1 m} = 1) (\frac{1 - q_{l m}}{q_{l m}}) \\ \times r_{j} {(β_{0}, t_{1})}^{T} A {(β_{0})}^{- 1} {Cov}_{l} {ζ_{l 1 m}^{(2)} (β_{0}, t_{1}), ζ_{l 1 m}^{(2)} (β_{0}, t_{2}) ∣ Δ_{l 1 m} = 1, ξ_{l 1} = 0} A {(β_{0})}^{- 1} r_{k} (β_{0}, t_{2})]) . \end{array}

where

\begin{array}{l} ζ_{lik}^{(1)} (β, t) = \int_{0}^{t} \frac{1}{s_{k}^{(0)} (β, u)} [{d M}_{lik} (β, u) - Y_{lik} (u) \frac{E_{l} {{d M}_{l 1 k} (β, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}}] and \\ ζ_{lik}^{(2)} (β, t) = M_{\tilde{z}, lik} (β, τ) - \int_{0}^{t} Y_{lik} (u) \frac{E_{l} {{d M}_{\tilde{z}, l 1 k} (β, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}} . \end{array}
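For orientation, a weighted Breslow-Aalen type estimator of the cumulative baseline hazard can be sketched in Python as follows. This is a simplified illustration: it assumes a single failure type, a scalar time-independent covariate, and time-fixed weights in place of the time-varying weights ω_lik^II(t). The function name `weighted_breslow` is hypothetical.

```python
import math


def weighted_breslow(beta, t, times, status, z, w):
    """Weighted Breslow-Aalen type estimate of the cumulative baseline hazard at t.

    times  : observed times (failure or censoring)
    status : 1 for an observed failure, 0 for censoring
    z      : scalar covariates
    w      : sampling weights (0 for subjects that were not sampled)
    """
    total = 0.0
    for time_i, status_i, w_i in zip(times, status, w):
        if status_i == 1 and time_i <= t and w_i > 0:
            # Weighted risk-set sum at the failure time of subject i;
            # it includes subject i, so it is positive here.
            denom = sum(
                w_j * math.exp(beta * z_j)
                for time_j, z_j, w_j in zip(times, z, w)
                if time_j >= time_i
            )
            total += w_i / denom
    return total
```

With unit weights and β = 0 the function reduces to the Nelson-Aalen estimator, which gives a simple way to check an implementation.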

PROOF

One can make the following decomposition

\begin{array}{l} n^{1 / 2} {{\tilde{Λ}}_{0 k}^{I I} ({\tilde{β}}_{I I}, t) - Λ_{0 k} (t)} \\ = n^{1 / 2} \int_{0}^{t} {\frac{1}{n {\tilde{S}}_{I I, k}^{(0)} ({\tilde{β}}_{I I}, u)} - \frac{1}{n {\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)}} d {\sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, u)} \\ + n^{1 / 2} \int_{0}^{t} {\frac{1}{n {\tilde{S}}_{I I, k}^{(0)} ({\tilde{β}}_{I I}, u)} - \frac{1}{n {\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)}} d [\sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} {ω_{lik}^{I I} (u) - 1} M_{lik} (β_{0}, u)] \\ + n^{1 / 2} \int_{0}^{t} {\frac{1}{{\tilde{S}}_{I I, k}^{(0)} ({\tilde{β}}_{I I}, u)} - \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)}} {\tilde{S}}_{I I, k}^{(0)} (β_{0}, u) d Λ_{0 k} (u) + \int_{0}^{t} \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)} d {n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, u)} \\ + \int_{0}^{t} \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)} d [n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} {ω_{lik}^{I I} (u) - 1} M_{lik} (β_{0}, u)] \end{array}

(19)

By the Taylor expansion of ${\tilde{S}}_{I I, k}^{(0)} {({\tilde{β}}_{I I}, u)}^{- 1}$ around β₀, the first term on the right-hand side of (19) is equivalent to

\int_{0}^{t} {- \frac{{\tilde{S}}_{I I, k}^{(1)} {(β^{*}, u)}^{T}}{{\tilde{S}}_{I I, k}^{(0)} {(β^{*}, u)}^{2}}} ({\tilde{β}}_{I I} - β_{0}) d {n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, u)}

(20)

where β^* is on the line segment between β̃_II and β₀. Then, as n → ∞, (20) converges to 0 uniformly in t in probability by lemma 1 since ${\tilde{S}}_{I I, k}^{(1)} (β, u) / {\tilde{S}}_{I I, k}^{(0)} (β, u)$ is of bounded variation, β̃_II is consistent for β₀, and $n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, u)$ converges weakly to a zero-mean Gaussian process with continuous sample path. The second term can be shown to converge to 0 uniformly in t in probability by similar arguments.

Again, it follows from the Taylor expansion of ${\tilde{S}}_{I I, k}^{(0)} {({\tilde{β}}_{I I}, u)}^{- 1}$ around β₀, the uniform convergence of ${\tilde{S}}_{I I, k}^{(1)} (β, u)$ and ${\tilde{S}}_{I I, k}^{(0)} (β, u)$, the consistency of β̃_II for β₀ and the boundedness of $Λ_{0 k} (t)$ on [0, τ] that the third term is asymptotically equivalent to

n^{1 / 2} r_{k} {(β_{0}, t)}^{T} ({\tilde{β}}_{I I} - β_{0}) .

The fourth term can be shown to be asymptotically equivalent to

\int_{0}^{t} \frac{1}{s_{k}^{(0)} (β_{0}, u)} d {n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, u)}

by lemma 1 since ${\tilde{S}}_{I I, k}^{(0)} (β, u)$ converges to $s_{k}^{(0)} (β, u)$ uniformly in t and $n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} M_{lik} (β_{0}, u)$ converges weakly to a zero-mean Gaussian process with continuous sample path. For the last term on the right-hand side of (19), it follows from lemma 3, and the uniform convergence of ${\tilde{S}}_{I I, k}^{(0)} {(β_{0}, t)}^{- 1}$ to $s_{k}^{(0)} {(β_{0}, t)}^{- 1}$ , where $s_{k}^{(0)} (β_{0}, t)$ is bounded away from 0 that

\begin{array}{l} \int_{0}^{t} \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)} d [n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} {ω_{lik}^{I I} (u) - 1} M_{lik} (β_{0}, u)] \\ = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} [(1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) \int_{0}^{t} \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)} Y_{lik} (u) e^{β_{0}^{T} Z_{lik} (u)} d Λ_{0 k} (u) \\ - (1 - Δ_{lik}) \int_{0}^{t} \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)} {{\hat{α}}_{l k} {(u)}^{- 1} - {\tilde{α}}_{l}^{- 1}} ξ_{l i} Y_{lik} (u) e^{β_{0}^{T} Z_{lik} (u)} d Λ_{0 k} (u) \\ + Δ_{lik} (1 - ξ_{l i}) (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) \int_{0}^{t} \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)} {d M}_{lik} (β_{0}, u) \\ + Δ_{lik} (1 - ξ_{l i}) \int_{0}^{t} \frac{1}{{\tilde{S}}_{I I, k}^{(0)} (β_{0}, u)} {{\hat{q}}_{l k} {(u)}^{- 1} - {\tilde{q}}_{l k}^{- 1}} η_{lik} {d M}_{lik} (β_{0}, u)] \\ = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ((1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) (1 - Δ_{lik}) \int_{0}^{t} Y_{lik} (u) \\ \times [e^{β_{0}^{T} Z_{lik} (u)} - \frac{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u) e^{β_{0}^{T} Z_{l 1 k} (u)}}}{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u)}}] \frac{d Λ_{0 k} (u)}{s_{k}^{(0)} (β_{0}, u)} \\ - Δ_{lik} (1 - ξ_{l i}) (1 - \frac{η_{lik}}{{\tilde{q}}_{l k}}) \int_{0}^{t} \frac{1}{s_{k}^{(0)} (β_{0}, u)} [{d M}_{lik} (β_{0}, u) - Y_{lik} (u) \frac{E_{l} {{d M}_{l 1 k} (β_{0}, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}}]) + o_{p} (1) \end{array}

(21)

Now, by combining the above results and using the asymptotic expansion of n^{1/2}(β̃_II − β₀), where

\begin{array}{l} n^{1 / 2} ({\tilde{β}}_{I I} - β_{0}) = A {(β_{0})}^{- 1} n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} \sum_{k = 1}^{K} (M_{\tilde{z}, lik} (β_{0}, τ) \\ + (1 - Δ_{lik}) (\frac{ξ_{l i}}{{\tilde{α}}_{l}} - 1) \int_{0}^{τ} [R_{lik} (β_{0}, u) - Y_{lik} (u) \frac{E_{l} {(1 - Δ_{l 1 k}) R_{l 1 k} (β_{0}, u)}}{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u)}}] d Λ_{0 k} (u) \\ + Δ_{lik} (1 - ξ_{l i}) (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) \\ \times [M_{\tilde{z}, lik} (β_{0}, τ) - \int_{0}^{τ} Y_{lik} (u) \frac{E_{l} {d M_{\tilde{z}, l 1 k} (β_{0}, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}}]) + o_{p} (1), \end{array}

we have

n^{1 / 2} {{\tilde{Λ}}_{0 k}^{I I} ({\tilde{β}}_{I I}, t) - Λ_{0 k} (t)} = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ν_{lik} (β_{0}, t) + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) ψ_{lik}^{I I} (β_{0}, t) + n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ν_{lik}^{*} (β_{0}, t) + o_{p} (1)

where

\begin{array}{l} ν_{lik} (β, t) = r_{k} {(β, t)}^{T} A {(β)}^{- 1} \sum_{m = 1}^{K} M_{\tilde{z}, l i m} (β, τ) + \int_{0}^{t} {s_{k}^{(0)} (β, u)}^{- 1} {d M}_{lik} (β, u), \\ ψ_{lik}^{I I} (β, t) = r_{k} {(β, t)}^{T} A {(β)}^{- 1} \sum_{m = 1}^{K} (1 - Δ_{l i m}) \int_{0}^{τ} [R_{l i m} (β, u) \\ - \frac{Y_{l i m} (u) E_{l} {(1 - Δ_{l 1 m}) R_{l 1 m} (β, u)}}{E_{l} {(1 - Δ_{l 1 m}) Y_{l 1 m} (u)}}] d Λ_{0 m} (u) + (1 - Δ_{lik}) \int_{0}^{t} Y_{lik} (u) [e^{β^{T} Z_{lik} (u)} \\ - \frac{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u) e^{β^{T} Z_{l 1 k} (u)}}}{E_{l} {(1 - Δ_{l 1 k}) Y_{l 1 k} (u)}}] \frac{d Λ_{0 k} (u)}{s_{k}^{(0)} (β, u)}, \\ ν_{lik}^{*} (β, t) = r_{k} {(β, t)}^{T} A {(β)}^{- 1} \sum_{m = 1}^{K} Δ_{l i m} (1 - ξ_{l i}) (\frac{η_{l i m}}{{\tilde{q}}_{l m}} - 1) ζ_{l i m}^{(2)} (β, t) + Δ_{lik} (1 - ξ_{l i}) (\frac{η_{lik}}{{\tilde{q}}_{l k}} - 1) ζ_{lik}^{(1)} (β, t), \\ ζ_{lik}^{(1)} (β, t) = \int_{0}^{t} \frac{1}{s_{k}^{(0)} (β, u)} [{d M}_{lik} (β, u) - Y_{lik} (u) \frac{E_{l} {{d M}_{l 1 k} (β, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}}] and \\ ζ_{lik}^{(2)} (β, t) = M_{\tilde{z}, lik} (β, τ) - \int_{0}^{t} Y_{lik} (u) \frac{E_{l} {{d M}_{\tilde{z}, l 1 k} (β, u) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0}}{E_{l} {Y_{l 1 k} (u) ∣ Δ_{l 1 k} = 1}} . \end{array}

Now, let $W^{(1)} (t) = {W_{1}^{(1)} (t), \dots, W_{K}^{(1)} (t)}^{T}$ where $W_{k}^{(1)} (t) = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ν_{lik} (β_{0}, t)$, $W^{(2)} (t) = {W_{1}^{(2)} (t), \dots, W_{K}^{(2)} (t)}^{T}$ where $W_{k}^{(2)} (t) = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} (1 - \frac{ξ_{l i}}{{\tilde{α}}_{l}}) ψ_{lik}^{I I} (β_{0}, t)$, and $W^{(3)} (t) = {W_{1}^{(3)} (t), \dots, W_{K}^{(3)} (t)}^{T}$ where $W_{k}^{(3)} (t) = n^{- 1 / 2} \sum_{l = 1}^{L} \sum_{i = 1}^{n_{l}} ν_{lik}^{*} (β_{0}, t)$ for k = 1, …, K. Then, $W^{(1)} (t)$ converges weakly to a zero-mean Gaussian process $\mathcal{W}^{(1)} (t) = {\mathcal{W}_{1}^{(1)} (t), \dots, \mathcal{W}_{K}^{(1)} (t)}^{T}$ in D[0, τ]^K where the covariance function between $\mathcal{W}_{j}^{(1)} (t_{1})$ and $\mathcal{W}_{k}^{(1)} (t_{2})$ is $\sum_{l = 1}^{L} p_{l} E_{l} {ν_{l 1 j} (β_{0}, t_{1}) ν_{l 1 k} (β_{0}, t_{2})}$ by Spiekerman & Lin (1998, Theorem 2).

$W^{(2)} (t)$ can also be shown to converge weakly to a zero-mean Gaussian process $\mathcal{W}^{(2)} (t) = {\mathcal{W}_{1}^{(2)} (t), \dots, \mathcal{W}_{K}^{(2)} (t)}^{T}$. For any finite number of time points (t₁, …, t_D), the finite-dimensional distributions of $W^{(2)} (t)$ are asymptotically the same as those of $\mathcal{W}^{(2)} (t)$ by lemma 2 and the Cramér-Wold device. Since the space D[0, τ]^K is equipped with the uniform metric, it suffices to show the marginal tightness of $W_{k}^{(2)} (t)$ for each k. The marginal tightness follows directly by applying lemma 2 to $W_{k}^{(2)} (t)$. Thus, $W^{(2)} (t)$ converges weakly to a zero-mean Gaussian process where the covariance function between $\mathcal{W}_{j}^{(2)} (t_{1})$ and $\mathcal{W}_{k}^{(2)} (t_{2})$ is $\sum_{l = 1}^{L} p_{l} \frac{1 - α_{l}}{α_{l}} E_{l} {ψ_{l 1 j}^{I I} (β_{0}, t_{1}) ψ_{l 1 k}^{I I} (β_{0}, t_{2})}$.

The weak convergence of $W^{(3)} (t)$ to a zero-mean Gaussian process $\mathcal{W}^{(3)} (t)$ follows from similar arguments, with the covariance function between $\mathcal{W}_{j}^{(3)} (t_{1})$ and $\mathcal{W}_{k}^{(3)} (t_{2})$ being

\begin{array}{l} \sum_{l = 1}^{L} p_{l} (1 - α_{l}) [I (j = k) p r (Δ_{l 1 k} = 1) (\frac{1 - q_{l k}}{q_{l k}}) {Cov}_{l} {ζ_{l 1 k}^{(1)} (β_{0}, t_{1}), ζ_{l 1 k}^{(1)} (β_{0}, t_{2}) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0} \\ + p r (Δ_{l 1 j} = 1) (\frac{1 - q_{l j}}{q_{l j}}) {Cov}_{l} {ζ_{l 1 j}^{(1)} (β_{0}, t_{1}), r_{k} {(β_{0}, t_{2})}^{T} A {(β_{0})}^{- 1} ζ_{l 1 j}^{(2)} (β_{0}, t_{2}) ∣ Δ_{l 1 j} = 1, ξ_{l 1} = 0} \\ + p r (Δ_{l 1 k} = 1) (\frac{1 - q_{l k}}{q_{l k}}) {Cov}_{l} {ζ_{l 1 k}^{(1)} (β_{0}, t_{2}), r_{j} {(β_{0}, t_{1})}^{T} A {(β_{0})}^{- 1} ζ_{l 1 k}^{(2)} (β_{0}, t_{1}) ∣ Δ_{l 1 k} = 1, ξ_{l 1} = 0} \\ + \sum_{m = 1}^{K} p r (Δ_{l 1 m} = 1) (\frac{1 - q_{l m}}{q_{l m}}) \\ \times r_{j} {(β_{0}, t_{1})}^{T} A {(β_{0})}^{- 1} {Cov}_{l} {ζ_{l 1 m}^{(2)} (β_{0}, t_{1}), ζ_{l 1 m}^{(2)} (β_{0}, t_{2}) ∣ Δ_{l 1 m} = 1, ξ_{l 1} = 0} A {(β_{0})}^{- 1} r_{k} (β_{0}, t_{2})] . \end{array}

It follows from the conditional expectation argument that these three terms are mutually independent. Therefore, $W^{I I} (t) = W^{(1)} (t) + W^{(2)} (t) + W^{(3)} (t) + o_{p} (1)$ converges weakly to a zero-mean Gaussian process $\mathcal{W}^{I I} (t) = \mathcal{W}^{(1)} (t) + \mathcal{W}^{(2)} (t) + \mathcal{W}^{(3)} (t)$ where the covariance function between $\mathcal{W}_{j}^{I I} (t_{1})$ and $\mathcal{W}_{k}^{I I} (t_{2})$ is $φ_{j k}^{I I} (t_{1}, t_{2}) (β_{0})$. This completes the proof.

3.4 Proofs of Theorems 1 and 2

Proofs for Theorems 1 and 2 basically follow the same steps used for those for Theorems 3 and 4. However, the steps involving the asymptotic expansions of α̂_lk(t)⁻¹ and q̂_lk(t)⁻¹ around α̃_l and q̃_lk (lemma 3) can now be omitted. Specifically, the third and fourth terms in (9) and (10), and the second and fourth terms in (15) and (21) vanish.

4 Discussion

In this paper, we considered fitting marginal hazards models for failure time data with multiple disease outcomes from two types of stratified case-cohort study designs: the original and the generalized stratified case-cohort designs. In either design, subcohort members are sampled via stratified random sampling, with possibly different sampling proportions within each stratum, where the strata are constructed from information available on all cohort members. After the subcohort is selected, all remaining cases outside the subcohort are sampled under the original stratified case-cohort design, whereas only a fraction of the cases outside the subcohort, selected via stratified random sampling, is sampled under the generalized stratified case-cohort design. For estimation, we proposed a weighted estimating equation approach for the regression parameters and a Breslow-Aalen type estimator for the cumulative baseline hazard functions. We also provided detailed proofs of the asymptotic properties of the proposed estimators, which were shown to be consistent and asymptotically normal.
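The two sampling stages and the resulting weights can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it handles a single disease outcome, uses fixed design fractions `alpha` and `q` (assumed positive) rather than the time-varying estimators α̂_lk(t) and q̂_lk(t), and takes the weight form (1 − Δ)ξ/α + Δξ + Δ(1 − ξ)η/q as read off from the decomposition in (9). The function name and arguments are hypothetical.

```python
import random


def sample_generalized_case_cohort(stratum, case, alpha, q, seed=0):
    """Draw subcohort indicators xi, case-sampling indicators eta, and weights.

    stratum : list of stratum labels, one per cohort member
    case    : list of 0/1 failure indicators (Delta) for one disease
    alpha   : dict mapping stratum -> subcohort sampling fraction (> 0)
    q       : dict mapping stratum -> sampling fraction for cases outside
              the subcohort (> 0); q = 1 gives the original design
    """
    rng = random.Random(seed)
    n = len(stratum)
    xi = [0] * n
    eta = [0] * n
    for lab in sorted(set(stratum)):
        members = [i for i in range(n) if stratum[i] == lab]
        # Stage 1: subcohort sampled without replacement, ignoring case status.
        for i in rng.sample(members, round(alpha[lab] * len(members))):
            xi[i] = 1
        # Stage 2: sample among the cases left outside the subcohort.
        outside = [i for i in members if case[i] == 1 and xi[i] == 0]
        for i in rng.sample(outside, round(q[lab] * len(outside))):
            eta[i] = 1
    weights = []
    for i in range(n):
        lab = stratum[i]
        if case[i] == 0:
            weights.append(xi[i] / alpha[lab])   # non-cases: subcohort only
        elif xi[i] == 1:
            weights.append(1.0)                  # cases inside the subcohort
        else:
            weights.append(eta[i] / q[lab])      # cases outside the subcohort
    return xi, eta, weights
```

Setting every entry of `q` to 1 samples all cases outside the subcohort, so every case receives weight 1, which corresponds to the original stratified case-cohort design.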

One modification to the generalized case-cohort study design in the current paper might be worth considering. Instead of sampling cases outside the subcohort within each stratum separately, one might want to sample cases outside the subcohort from the whole cohort regardless of their strata. Our proposed methods can be easily adapted to this design simply by redefining the strata, i.e., defining cases outside the subcohort as a separate single stratum.

Acknowledgments

This research is partly supported by National Institutes of Health NHLBI Grant R01 HL-57444.

Contributor Information

Sangwook Kang, Email: skang@uga.edu, Department of Epidemiology and Biostatistics, University of Georgia, Athens, Georgia 30602, United States.

Jianwen Cai, Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States.

References

Andersen P, Gill R. Cox’s regression model for counting processes: A large sample study. The Annals of Statistics. 1982;10:1100–1120.
Barlow W. Robust variance estimation for the case-cohort design. Biometrics. 1994;50:1064–1072.
Borgan O, Langholz B, Samuelsen SO, Goldstein L, Pogoda J. Exposure stratified case-cohort designs. Lifetime Data Analysis. 2000;6:39–58. doi: 10.1023/a:1009661900674.
Breslow NE, Wellner JA. Weighted likelihood for semiparametric models and two-phase stratified samples, with application to Cox regression. Scandinavian Journal of Statistics. 2007;34:86–102. doi: 10.1111/j.1467-9469.2007.00574.x.
Chen K, Lo S. Case-cohort and case-control analysis with Cox’s model. Biometrika. 1999;86:755–764.
Chen K. Generalized case-cohort sampling. Journal of the Royal Statistical Society, Series B. 2001;63:791–809.
Cullen KJ. Mass health examinations in the Busselton population, 1966 to 1970. Medical Journal of Australia. 1972;2:714–718. doi: 10.5694/j.1326-5377.1972.tb103506.x.
Foutz RV. On the unique consistent solution to the likelihood equations. Journal of the American Statistical Association. 1977;72:147–148.
Hájek J. Limiting distributions in simple random sampling from a finite population. Pub Math Inst Hungar Acad Sci. 1960;5:361–374.
Horvitz DG, Thompson DJ. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association. 1952;47:663–685.
Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. 2nd ed. New York: John Wiley & Sons; 2002.
Kang S, Cai J. Marginal hazards model for case-cohort studies with multiple disease outcomes. Biometrika. 2009;96:887–901. doi: 10.1093/biomet/asp059.
Karatzas I, Shreve SE. Brownian Motion and Stochastic Calculus. 2nd ed. New York: Springer-Verlag; 1988.
Kulich M, Lin DY. Additive hazards regression for case-cohort studies. Biometrika. 2000;87:73–87.
Kulich M, Lin DY. Improving the efficiency of relative-risk estimation in case-cohort studies. Journal of the American Statistical Association. 2004;99:832–844.
Langholz B, Thomas D. Nested case-control and case-cohort methods of sampling from a cohort: A critical comparison. American Journal of Epidemiology. 1990;131:169–176. doi: 10.1093/oxfordjournals.aje.a115471.
Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society, Series B. 2000;62:711–730.
Lin DY. On fitting Cox’s proportional hazards models to survey data. Biometrika. 2000;87:37–47.
Prentice R. A case-cohort design for epidemiologic cohort studies and disease prevention trials. Biometrika. 1986;73:1–11.
Self SG, Prentice RL. Asymptotic distribution theory and efficiency results for case-cohort studies. The Annals of Statistics. 1988;16:64–81.
Shorack GR, Wellner JA. Empirical Processes with Applications to Statistics. New York: Wiley; 1986.
Spiekerman CF, Lin DY. Marginal regression models for multivariate failure time data. Journal of the American Statistical Association. 1998;93:1164–1175.
Thisted R. Elements of Statistical Computing. New York: Chapman & Hall; 1988.
van der Vaart AW, Wellner JA. Weak Convergence and Empirical Processes. New York: Springer-Verlag; 1996.
Wacholder S, Gail M, Pee D. Efficient design for assessing exposure-disease relationships in an assembled cohort. Biometrics. 1991;47:63–76.
Wei LJ, Lin DY, Weissfeld L. Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. Journal of the American Statistical Association. 1989;84:1065–1073.

