Nonparametric estimation of the mean function for recurrent event data with missing event category

FENG-CHANG LIN; JIANWEN CAI; JASON P FINE; HUICHUAN J LAI

doi:10.1093/biomet/ast016

. Author manuscript; available in PMC: 2014 Jan 9.

Published in final edited form as: Biometrika. 2013 Jun 30;100(3):10.1093/biomet/ast016. doi: 10.1093/biomet/ast016

Nonparametric estimation of the mean function for recurrent event data with missing event category

FENG-CHANG LIN ¹, JIANWEN CAI ², JASON P FINE ³, HUICHUAN J LAI ⁴

PMCID: PMC3887139 NIHMSID: NIHMS502061 PMID: 24415792

Summary

Recurrent event data frequently arise in longitudinal studies when study subjects possibly experience more than one event during the observation period. Often, such recurrent events can be categorized. However, part of the categorization may be missing due to technical difficulties. If the event types are missing completely at random, then a complete case analysis may provide consistent estimates of regression parameters in certain regression models, but estimates of the baseline event rates are generally biased. Previous work on nonparametric estimation of these rates has utilized parametric missingness models. In this paper, we develop fully nonparametric methods in which the missingness mechanism is completely unspecified. Consistency and asymptotic normality of the nonparametric estimators of the mean event functions accommodate nonparametric estimators of the event category probabilities, which converge more slowly than the parametric rate. Plug-in variance estimators are provided and perform well in simulation studies, where complete case estimators may exhibit large biases and parametric estimators generally have a larger mean squared error when the model is misspecified. The proposed methods are applied to data from a cystic fibrosis registry.

Some key words: Cystic fibrosis, Local polynomial regression, Nelson–Aalen estimation, Pseudomonas aeruginosa infection, Rate proportion

1. Introduction

Recurrent event data frequently occur in biomedical studies where subjects may suffer from repeated symptoms, infections or hospitalizations. Such data also arise in industrial manufacturing when tested units or equipment may experience multiple failures and repairs. Often, such recurrent events can be categorized. Taking cystic fibrosis for example, patients may experience repeated Pseudomonas aeruginosa infections in early childhood and later acquire other mutated types of infection, which also occur recurrently even after aggressive antibiotic use (Li et al., 2005). However, the identification of the event category may not be complete due to technical difficulties. As demonstrated in § 5, such missingness poses challenges for the analysis of the rates of particular event types.

A common approach for the analysis of recurrent events is based on a rate function. In contrast to an intensity function approach, which conditions on all previous information, a rate function approach conditions only on the current value of covariates (Pepe & Cai, 1993; Lin et al., 2000; Cook & Lawless, 2007; Cook et al., 2009). Complete case analyses that censor missing event types lead to underestimation of either the intensity or the rate function (Schaubel & Cai, 2006). Cai & Schaubel (2004) studied a proportional rate model for multiple recurrent event processes, with unbiased estimation of the regression parameters but not the baseline rate function obtained with missingness completely at random. Schaubel & Cai (2006) later proposed an estimation procedure that is valid under weaker missingness assumptions and yields unbiased estimates of the baseline rate function. Parametric models were used to estimate the missingness probabilities, which were then used as weights in the usual rate model estimating equation. Chen & Cook (2009) specified a parametric frailty model to characterize dependence amongst the events and employed maximum likelihood analysis, which requires correct specification of rate models for all event types, as well as of the frailty distribution. In this paper, we consider nonparametric estimation of the rate function without specifying parametric models for the missingness or imposing restrictions on the models for other event types.

To formalize the data set-up, suppose that there are n independent subjects with K recurrent event categories. Let $N_{i k}^{*} (t)$ denote the total number of category k events occurring before time t for subject i, such that $d N_{i k}^{*} (s) \in {0, 1}$ and $d N_{i k}^{*} (s) d N_{i ℓ}^{*} (s) = 0$ for k ≠ ℓ. The mean function $μ_{k} (t) = E {N_{i k}^{*} (t)}$ is continuous with a smooth derivative r_k(t) = dμ_k(t)/dt. Let C_i denote the censoring time for subject i. The observed number of events is given by $N_{i k} (t) = N_{i k}^{*} (t \land C_{i})$ , where a ∧ b denotes the minimum of a and b. Assuming C_i is independent of $N_{i k}^{*}$ for each i and k, we have E{N_ik(t) | Y_i (t)} = Y_i (t)μ_k(t) with Y_i (t) = I (C_i ≥ t) indicating whether subject i is at risk for any event type.

With event categories always being observed, a Nelson–Aalen-type estimator (Nelson, 1988), defined by

{\hat{μ}}_{k}^{n} (t) = \sum_{i = 1}^{n} \int_{0}^{t} Y . {(s)}^{- 1} d N_{i k} (s),

(1)

is consistent for μ_k(t) for each k, where $Y . (t) = \sum_{i = 1}^{n} Y_{i} (t)$ denotes the total number of subjects who are at risk at time t. The variance of ${\hat{μ}}_{k}^{n} (t)$ can be consistently estimated by

{\hat{V}}_{k}^{n} (t) = \sum_{i = 1}^{n} {[\int_{0}^{t} Y . {(s)}^{- 1} {d N_{i k} (s)} - Y_{i} (s) d {\hat{μ}}_{k}^{n} (s)}]}^{2} .

In the previous literature, this estimator was studied only for events of a single type (Andersen et al., 1993; Lawless & Nadeau, 1995; Cook et al., 1996; Chiang et al., 2005). With multiple event types, one may choose to explicitly model the dependence amongst the events, e.g., using a mixed Poisson process (Abu-Libdeh et al., 1990) or to construct marginal models that may be fitted separately (Cai & Schaubel, 2004). Intuitively, ${\hat{μ}}_{k}^{n}$ should behave like estimators with a single type, since the estimator is calculated separately for each k.

The estimator (1) cannot be computed when event category information is missing. Naively censoring such events in a complete case analysis leads to underestimation. On the other hand, even with such missingness, the overall event process $d N_{i} . (t) = \sum_{k = 1}^{K} d N_{i k} (t)$ is observable. Using this information and information on events with known event types, one may estimate the probabilities of different event types conditionally on the observed data. These probabilities may be incorporated as weights in (1), yielding valid inferences. Schaubel & Cai (2006) employed a fully parametric logit model for the event category probabilities. When the model is misspecified, the resulting estimate for μ_k(t) could be biased. In this paper, we develop a fully nonparametric method for estimating μ_k(t) that is able to estimate the probability of an event being type k without any model assumption. The event category probabilities cannot be estimated at the usual parametric rate, which greatly complicates the analysis of the weighted version of (1). We show that the resulting estimator is root-n consistent and asymptotically normal, with variance which may be estimated using a simple plug-in formula.

2. Estimation methods

Let δ_i (t) ∈ {1,…, K } denote the type of the event that occurs to subject i at time t, and let δ_ik(t) = I{δ_i (t) = k} be an indicator function that indicates the category. Let R_i (t) = 1 when the event category is observed and R_i (t) = 0 otherwise. When some of the event categories are missing, a complete case analysis based on events with known event types, which is defined by

{\hat{μ}}_{k}^{c} (t) = \sum_{i = 1}^{n} \int_{0}^{t} Y . {(s)}^{- 1} R_{i} (s) d N_{i k} (s),

will underestimate μ_k(t) even when the event category is missing completely at random.

Note that dN_ik(t) = δ_ik(t) dN_i_·(t), since dN_ik(t) dN_i_ℓ(t) = 0 for k ≠ ℓ. Thus, dN_ik(t) = R_i (t) dN_ik(t) + {1 − R_i (t)} δ_ik(t) dN_i_·(t), and ${\hat{μ}}_{k}^{n} (t)$ in (1) can be written as

{\hat{μ}}_{k}^{c} (t) + \sum_{i = 1}^{n} \int_{0}^{t} Y . {(s)}^{- 1} {1 - R_{i} (s)} δ_{i k} (s) d N_{i} . (s) .

(2)

Since δ_ik(t) is unobservable when R_i (t) = 0, the complete case estimator ${\hat{μ}}_{k}^{c} (t)$ underestimates the truth due to ignorance of the second part in (2). A prediction of δ_ik(t), based on observable data, could be inserted to estimate the unknown part and correct the underestimation of ${\hat{μ}}_{k}^{c} (t)$ .

Assume that π_i (t) = E{R_i (t) | dN_ik(t) = 1} is the same for each k. One can show that

p_{k} (t) = E {δ_{i k} (t) ∣ d N_{i} . (t) = 1, R_{i} (t) = 0, Y_{i} (t)} = \frac{{1 - π_{i} (t)} E {d N_{i k} (t) ∣ Y_{i} (t)}}{\sum_{ℓ = 1}^{K} {1 - π_{i} (t)} E {d N_{i ℓ} (t) ∣ Y_{i} (t)}},

for k = 1,…, K, which equals $r_{k} (t) / \sum_{ℓ = 1}^{K} r_{ℓ} (t)$ . Thus, if one can estimate p_k(t) based on the rate functions r_k(t), a consistent estimator may be derived by inserting in the estimated probabilities for the missing δ_ik(t) in (2). However, it is not clear how to estimate the rate function r_k(t) when events with missing type are present in the data. Interestingly, without estimating r_k(t) for each k, one may estimate p_k(t), a rate proportion, by utilizing the events with known type, i.e., from a complete case analysis.

One can show that the limiting processes of ${\hat{μ}}_{k}^{c} (t)$ and its derivatives, respectively, are

μ_{k}^{c} (t) = \int_{0}^{t} y_{1} {(s)}^{- 1} π_{1}^{*} (s) d μ_{k} (s), r_{k}^{c} (t) = y_{1} {(t)}^{- 1} π_{1}^{*} (t) r_{k} (t),

where $y_{1} (t) = {lim}_{n \to \infty} n^{- 1} \sum_{i = 1}^{n} E {Y_{i} (t)}$ and $π_{1}^{*} (t) = {lim}_{n \to \infty} n^{- 1} \sum_{i = 1}^{n} π_{i} (t) E {Y_{i} (t)}$ . One may utilize $r_{k}^{c} (t)$ to estimate the rate proportion p_k(t), using the fact that

p_{k} (t) = \frac{r_{k} (t)}{\sum_{ℓ = 1}^{K} r_{ℓ} (t)} = \frac{y_{1} (t) π_{1}^{*} {(t)}^{- 1} r_{k}^{c} (t)}{y_{1} (t) π_{1}^{*} {(t)}^{- 1} \sum_{ℓ = 1}^{K} r_{ℓ}^{c} (t)} = \frac{r_{k}^{c} (t)}{\sum_{ℓ = 1}^{K} r_{ℓ}^{c} (t)} .

That is, although the complete case estimator itself underestimates the true underlying rate function, it can otherwise consistently estimate the probability of an observed event being type k. We hereafter refer to this approach as the rate proportion method, since the probability is simply a proportion of the overall rate.

To estimate p_k(t), we propose a nonparametric estimator for θ_k(t) = log{p_k(t)/ p_K (t)} via a local likelihood method and estimate p_k(t) through $p_{k} (t) = exp {θ_{k} (t)} / \sum_{ℓ = 1}^{K} exp {θ_{ℓ} (t)}$ . For any time t₀ ∈ [0, τ], define the νth derivative of θ_k(t) as $θ_{k}^{(ν)} (t) = \partial^{ν} θ_{k} (t) / \partial t^{ν}$ . One may expand θ_k(t) as

θ_{k} (t) \approx \sum_{ν = 0}^{q} \frac{1}{ν!} θ_{k}^{(ν)} (t_{0}) {(t - t_{0})}^{ν},

if t is in the neighbourhood of t₀, say, t ∈ [t₀ − h, t₀ + h] with bandwidth h. Let $β_{ν k} = {(ν!)}^{- 1} θ_{k}^{(ν)} (t_{0})$ , β_k = (β₀_k, …, β_qk)^T, and ${\tilde{θ}}_{k} (t, t_{0}; β_{k}) = \sum_{ν = 0}^{q} β_{ν k} {(t - t_{0})}^{ν}$ . The local log-likelihood for $β = {(β_{1}^{T} \dots, β_{K - 1}^{T})}^{T}$ is defined by

ℓ (β) = \sum_{i = 1}^{n} \int_{0}^{τ} K_{h} (u - t_{0}) ℓ_{i} (u, t_{0}; β) R_{i} (u) d N_{i \cdot} (u),

with

ℓ_{i} (u,, t_{0}; β) = \sum_{k = 1}^{K - 1} δ_{i k} (u) {\tilde{θ}}_{k} (u, t_{0}; β_{k}) - log [1 + \sum_{k = 1}^{K - 1} exp {{\tilde{θ}}_{k} (u, t_{0}; β_{k})}],

where Inline graphic (·) = (·/ h)/ h with (·) being a kernel function; τ is a constant that satisfies pr(C_i ≥ τ) > 0 for each i. By theory of local polynomial modelling (Fan & Gijbels, 1996), we can approximate θ_k(t) by ${\tilde{θ}}_{k} (t, t_{0}; {\hat{β}}_{k}) = \sum_{ν = 0}^{q} {\hat{β}}_{ν k} {(t - t_{0})}^{ν}$ , where β̂_k = (β̂₀_k,…, β ^_qk)^T maximizes the local likelihood ℓ(β). Consequently, an estimator for θ_k(t₀) is simply the local intercept β̂₀_k, and by moving t₀ within [0, τ], we can obtain functional estimates for θ_k(t).

Our goal, however, is to replace δ_ik(t) in (2) with an estimate of p_k(t) by

p_{k} (t; \hat{θ}) = \frac{exp ({\hat{β}}_{0 k})}{\sum_{ℓ = 1}^{K} exp ({\hat{β}}_{0 ℓ})},

where θ̂ = (β̂₀₁,…, β̂₀₍_K₋₁₎)^T with β ^₀_k (k = 1,…, K − 1) being local likelihood estimates at t, and β ^₀_K ≡ 0. Our estimator of the mean function by the rate proportion method is

{\hat{μ}}_{k}^{r} (t; \hat{θ}) = {\hat{μ}}_{k}^{c} (t) + \sum_{i = 1}^{n} \int_{0}^{t} Y . {(s)}^{- 1} {1 - R_{i} (s)} p_{k} (s; \hat{θ}) d N_{i \cdot} (s),

(3)

with consistent variance estimator

{\hat{V}}_{k}^{r} (t) = n^{- 1} \sum_{i = 1}^{n} {\hat{φ}}_{i k} {(t; \hat{θ})}^{2},

(4)

where φ̂_ik(t; θ) is defined in Theorem 1 in § 3 and p_k(t) is estimated only when an event with unknown category occurred at t.

3. Asymptotic properties

Let A(υ) be a column vector that satisfies ${\tilde{θ}}_{k} (u, t; β_{k}) = β_{k}^{T} A (u - t)$ and A(υ)^⊗2 = A(υ) A(υ)^T. Take ${\tilde{p}}_{k} (u, t; β) = exp {{\tilde{θ}}_{k} (u, t; β_{k})} / [1 + \sum_{ℓ = 1}^{K - 1} exp {{\tilde{θ}}_{ℓ} (u, t; β_{ℓ})}]$ for k = 1,…, K … 1. In addition, let {(K − 1) × (q + 1)}-square matrix ℍ denote blockdiag{H,…, H} with H = diag{1, h, …, h^q}, and take β̂^* = ℍβ ^ and $β_{0}^{*} = ℍ β_{0}$ , where β₀ is the true value of β. Let $r . (u) = \sum_{k = 1}^{K} r_{k} (u)$ and $θ^{(ν)} (u) = {θ_{1}^{(ν)} (u), \dots, θ_{K - 1}^{(ν)} (u)}^{T}$ .

We first provide the following lemma showing the consistency and large sample normality of the local likelihood estimator, which can be derived from a local polynomial method (Fan & Gijbels, 1996).

Lemma 1

Assume that the regularity conditions in the Appendix hold. Given t₀ ∈ [0, τ], we have

{(n h)}^{1 / 2} {{\hat{β}}^{*} - β_{0}^{*} - b (t_{0})} \to N {0, A {(t_{0})}^{- 1} B (t_{0}) A {(t_{0})}^{- 1}}

in distribution, where $b (t_{0}) = A {(t_{0})}^{- 1} {{\bar{ℓ}}_{1} {(β_{0}^{*})}^{T}, \dots, {\bar{ℓ}}_{K - 1} {(β_{0}^{*})}^{T}}^{T}$ , with

{\bar{ℓ}}_{k} (β_{0}^{*}) = f (t_{0}) h^{q + 1} \frac{θ^{(q + 1)} {(t_{0})}^{T}}{(q + 1)!} Ω_{k} (t_{0}) \int υ^{q + 1} A (υ) K (υ) d v {1 + o (1)},

$f (t_{0}) = π_{1}^{*} (t_{0}) r . (t_{0})$ , Ω_k(t₀) is a (K − 1)-column vector with ρ_k(t₀) = p_k(t₀){1 − p_k(t₀)} in the kth element and ρ_k_ℓ(t₀) = − p_k(t₀) p_ℓ(t₀) in the ℓth element, for ℓ ≠ k; Inline graphic (t₀) consists of diagonal block elements (k = 1,…, K … 1), and off-diagonal block elements = , k ≠ ℓ, where = ρ_k(t₀) f (t₀) ∫ A(υ)^⊗2 (υ) dυ and = ρ_k_ℓ(t₀) f (t₀) ∫ A(υ)^⊗2 (υ) dυ; (t₀), the limiting variance matrix of the score function, consists of block elements Inline graphic = ρ_k(t₀) f (t₀) ∫A(υ)^⊗2 (υ)² dυ, and = = ρ_k_ℓ(t₀) f (t₀) ∫ A(υ)^⊗2 (υ)² dυ, for k ≠ ℓ.

In the special case with q = 1 and K = 2, Lemma 1 can be simplified to the following corollary.

Corollary 1

Under the conditions of Lemma 1, we have

{(n h)}^{1 / 2} {{\hat{β}}^{*} - β_{0}^{*} - b (t_{0})} \to N {0, Q (t_{0})}

in distribution, where the bias $b (t_{0}) = (h^{2} / 2) μ_{2} {θ_{1}^{(2)} (t_{0}), 0}^{T} + o (h^{2})$ and the variance $Q (t_{0}) = ρ_{1} {(t_{0})}^{- 1} f {(t_{0})}^{- 1} diag {v_{0}, μ_{2}^{- 2} v_{2}}$ with μ₂ = ∫ υ² Inline graphic (υ) dυ, ν₀ = ∫ (υ)² dυ, and ν₂ = ∫ υ² (υ)² dυ. Furthermore,

{(n h)}^{1 / 2} {\hat{θ} (t_{0}) - θ_{1} (t_{0}) - \frac{1}{2} h^{2} μ_{2} θ_{1}^{(2)} (t_{0}) + o (h^{2})} \to N {0, σ^{2} (t_{0})}

in distribution, where σ²(t₀) = ν₀ρ₁(t₀)⁻¹ f(t₀)⁻¹.

When q = 1 and K = 2, the theoretical optimal bandwidth for estimating θ₁(·) can be derived by minimizing the asymptotic integrated mean squared error ∫ {b(s)² + σ²(s)/(nh)}ω(s) ds with some weighting function ω. One can show that

h_{opt} = {\int σ^{2} (s) w (s) d s}^{1 / 5} {μ_{2}^{2} \int θ_{1}^{(2)} {(s)}^{2} w (s) d s}^{1 / 5} n^{- 1 / 5} .

For arbitrary K ≥2, one can show that the optimal choice of the bandwidth for θ_k(·) is of order n^−1/(2^q⁺³⁾ for q ≥0. This is a critical result for the proof of the root-n weak convergence rate for ${\hat{μ}}_{k}^{r}$ , due to the slower convergence rate of the local polynomial estimator θ̂

Large sample properties of ${\hat{μ}}_{k}^{r}$ are summarized in the following theorem, whose proof is given in the Appendix.

Theorem 1

Under the conditions of Lemma 1, the rate proportion estimator ${\hat{μ}}_{k}^{r} (t; \hat{θ})$ is uniformly consistent for μ_k(t) in t ∈ [0, τ], and $n^{1 / 2} {{\hat{μ}}_{k}^{r} (t; \hat{θ}) - μ_{k} (t)}$ converges weakly to a Gaussian process with mean zero and covariance function V_k(s, t), s, t ∈ [0, τ], which can be consistently estimated by

{\hat{V}}_{k}^{r} (s, t) = \sum_{i = 1}^{n} {\hat{φ}}_{i k} (s; \hat{θ}) {\hat{φ}}_{i k} (t; \hat{θ}),

(5)

where

{\hat{φ}}_{i k} (t; θ) = \int_{0}^{t} Y . {(s)}^{- 1} {\hat{Ω}}_{k} {(s)}^{T} e_{ν}^{T} \hat{b} (s) {1 - R_{i} (s)} d N_{i \cdot} (s) + \int_{0}^{t} Y . {(s)}^{- 1} d {\hat{M}}_{i k}^{r} (s; \hat{θ}),

with Ω̂ _k(s) being a consistent estimate of Ω_k(s) obtained by replacing p_k(s) with p̂_k(s; θ) for k =1,…, K − 1, $e_{ν} = {(e_{1}^{T}, \dots, e_{1}^{T})}^{T}$ with (q + 1)-column vectors e₁ = (1, 0,…, 0)^T, b̂ (s) being an estimate of the bias term b(s), and

d {\hat{M}}_{i k}^{r} (s; θ) = R_{i} (s) d N_{i k} (s) + {1 - R_{i} (s)} {\hat{p}}_{k} (s; θ) d N_{i \cdot} (s) - Y_{i} (s) d {\hat{μ}}_{k}^{r} (s; θ) .

The summation of the first term in φ̂_ik(t; θ) will be dominated by the summation of the second term. Hence the naive variance estimator for ${\hat{μ}}_{k}^{r} (t)$ , defined by

n^{- 1} \sum_{i = 1}^{n} {\int_{0}^{t} Y . {(s)}^{- 1} d {\hat{M}}_{i k}^{r} (s; \hat{θ})}^{2},

is applicable when the sample size is large, without considering the variation contributed by the local likelihood estimates. That is, the limiting variance equals that from an estimator in which the event category probabilities are known. This differs from the case where parametric missingness models are fitted (Schaubel & Cai, 2006), where the resulting variance estimators depend on the variability in the parametric model estimates.

Observe, however, that the weak convergence rate of the two summation terms can be very close, e.g., O(n^−3/5) versus O(n^−1/2), when applying the local linear model. The naive variance estimator will likely underestimate the true variance when the sample size is relatively small, while the proposed variance estimator in (5) incorporates the variability of the local polynomial estimate. Specifically, one can estimate the bias term b(s) by using a higher order polynomial. For example, in the special case with q = 1 and K = 2, the bias term depends on the second derivative of θ₁(t), which can be estimated by 2β̂₂₁ in a local cubic regression for θ₁(t). In short, we denote ${\hat{V}}_{k}^{r} (t) = {\hat{V}}_{k}^{r} (t, t)$ , as in (4).

4. Simulation studies

In this section, simulation experiments are presented to demonstrate finite sample properties of our proposed estimation procedures. Three methods were evaluated. In the analysis of event category always being observed, we include every event in the estimation to serve as a reference for comparison. This kind of analysis is not feasible in practice with missing category data. Another method is the weighted estimating equations method (Schaubel & Cai, 2006) with a parametric logit model for the probability of a target category. A biased estimate may be anticipated when the true model is misspecified by the parametric model. Our proposed method, however, aims to provide consistent and robust estimates.

We consider three scenarios. In the first and second scenarios, we considered two types of recurrent events in 200 subjects. Let λ₁(t) = 1, λ₂(t) = t, and λ₃(t) = t²/3. We first generated event processes with intensity functions Gr₀₁λ₁(t) and Gr₀₂λ₂(t), where the shared random variable G was sampled from a Gamma(1/α, α) with E(G) = 1 and var(G) = α. The mean functions we aim to estimate, therefore, are μ₁(t) = r₀₁t and μ₂(t) = r₀₂t²/2. In this setting the parametric logistic model in the weighted estimating equations method may correctly specify the model for p_k(t) if one uses log(t) as a covariate since log{p₁(t)/ p₂(t)} = log(r₀₁/r₀₂) − log(t). However, in a second scenario, if the second process is generated by an intensity function Gr₀₂{λ₁(t) + λ₂(t) + λ₃(t)} with a mean function μ₂(t) = r₀₂(t + t²/2 + t³/9), the parametric model may be off the truth if one uses t as a covariate, especially when t is large. In the third scenario, we consider three types of recurrent events when n = 50 or 200 with intensity functions Gr₀₁λ₁(t), Gr₀₂{λ₁(t) + λ₂(t)}, and Gr₀₂λ₃(t), where G = log(W)/ exp(0·5) with W generated from a standard normal distribution.

The probability of having a missing category when an event occurred is

1 - π_{i} (t) = {[1 + exp {- z_{i} {(t)}^{T} κ}]}^{- 1},

(6)

where z_i (t) = {1, t, N_i_·(t·), Z_i }^T with N_i_·(t·) counting the total number of events before t; Z_i = 1 if i is odd, and 0 otherwise. In the simulation we set κ= (κ₀, κ_t, κ_n, κ_z)^T, with κ_t = −0·1, κ_n = 0·05, and κ_z = 0 or log(8), in which κ_z ≠ 0 indicated missing due to covariates or missing at random in Little & Rubin (2002). Various values of κ₀ were set to create different amount of events with missing category in order to systematically explore the effects of missingness, for which estimators would have more variation when events with missing category occurred more often. The simulation results shown in Tables 1 and 2 support this.

Table 1.

Simulation results for μ₁(t) = r₀₁t at t = 3; all entries except $e_{r}^{n}$ and $e_{r}^{w}$ are shown after multiplication by 10²

Biasⁿ

{\tilde{V}}_{1}^{n 1 / 2}

κ_z

graphic file with name nihms502061ig12.jpg

Bias^r

{\tilde{V}}_{1}^{r 1 / 2}

{\tilde{V}}_{1}^{r 1 / 2}

graphic file with name nihms502061ig11.jpg

Bias^ω

{\tilde{V}}_{1}^{w 1 / 2}

e_{r}^{n}

e_{r}^{w}

μ₂(t) = r₀₂t²/2

0·5

−0·45

17·9

10%

−0·13

18·0

18·2

94·5

−0·38

17·9

0·99

20%

0·09

18·3

18·1

94·6

−0·38

18·1

0·96

0·98

30%

0·25

18·7

18·0

94·0

−0·48

18·4

0·91

0·97

2·08

10%

−0·18

18·3

18·1

93·8

−0·45

18·2

0·96

0·99

20%

0·00

18·6

18·1

93·7

−0·46

18·3

0·93

0·98

30%

0·49

18·9

18·1

92·9

−0·34

18·6

0·90

0·97

1·0

0·93

22·8

10%

1·05

22·9

22·2

93·8

0·86

22·9

0·99

20%

1·30

23·3

22·2

93·8

0·85

23·1

0·96

0·99

30%

1·79

23·4

22·3

94·2

1·10

23·2

0·94

0·98

2·08

10%

1·12

23·1

22·3

93·6

0·95

23·0

0·98

0·99

20%

1·42

23·1

22·3

94·2

0·99

23·0

0·97

0·99

30%

1·84

23·6

22·3

94·0

1·10

23·3

0·93

0·97

μ₂(t) = r₀₂(t + t²/2 + t³/9)

0·5

0·52

18·3

10%

0·17

18·6

18·1

93·8

2·04

18·7

0·97

1·03

20%

0·03

19·0

17·9

92·6

3·21

19·2

0·92

1·05

30%

−0·06

19·7

17·8

91·5

4·56

19·8

0·86

1·07

2·08

10%

0·29

18·5

18·1

93·7

1·81

18·7

0·98

1·03

20%

0·11

19·0

17·9

92·9

2·92

19·3

0·93

1·05

30%

0·52

19·7

17·8

91·9

4·72

20·4

0·86

1·13

1·0

2·07

21·8

10%

1·63

22·0

22·1

94·6

4·90

22·5

0·98

1·09

20%

1·35

22·3

21·9

94·4

6·62

23·1

0·96

1·16

30%

1·20

22·8

21·8

93·7

8·55

23·8

0·91

1·22

2·08

10%

1·59

21·9

22·2

94·9

4·32

22·4

0·99

1·08

20%

1·32

22·3

22·0

94·4

6·21

23·1

0·95

1·14

30%

1·35

22·7

21·9

94·1

8·13

23·8

0·92

1·23

Open in a new tab

Table 2.

Simulation results for μ₁(t) = r₀₁t when K = 3 and the frailty follows a log-normal distribution; all entries except $e_{r}^{n}$ and $e_{r}^{w}$ are shown after multiplication by 10²

Biasⁿ

Ṽⁿ^1/2

κ_z

Bias^r

Ṽ^r^1/2

V̄^r^1/2

Bias^ω

Ṽ^ω^1/2

e_{r}^{n}

e_{r}^{w}

0·83

14·5

15%

0·76

14·7

16·1

92·4

0·46

14·6

0·99

0·98

25%

0·63

14·9

18·1

93·8

0·09

14·7

0·96

0·98

35%

0·65

15·2

20·7

94·3

−0·16

14·9

0·92

0·96

2·08

15%

0·80

14·7

16·3

92·0

0·47

14·6

0·97

0·98

25%

0·76

15·0

18·1

93·9

0·19

14·8

0·95

0·97

35%

0·68

15·2

20·9

94·5

−0·20

14·9

0·92

0·96

200

0·24

7·11

15%

0·35

7·24

7·78

95·7

−0·13

7·19

0·96

0·98

25%

0·36

7·31

8·39

96·3

−0·51

7·20

0·94

0·97

35%

0·52

7·39

9·14

96·8

−0·81

7·24

0·92

0·97

2·08

15%

0·32

7·20

7·84

95·3

−0·21

7·15

0·97

0·98

25%

0·40

7·27

8·46

96·4

−0·51

7·18

0·95

0·98

35%

0·48

7·46

9·36

97·1

−0·95

7·30

0·90

0·97

3·66

38·6

15%

4·45

38·2

38·9

92·1

5·47

39·4

1·02

1·07

25%

5·10

38·3

40·3

92·9

6·33

40·0

1·01

1·10

35%

5·54

38·3

42·1

92·5

6·86

40·2

1·01

1·11

2·08

15%

4·49

38·4

39·0

91·6

5·16

39·4

1·01

1·06

25%

5·03

38·4

40·4

92·0

5·92

39·9

1·00

1·08

35%

6·58

39·5

42·7

93·1

7·55

41·6

0·94

1·11

200

0·48

19·0

15%

1·17

19·0

18·8

93·1

2·28

19·6

1·00

1·07

25%

1·76

19·3

19·2

94·0

3·13

20·0

0·96

1·09

35%

2·33

19·3

19·7

94·0

3·81

20·1

0·96

1·10

2·08

15%

1·28

19·3

18·9

93·9

2·17

19·8

0·97

1·06

25%

1·83

19·3

94·5

2·88

19·9

0·97

1·08

35%

2·88

19·3

20·0

95·1

4·00

20·3

0·95

1·12

Open in a new tab

We assumed r₀₁ = 0·75 or 1·25, r₀₂ = 0·625, and the Gamma parameter α = 0·5 or 1 in the first two scenarios, where a larger α represents higher dependence between event processes. On average, we observed about 4 events per subject when μ₂(t) = r₀₂t²/2 and about 7 events when μ₂(t) = r₀₂(t + t²/2 + t³/9). In the third scenario, we assumed r₀₁ = 0·5 and r₀₂ = 0·625, which also results in about 7 events per subject. Censoring times were independently generated by a uniform distribution between 0 and 5. All of our local likelihood estimation was implemented using the Epanechnikov kernel Inline graphic (x) = 0·75(1 − x²), |x| < 1, and a local linear model, i.e., q = 1. When K = 2, a nearest-neighbour method was used to calculate the varying bandwidth and AIC (Akaike, 1974) was used as a bandwidth selection criteria. These procedures can be implemented using an R (R Development Core Team, 2013) package locfit (Loader, 2010). When K = 3, a fixed bandwidth proportional to n^−1/5 was applied. While log(t) was used in the first scenario for the correct model specification in the weighted estimating equations method, covariates z_i (t) in (6) were used in the other two scenarios for the purpose of model misspecification.

We first show graphic results for μ₁(t) over the observation period with different combinations of α and κ_z in Fig. 1 when μ₂(t) = r₀₂t²/2. In these figures, the solid lines correspond to the true μ₁(t) and grey areas represent $μ_{1} (t) \pm 1 \cdot 96 \times {\tilde{V}}_{1}^{r} {(t)}^{1 / 2}$ , where ${\tilde{V}}_{1}^{r} (t)$ is the empirical variance of the replicated estimates ${\hat{μ}}_{1}^{r} (t)$ ; dotted lines show the average of the replicated ${\hat{μ}}_{1}^{r} (t)$ and its $\pm 1 \cdot 96 \times {\bar{V}}_{1}^{r} {(t)}^{1 / 2}$ pointwise confidence limits, where ${\bar{V}}_{1}^{r} (t)$ is the average of the replicated variance estimates ${\hat{V}}_{1}^{r} (t)$ ; dashed lines show the average of the replicated ${\hat{μ}}_{1}^{c} (t)$ based on the complete case analysis. Overall, the estimation by the complete case analysis performs worse as the follow-up time t increases, due to more events with missing category at the later part of the observation period. On the contrary, our proposed estimator based on the rate proportion method is approximately unbiased. Also, the upper and lower dotted lines cover the grey area. This means that the point estimator ${\hat{μ}}_{1}^{r} (t)$ is approximately unbiased and that the variance estimator ${\hat{V}}_{1}^{r} (t)$ approximates the asymptotic variance well.

Table 1 shows the simulation results for μ₁(t) = r₀₁t at t = 3 when r₀₁ = 0·75 in the first two scenarios using ${\hat{μ}}_{1}^{n} (t)$ in (1), the rate proportion method ${\hat{μ}}_{1}^{r} (t)$ in (3), and the weighted estimating equations method ${\hat{μ}}_{1}^{w} (t)$ . We report the bias of the estimation, defined by the average of the replicated estimates minus the true value, the empirical standard deviation ${\tilde{V}}_{1}^{1 / 2}$ , defined by the sample standard deviation of the replicated estimates, the average of the replicated standard deviation estimates ${\bar{V}}_{1}^{1 / 2}$ , empirical coverage probability at a 0·95 nominal level, denoted by Inline graphic , and the relative mean squared error to the rate proportion method, denoted by $e_{r}^{x} = m^{x} / m^{r}$ , where $m^{x} = {({bias}^{x})}^{2} + {\tilde{V}}_{1}^{x} (x = n, w)$ and m^r is defined similarly. The empirical percentage of recurrent events with missing category is denoted by Inline graphic . When μ₂(t) = r₀₂t²/2 and the weighted estimating equations method correctly specifies the model, all of the three estimators have bias close to 0 but ${\hat{μ}}_{1}^{r} (t)$ has slightly larger empirical variance that results in a larger mean squared error. However, the relative error is rather moderate to ${\hat{μ}}_{1}^{n} (t)$ and minimal to ${\hat{μ}}_{1}^{w} (t)$ . Hence our nonparametric estimator is very competitive with the current existing parametric method even when the parametric method correctly specifies the model. When μ₂(t) = r₀₂(t + t²/2 + t³/9) and the model was misspecified by the parametric method, only ${\hat{μ}}_{1}^{n} (t)$ and ${\hat{μ}}_{1}^{r} (t)$ are consistent. The estimator ${\hat{μ}}_{1}^{w} (t)$ is generally biased and has larger empirical variance than ${\hat{μ}}_{1}^{r} (t)$ , resulting in a high ratio of mean squared errors. Overall, the rate proportion method has comparable variation to the analysis when the event category is always observed, has variance estimation close to the empirical variance that results in good empirical coverage, and has substantially better mean squared error when the true model is misspecified by the weighted estimating equations method. Similar results can be seen in Table 2, where the relative mean squared error is much greater in a later time when events with missing types occur more often. Interestingly, the empirical variance Ṽ₁ changes only slightly in both the rate proportion method and the weighted estimating equations method when the missingness depends on the covariate, so both estimators seem to be robust to the mechanics of missingness. However, when the data have more events with missing category, both estimators have larger variation but the rate proportion method performs better than the misspecified weighted estimating equations method.

5. Cystic fibrosis registry data

Cystic fibrosis is the most common life-shortening genetic disorder in Caucasians, with an incidence of approximately 1 in 3000 white live births (Kosorok et al., 1996). Chronic lung disease in children can be characterized by recurrent infections of P. aeruginosa, the most important pathogen that leads to the airway obstruction and lung function decay. Pseudomonas aeruginosa infection was found to be a major predictor of morbidity and mortality (Kosorok et al., 2001). Young cystic fibrosis patients aged 1–5 years in 1990 with positive respiratory cultures for P. aeruginosa have significantly higher death rates and worse lung function during the following 8 years (Emerson et al., 2002). According to Li et al. (2005), about 30% of newborn infants acquired nonmucoid type of infection in the first 6 months of life, with a mucoid type of infection prevailing after age 4 years. It is of interest to characterize these patterns of infection in young cystic fibrosis patients.

The United States Cystic Fibrosis Foundation Patient Registry has documented the diagnosis and follow-up of all known cystic fibrosis patients from 114 accredited centres since the 1970s. The quality of this database improved greatly in 1986 because of more consistent reporting and quality control (FitzSimmons, 1993). In the 2007 registry data, there are 6585 subjects who were born after 1997 and have at least two follow-ups before the end of year 2007. The total length of follow-up is 27 412·7 person-years, averaging 4·2 years per subject. In these follow-up years, there were 10 353 nonmucoid and 3190 mucoid P. aeruginosa infections, along with 1339 events missing their category. Roughly, the occurrence rates are 3·8 for nonmucoid type and 1·2 for mucoid type per 10 years, not counting events with missing type. However, a patient may test positive for both nonmucoid and mucoid types at the same visit. To simplify the analysis, we treat the event with both types positive in the same visit as a third type of recurrent event process. Accordingly, there are 1582 such events during the follow-ups.

A large percentage of infections have missing category, so our estimation methods are preferable, as the complete case analysis that censors those events would have dramatic underestimation. Figure 2, derived by the rate proportion method and complete case analysis, reveals this. Particularly in nonmucoid type infections, there is substantial discrepancy between our estimates and the complete case analysis after the first year of age. In general, the two estimates diverge as age increases, partly due to more events with missing type being recorded over time. Based on the rate proportion method, the average number of nonmucoid type infections per patient is 2·4 by age 7, while that for mucoid type infections is 0·4. The rate for having both types of infections is similar to the rate for the mucoid type. Both increase more rapidly after age 7.

In Fig. 2, we also compare the estimation results between the rate proportion method and the weighted estimating equations method. We define the relative difference as the percent change of the weighted estimating equation estimates from the rate proportion estimates. In the weighted estimating equations approach, we used patient’s gender and mode of diagnosis as covariates.

The two methods produced similar results in estimating the nonmucoid P. aeruginosa infection rate with the relative difference being less than 5% over the range of the 10-year period. However, the infection rates of the mucoid type and of having both types in the same visit were significantly underestimated by the weighted estimating equation approach. The relative difference may reach as much as 50% in the first year of age.

6. Remarks

We assume that the observation probability π_ik(t) is the same for each event type, which may not be realistic in practice when some types of events are more likely to have a missing category. However, the observation probability may not be estimable due to lack of information in those events with missing types. One possible generalization of our approach is to assume that the observation probability is known a priori for each category. One can show that, if π_ik(t) is different for each k, our current approach leads to the estimation of $p_{i k}^{*} (t) = E {δ_{i k} ∣ d N_{i \cdot} (t) = 1, R_{i} (t) = 1, Y_{i} (t)}$ , which differs from p_ik(t) = E{δ_ik | dN_i_·(t) = 1, R_i(t) = 0, Y_i(t)}, the unknown quantity in our mean function estimator. However, with $r_{k} (t) = π_{i k} {(t)}^{- 1} p_{i k}^{*} (t)$ and $p_{i k} (t) = {1 - π_{i k} (t)} r_{k} (t) / \sum_{ℓ = 1}^{K} {1 - π_{i ℓ} (t)} r_{ℓ} (t)$ , one may estimate p_ik(t) with the estimation of $p_{i k}^{*} (t)$ and known π_ik(t).

In the rate proportion method, the local likelihood procedure yields a nonparametric estimator via a regression model that uses time as a covariate. One may prefer to apply different non-parametric regression methods for categorized outcomes, such as the generalized additive model (Hastie & Tibshirani, 1990) or smoothing splines (Gu, 2002). It will be of interest to develop asymptotic theory for estimates based on such approaches and compare the performance across different nonparametric regression methods.

Acknowledgments

The authors thank Dr Preston Campbell from the United States Cystic Fibrosis Foundation for providing the registry data, and the editor and two referees for their helpful comments. This work is partially supported by the U.S. National Institutes of Health.

Appendix

We first provide the following regularity conditions.

Condition A1. Variables {N_i₁(·), …, N_iK(·)} (i = 1, …, n) are independent and identically distributed.
Condition A2. The expected number of subjects at risk $y_{1} (t) = {lim}_{n \to \infty} n^{- 1} \sum_{i = 1}^{n} E {Y_{i} (t)} > 0$ for every t ∈ [0, τ].
Condition A3. The total number of events N_i_·(τ) < η < ∞.
Condition A4. For t ∈ [0, τ], observation probability π_i(t) = E{R_i(t)|dN_ik(t) = 1} is the same for every k.
Condition A5. The likelihood function ℓ(β^*) is bounded and twice differentiable. The Hessian matrix ℓ̈(β^*) = ∂²ℓ(β^*)/∂β^*∂β^*T is negative definite and invertible.
Condition A6. The function θ_k(·) for each k ∈ {1, …, K} has a continuous (q + 1)th derivative for q > 0.
Condition A7. The kernel function (·) has a bounded and symmetric density with a compact support, and satisfies ∫ υ (υ) dυ = ∫ υ³ (υ) dυ = 0.
Condition A8. Assume nh → ∞ as h → 0 and n → ∞.

Conditions A1–A3 are regularity conditions for recurrent event processes. We require data from the subjects to be independent and identically distributed in Condition A1. Our estimation, however, accommodates multiple dependent recurrent event processes. Condition A4 assumes that each type of event has the same probability for the category being missing. Conditions A5–A8 are otherwise regularity conditions for the large sample properties of the local likelihood estimates.

Proof of Theorem 1

To show the consistency of ${\hat{μ}}_{k}^{r} (t; \hat{θ})$ , we decompose ${\hat{μ}}_{k}^{r} (t; \hat{θ}) - μ_{k} (t)$ as ${\hat{μ}}_{k}^{r} (t; \hat{θ}) - {\hat{μ}}_{k}^{r} (t) + {\hat{μ}}_{k}^{r} (t) - μ_{k} (t)$ with

{\hat{μ}}_{k}^{r} (t) = {\hat{μ}}_{k}^{c} (t) + \sum_{i = 1}^{n} \int_{0}^{t} Y . {(s)}^{- 1} {1 - R_{i} (s)} p_{k} (s) d N_{i \cdot} (s) .

Let $ω_{k}^{(1)} (t; \hat{θ}) = {\hat{μ}}_{k}^{r} (t; \hat{θ}) - {\hat{μ}}_{k}^{r} (t)$ . First, we write

ω_{k}^{(1)} (t; \hat{θ}) = \sum_{i = 1}^{n} \int_{0}^{t} Y . {(s)}^{- 1} {1 - R_{i} (s)} {p_{k} (s; \hat{θ}) - p_{k} (s)} d N_{i \cdot} (s) .

Then, expanding $ω_{k}^{(1)} (t; \hat{θ})$ around θ = (θ₁, …, θ_K _{− 1})^T, we have

ω_{k}^{(1)} (t; \hat{θ}) = \sum_{i = 1}^{n} \int_{0}^{t} Y . {(s)}^{- 1} {1 - R_{i} (s)} Ω_{k} {(s)}^{T} {\hat{θ} (s) - θ (s)} d N_{i \cdot} (s) + o (1) .

It can be shown that $ω_{k}^{(1)} (t; \hat{θ})$ converges in probability to

\int_{0}^{t} y_{1} {(s)}^{- 1} Ω_{k} {(s)}^{T} e_{v}^{T} b (s) {1 - π_{1}^{*} (s)} r . (s) d s .

Since the bias term b(t) converges uniformly in probability to 0 when h → 0, we can conclude that $ω_{k}^{(1)} (t; \hat{θ})$ converges in probability to 0, uniformly in t. With ${\hat{μ}}_{k}^{r} (t) = {\hat{μ}}_{k}^{n} (t) + o (1)$ uniformly in t, we can prove the uniform consistency of ${\hat{μ}}_{k}^{r} (t; \hat{θ})$ by the fact that ${\hat{μ}}_{k}^{n} (t)$ uniformly converge to μ_k(t).

To prove the large sample normality we need to obtain the rate of the weak convergence when inserting in the local polynomial estimate. One can show that $ω_{k}^{(1)} (t : \hat{θ})$ has the same weak convergence rate as

\int_{0}^{t} y_{1} {(s)}^{- 1} Ω_{k} {(s)}^{T} {\hat{θ} (s) - θ_{0} (s)} {1 - π_{1}^{*} (s)} r . (s) d s .

(A1)

Recall that the local polynomial estimate θ̂(s) is O(n^−1/(2^q⁺³⁾) when using the optimal bandwidth. Under the smoothness assumption of θ, one can show that (A1) is O(n⁻⁽^q^+2)/(2^q⁺³⁾), which is faster than O(n^−1/2). That means the sequence of $ω_{k}^{(1)} (t; \hat{θ})$ will be dominated by the sequence of $ω_{k}^{(2)} (t) = {\hat{μ}}_{k}^{r} (t) - μ_{k} (t)$ , which has a O(n^−1/2) weak convergence rate.

Combined with the asymptotic equivalency of $n^{1 / 2} ω_{k}^{(2)} (t)$ and $n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{t} y_{1} {(s)}^{- 1} d M_{i k}^{r} (s)$ , where $d M_{i k}^{r} (t) = R_{i} (s) d N_{i k} (s) + {1 - R_{i} (s)} p_{k} (s) d N_{i \cdot} (s) - Y_{i} (s) d μ_{k} (s)$ , one can show that n^1/2ω_k(t; θ̂) is asymptotically equivalent to $n^{- 1 / 2} \sum_{i = 1}^{n} φ_{i k} (t; θ_{0})$ , where

φ_{i k} (t; θ) = \int_{0}^{t} y_{1} {(s)}^{- 1} Ω_{k} {(s)}^{T} e_{v}^{T} b (s) {1 - R_{i} (s)} d N_{i \cdot} (s) + \int_{0}^{t} y_{1} {(s)}^{- 1} d M_{i k}^{r} (s) .

Notice that φ_ik(t; θ₀) (i = 1, …, n) are independent and identically distributed zero-mean variables, so $n^{- 1 / 2} \sum_{i = 1}^{n} φ_{i k} (t; θ_{0})$ converges to a multivariate normal distribution with mean zero and covariance V_k(s, t) = E{φ₁_k(s; θ₀)φ₁_k(t; θ₀)} for s, t ∈ [0, τ]. Hence n^1/2ω_k(t; θ̂) converges weakly to a Gaussian process by the functional central limit theorem (Pollard, 1990), as the φ_ik(t; θ₀) is composed of functions that are monotone in t, i.e., φ_ik(t; θ₀) is manageable and n^1/2ω_k(t; θ₀) is tight.

Contributor Information

FENG-CHANG LIN, Email: flin@bios.unc.edu, Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A.

JIANWEN CAI, Email: cai@bios.unc.edu, Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A.

JASON P. FINE, Email: jfine@bios.unc.edu, Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina 27599, U.S.A

HUICHUAN J. LAI, Email: hlai@wisc.edu, Department of Nutritional Sciences, University of Wisconsin, Madison, Wisconsin 53706, U.S.A

References

Abu-Libdeh H, Turnbull BW, Clark LC. Analysis of multi-type recurrent events in longitudinal studies: Application to a skin cancer prevention trial. Biometrics. 1990;46:1017–34. [PubMed] [Google Scholar]
Akaike H. A new look at the statistical model identification. IEEE Trans Auto Contr. 1974;19:716–23. [Google Scholar]
Andersen P, Borgan Ø, Gill R, Keiding N. Statistical Models Based on Counting Processes. New York: Springer; 1993. [Google Scholar]
Cai J, Schaubel D. Marginal means/rates models for multiple type recurrent event data. Lifetime Data Anal. 2004;10:121–38. doi: 10.1023/b:lida.0000030199.23383.45. [DOI] [PubMed] [Google Scholar]
Chen BE, Cook RJ. The analysis of multivariate recurrent events with partially missing event types. Lifetime Data Anal. 2009;15:41–58. doi: 10.1007/s10985-008-9091-3. [DOI] [PubMed] [Google Scholar]
Chiang CT, Wang MC, Huang CY. Kernel estimation of rate function for recurrent event data. Scand J Statist. 2005;32:77–91. doi: 10.1111/j.1467-9469.2005.00416.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cook RJ, Lawless JF. The Statistical Analysis of Recurrent Events. New York: Springer; 2007. [Google Scholar]
Cook RJ, Lawless JF, Lakhal-Chaieb L, Lee KA. Robust estimation of mean functions and treatment effects for recurrent events under event-dependent censoring and termination: Application to skeletal complications in cancer metastatic to bone. J Am Statist Assoc. 2009;104:60–75. [Google Scholar]
Cook RJ, Lawless JF, Nadeau C. Robust tests for treatment comparisons based on recurrent event responses. Biometrics. 1996;52:557–71. [PubMed] [Google Scholar]
Emerson J, Rosenfeld M, McNamaral S, Ramsey B, Gibson RL. Pseudomonas aeruginosa and other predictors of mortality and morbidity in young children with cystic fibrosis. Pediatric Pulmonol. 2002;34:91–100. doi: 10.1002/ppul.10127. [DOI] [PubMed] [Google Scholar]
Fan J, Gijbels I. Local Polynomial Modelling and Its Applications. London: Chapman & Hall; 1996. [Google Scholar]
FitzSimmons SC. The changing epidemiology of cystic fibrosis. J Pediatrics. 1993;122:1–9. doi: 10.1016/s0022-3476(05)83478-x. [DOI] [PubMed] [Google Scholar]
Gu C. Smoothing Spline ANOVA models. New York: Springer; 2002. [Google Scholar]
Hastie T, Tibshirani R. Generalized Additive Models. London: Chapman & Hall; 1990. [DOI] [PubMed] [Google Scholar]
Kosorok MR, Wei WH, Farrell PM. The incidence of cystic fibrosis. Statist Med. 1996;15:449–62. doi: 10.1002/(SICI)1097-0258(19960315)15:5<449::AID-SIM173>3.0.CO;2-X. [DOI] [PubMed] [Google Scholar]
Kosorok MR, Zeng L, West SE, Rock MJ, Splaingard ML, Laxova A, Green CG, Collins J, Farrell PM. Acceleration of lung disease in children with cystic fibrosis after Pseudomonas aeruginosa acquisition. Pediatric Pulmonol. 2001;32:277–87. doi: 10.1002/ppul.2009.abs. [DOI] [PubMed] [Google Scholar]
Lawless J, Nadeau C. Some simple robust methods for the analysis of recurrent events. Technometrics. 1995;37:158–68. [Google Scholar]
Li Z, Kosorok MR, Farrell PM, Laxova A, West SEH, Green CG, Collins J, Rock MJ, Splaingard ML. Longitudinal development of mucoid Pseudomonas aeruginosa infection and lung disease progression in children with cystic fibrosis. J Am Med Assoc. 2005;293:581–8. doi: 10.1001/jama.293.5.581. [DOI] [PubMed] [Google Scholar]
Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. J R Statist Soc B. 2000;62:711–30. [Google Scholar]
Little RJA, Rubin DB. Statistical Analysis with Missing Data. New York: Wiley; 2002. [Google Scholar]
Loader C. R package version 1.5-6. 2010. locfit: Local Regression, Likelihood and Density Estimation. [Google Scholar]
Nelson WB. Graphical analysis of system repair data. J Qual Technol. 1988;20:24–35. [Google Scholar]
Pepe MS, Cai J. Some graphical displays and marginal regression analyses for recurrent failure times and time dependent covariates. J Am Statist Assoc. 1993;88:811–20. [Google Scholar]
Pollard D. Empirical Processes: Theory and Applications. Hayward: Institute of Mathematical Statistics; 1990. [Google Scholar]
R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2013. http://www.R-project.org. [Google Scholar]
Schaubel D, Cai J. Rate/mean regression for multiple-sequence recurrent event data with missing event category. Scand J Statist. 2006;33:191–207. [Google Scholar]

[R1] Abu-Libdeh H, Turnbull BW, Clark LC. Analysis of multi-type recurrent events in longitudinal studies: Application to a skin cancer prevention trial. Biometrics. 1990;46:1017–34. [PubMed] [Google Scholar]

[R2] Akaike H. A new look at the statistical model identification. IEEE Trans Auto Contr. 1974;19:716–23. [Google Scholar]

[R3] Andersen P, Borgan Ø, Gill R, Keiding N. Statistical Models Based on Counting Processes. New York: Springer; 1993. [Google Scholar]

[R4] Cai J, Schaubel D. Marginal means/rates models for multiple type recurrent event data. Lifetime Data Anal. 2004;10:121–38. doi: 10.1023/b:lida.0000030199.23383.45. [DOI] [PubMed] [Google Scholar]

[R5] Chen BE, Cook RJ. The analysis of multivariate recurrent events with partially missing event types. Lifetime Data Anal. 2009;15:41–58. doi: 10.1007/s10985-008-9091-3. [DOI] [PubMed] [Google Scholar]

[R6] Chiang CT, Wang MC, Huang CY. Kernel estimation of rate function for recurrent event data. Scand J Statist. 2005;32:77–91. doi: 10.1111/j.1467-9469.2005.00416.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] Cook RJ, Lawless JF. The Statistical Analysis of Recurrent Events. New York: Springer; 2007. [Google Scholar]

[R8] Cook RJ, Lawless JF, Lakhal-Chaieb L, Lee KA. Robust estimation of mean functions and treatment effects for recurrent events under event-dependent censoring and termination: Application to skeletal complications in cancer metastatic to bone. J Am Statist Assoc. 2009;104:60–75. [Google Scholar]

[R9] Cook RJ, Lawless JF, Nadeau C. Robust tests for treatment comparisons based on recurrent event responses. Biometrics. 1996;52:557–71. [PubMed] [Google Scholar]

[R10] Emerson J, Rosenfeld M, McNamaral S, Ramsey B, Gibson RL. Pseudomonas aeruginosa and other predictors of mortality and morbidity in young children with cystic fibrosis. Pediatric Pulmonol. 2002;34:91–100. doi: 10.1002/ppul.10127. [DOI] [PubMed] [Google Scholar]

[R11] Fan J, Gijbels I. Local Polynomial Modelling and Its Applications. London: Chapman & Hall; 1996. [Google Scholar]

[R12] FitzSimmons SC. The changing epidemiology of cystic fibrosis. J Pediatrics. 1993;122:1–9. doi: 10.1016/s0022-3476(05)83478-x. [DOI] [PubMed] [Google Scholar]

[R13] Gu C. Smoothing Spline ANOVA models. New York: Springer; 2002. [Google Scholar]

[R14] Hastie T, Tibshirani R. Generalized Additive Models. London: Chapman & Hall; 1990. [DOI] [PubMed] [Google Scholar]

[R15] Kosorok MR, Wei WH, Farrell PM. The incidence of cystic fibrosis. Statist Med. 1996;15:449–62. doi: 10.1002/(SICI)1097-0258(19960315)15:5<449::AID-SIM173>3.0.CO;2-X. [DOI] [PubMed] [Google Scholar]

[R16] Kosorok MR, Zeng L, West SE, Rock MJ, Splaingard ML, Laxova A, Green CG, Collins J, Farrell PM. Acceleration of lung disease in children with cystic fibrosis after Pseudomonas aeruginosa acquisition. Pediatric Pulmonol. 2001;32:277–87. doi: 10.1002/ppul.2009.abs. [DOI] [PubMed] [Google Scholar]

[R17] Lawless J, Nadeau C. Some simple robust methods for the analysis of recurrent events. Technometrics. 1995;37:158–68. [Google Scholar]

[R18] Li Z, Kosorok MR, Farrell PM, Laxova A, West SEH, Green CG, Collins J, Rock MJ, Splaingard ML. Longitudinal development of mucoid Pseudomonas aeruginosa infection and lung disease progression in children with cystic fibrosis. J Am Med Assoc. 2005;293:581–8. doi: 10.1001/jama.293.5.581. [DOI] [PubMed] [Google Scholar]

[R19] Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. J R Statist Soc B. 2000;62:711–30. [Google Scholar]

[R20] Little RJA, Rubin DB. Statistical Analysis with Missing Data. New York: Wiley; 2002. [Google Scholar]

[R21] Loader C. R package version 1.5-6. 2010. locfit: Local Regression, Likelihood and Density Estimation. [Google Scholar]

[R22] Nelson WB. Graphical analysis of system repair data. J Qual Technol. 1988;20:24–35. [Google Scholar]

[R23] Pepe MS, Cai J. Some graphical displays and marginal regression analyses for recurrent failure times and time dependent covariates. J Am Statist Assoc. 1993;88:811–20. [Google Scholar]

[R24] Pollard D. Empirical Processes: Theory and Applications. Hayward: Institute of Mathematical Statistics; 1990. [Google Scholar]

[R25] R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2013. http://www.R-project.org. [Google Scholar]

[R26] Schaubel D, Cai J. Rate/mean regression for multiple-sequence recurrent event data with missing event category. Scand J Statist. 2006;33:191–207. [Google Scholar]

PERMALINK

Nonparametric estimation of the mean function for recurrent event data with missing event category

FENG-CHANG LIN

JIANWEN CAI

JASON P FINE

HUICHUAN J LAI

Summary

1. Introduction

2. Estimation methods

3. Asymptotic properties

Lemma 1

Corollary 1

Theorem 1

4. Simulation studies

Table 1.

Table 2.

Fig. 1.

5. Cystic fibrosis registry data

Fig. 2.

6. Remarks

Acknowledgments

Appendix

Proof of Theorem 1

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Nonparametric estimation of the mean function for recurrent event data with missing event category

FENG-CHANG LIN

JIANWEN CAI

JASON P FINE

HUICHUAN J LAI

Summary

1. Introduction

2. Estimation methods

3. Asymptotic properties

Lemma 1

Corollary 1

Theorem 1

4. Simulation studies

Table 1.

Table 2.

Fig. 1.

5. Cystic fibrosis registry data

Fig. 2.

6. Remarks

Acknowledgments

Appendix

Proof of Theorem 1

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases