Semiparametric modeling and estimation of the terminal behavior of recurrent marker processes before failure events

Kwun Chuen Gary Chan; Mei-Cheng Wang

doi:10.1080/01621459.2016.1140051

. Author manuscript; available in PMC: 2018 May 3.

Published in final edited form as: J Am Stat Assoc. 2017 May 3;112(517):351–362. doi: 10.1080/01621459.2016.1140051

Semiparametric modeling and estimation of the terminal behavior of recurrent marker processes before failure events

Kwun Chuen Gary Chan ¹, Mei-Cheng Wang ²

PMCID: PMC5501427 NIHMSID: NIHMS751407 PMID: 28694552

Abstract

Recurrent event processes with marker measurements are mostly and largely studied with forward time models starting from an initial event. Interestingly, the processes could exhibit important terminal behavior during a time period before occurrence of the failure event. A natural and direct way to study recurrent events prior to a failure event is to align the processes using the failure event as the time origin and to examine the terminal behavior by a backward time model. This paper studies regression models for backward recurrent marker processes by counting time backward from the failure event. A three-level semiparametric regression model is proposed for jointly modeling the time to a failure event, the backward recurrent event process, and the marker observed at the time of each backward recurrent event. The first level is a proportional hazards model for the failure time, the second level is a proportional rate model for the recurrent events occurring before the failure event, and the third level is a proportional mean model for the marker given the occurrence of a recurrent event backward in time. By jointly modeling the three components, estimating equations can be constructed for marked counting processes to estimate the target parameters in the three-level regression models. Large sample properties of the proposed estimators are studied and established. The proposed models and methods are illustrated by a community-based AIDS clinical trial to examine the terminal behavior of frequencies and severities of opportunistic infections among HIV infected individuals in the last six months of life.

Keywords: Marked Counting Process, Partial Likelihood, Recurrent Event Process, Semi-parametric models

1. INTRODUCTION

In prospective follow-up studies, recurrent events with marker data are often collected where markers are observed at the occurrence of recurrent events. Examples include CD4 counts, viral load measurements, severity scores at recurrent opportunistic infections and medical costs at repeated hospital visits. Recurrent marker processes often exhibit a certain terminal behavior before failure events such as deaths. Lunney et al. (2003) studied the functional decline at the last year of life and concluded that heterogeneity in the decline is often observed in clinical settings. They suggested that a better understanding of functional decline before death will improve organization and delivery of palliative care. The understanding of marker behavior prior to the failure event could identify vulnerable subpopulations that will help prioritizing services or treatments in resources-limited settings. Using the human immun-odeficiency virus (HIV) infection as an illustrating example, it is scientifically and clinically interesting to understand the terminal behavior of the frequency and severity of opportunistic infections before death. Evidence suggests that HIV-infected patients experienced higher frequency of AIDS-defining events before death, where the frequencies could vary with gender, risk behaviors or geographic location (Chan et al., 1995). Taking Alzheimer’s Disease as another example, cognitive decline across the spectrum of numerous symptoms is commonly characterized by changes in biomarkers before diagnosis of the disease. Interestingly, backward-in-time performance of biomarkers for Alzheimer’s Disease is widely recognized as an important research topic for signaling the occurrence of disease, though the data analyses conducted in the research area typically overlooked the effect of censoring from follow-up (Hall et al., 2000; Wilson et al. 2007, among other relevant papers). For end-stage renal diseases, Usvyat et al. (2013) studied the pattern of interdialytic weight gain, systolic blood pressure, serum albumin and C-reactive protein levels before death, and showed that race was associated with some biomarkers but not others. Based on the results, they made a clinical recommendation on how patients with end-stage renal diseases should be monitored.

When interests focus on the behavior of processes counting time forward from an initial event, Nelson (1988), Pepe and Cai (1993), Lawless and Nadeau (1995), Cook and Lawless (1997), Lin et al. (2000), Wang, Qin and Chiang (2001), Liu, Wolfe and Huang (2004), Zeng and Lin (2007), Liu, Schaubel and Kalbfleisch (2012) among others studied recurrent event processes, Lin (2000) studied medical cost processes, Pawitan and Self (1993) studied CD4 count processes, and Huang and Wang (2004), Ye, Kalbfleisch and Schaubel (2007), Huang, Qin and Wang (2010), Zhao, Zhou and Sun (2011) and Kalbfleisch et al. (2013) considered joint analysis of recurrent and terminal events. Such forward processes, however, cannot conveniently model the terminal behavior of the processes. In fact, most of the forward models studied in the literature implicitly assume that the recurrent marker processes show no change in pattern before failure events. An exception is Liu, Wolfe and Kalbfleisch (2007), who considered a joint model of repeated monthly medical costs and failure time which allows costs to increase uniformly for a period of time before death.

By aligning the time origins to the failure events, models on backward processes are natural and direct ways to study the terminal behavior of stochastic processes. Although the notion of backward process is implicitly employed in the medical publications listed above, the adopted analytical approaches are usually ad hoc and based on only uncensored data. For example, Chan et al. (1995) and Lunney et al. (2003) employed complete-case analyses that used only uncensored observations to study risk factors associated with the number of infections before death. A similar analytical strategy was also used by Hall et al. (2000) to study the association between biomarkers and Alzheimer’s disease incidence, where uncensored observations are treated as cases and all the censored observations are treated as controls. Such analytical approaches are common in biomedical data applications and would typically result in biased estimates for population parameters when failure time and recurrent marker process are correlated. Specifically, if we treat right censored subjects as missing, the complete-case subsample is biased toward parameter values for a subpopulation with shorter failure times. To study the terminal behavior of dynamic processes, Chan and Wang (2010) introduced the backward recurrent event process that counts time backward from the failure event. In that paper they compared forward and backward models, and considered one sample estimation problems. By aligning the origins of the processes to failure events, the terminal behavior of stochastic processes could be naturally and directly studied by backward processes.

In this paper, we propose a three-level joint model of failure time and backward recurrent marker processes. We directly model the recurrent marker processes before failure events by aligning the origins of the processes at failure events, and study the terminal behavior of the processes by counting time backward from the failure events. One of the analytical challenges is to deal with censoring, as time origins of backward processes are only partially observed due to right censoring. Another challenge arises because we want to model the backward processes flexibly by semiparametric models which contain multiple nuisance functions. By jointly modeling the failure time, recurrent events and a marker measurement, estimating equations are proposed for follow-up data collected subject to right censoring.

The manuscript is organized as follows. A three-level semiparametric model is proposed in Section 2 to jointly model the hazard rate of failure, the rate of occurrence of recurrent events before death and the average level of marker measurement observed at each recurrence. Semiparametric inference for the proposed model is developed in Section 3: Estimation and inference of finite-dimensional target parameters are established in Section 3.1, estimation of infinite-dimensional functional parameters is studied in Section 3.2, and a diagnostic test for the proposed model is discussed in Section 3.3. Section 4 contains numerical results, which includes a simulation study and an analysis of a data set from the Terry Beirn Community Programs for Clinical Research on AIDS studies. Section 5 provides several concluding remarks. Proofs of the theoretical results are given in the Appendix.

2. A THREE-LEVEL SEMIPARAMETRIC MODEL

In this section, a semiparametric model is proposed for the backward recurrent marker processes. Let (M(t), Q(t)) be a bivariate process in the time interval [0, τ₁], where τ₁ is a pre-specified time corresponding to the maximal follow-up time, M(t) is the cumulative number of recurrent events from the time origin to time t, and Q(t) is a marker process which is potentially observable conditioning on the occurrence of an event, i.e. dM(t) = 1, t ∈ [0, τ₁]. Suppose the marker data are observed only when recurrent events occur. Here, M(·) and Q(·) could be arbitrarily correlated. We define $V (t) = \int_{0}^{t} Q (s) d M (s)$ as a recurrent marker process. This process is well-defined since Q(s) is defined given dM(s) = 1, or equivalently, given the occurrence of a recurrent event at time s.

In the Terry Beirn Community Programs for Clinical Research on AIDS (CPCRA) study, the initial event is treatment randomization and the failure event is death, M(t) is the number of opportunistic infections within t time units after randomization, Q(t) is the severity score associated with an infection at time t and V (t) is the severity-weighted number of opportunistic infections up to time t after randomization (Neaton et al., 1994). V (t) is a measure of both the frequency and the severity of opportunistic infections. In the absence of censoring, recurrent marker processes are terminated by a failure event such as death. We call such an event a theoretical terminal event to distinguish them from an observed terminal event in the presence of incomplete follow-up. Let T be the time to the failure event, C be the censoring time and Y = min(T, C) be the observed terminal event. Often in the recurrent events literature, an observed terminal event is called a censoring event which may cause confusion, and we distinguish the observed terminal event from the potential censoring event in this paper. Let Δ = I(T ≤ C) be an indicator that a failure event is observed, and X be a q-vector covariate. Extensions to time-varying covariates will be discussed in Section 5.

To study the terminal behavior of recurrent marker processes, we define the following backward processes: M^B(u) = M(T) – M(T – u)⁻, Q^B(u) = Q(T – u) and $V^{B} (u) = \int_{0}^{u} Q^{B} (v) d M^{B} (v) = V (T) - V {(T - u)}^{-}$ , where the superscript ⁻ represents the left-hand limit. With slight abuse of notations, the superscript ^T represents the transpose of a matrix or a column vector. Conditioning on covariate X, the outcomes of interests (T, M^B(·), Q^B(·)) are allowed to be correlated. A conditional independent censoring condition is assumed for estimation, in which (T, M^B(·), Q^B(·)) is conditionally independent of the censoring time C given covariate X. Note that the backward processes are defined relative to the failure event occurring at T, not the observed terminal event occurring at Y. In the CPCRA study, M^B(u) is the number of opportunistic infections within u time units before death, Q^B(u) is the severity score associated with an infection at time u before death and V^B(u) is the severity-weighted number of opportunistic infections within u time units before death. The time origins for the backward processes, u = 0, are the failure events of interest. The mean function of backward processes has a direct interpretation as the average pattern of processes before failure events. The forward and backward processes are designed to study different scientific questions about the processes. For example, forward medical cost processes are more relevant for studying the total cost of care for cancer patients, but backward costs are more relevant for comparing and evaluating palliative care settings.

There are certain considerations for constructing regression models of backward processes. First of all, the failure time T, the backward recurrent event process M^B(u) and the backward marker process Q^B(u) are dependent in general and a regression model should allow such a dependence. Also, there is an order of defining the processes. A backward process can be viewed as a marked process defined at the failure event, which extends the notion of a marked variable discussed in Huang and Louis (1998), and a marked variable can be considered as a generalization of a cause of death in competing risk (Prentice et al., 1978; Sun, Gilbert and McKeague, 2009). In addition, a backward marker measurement is observed only when a recurrent event occurs. Note that parameters for marginal models of marked processes are not identified when we have limited study period, as discussed in Huang (2002) and Huang and Wang (2003). One way to overcome this identifiability problem is to jointly model both the failure time and the backward processes.

Following the above considerations, we propose the following three-level regression model for jointly modeling T, M^B(u) and Q^B(u). Let h(t; x) be the hazard function at time t conditioning on a covariate value x, and λ(u; x, t) be the backward rate of recurrent event at u time units before the failure event given a covariate value x and the occurrence of failure event at time t after the initial event, that is λ(u; x, t)du = E(dM^B(u)|X = x, T = t). Conditioned on an occurrence of recurrent event at backward time u before the failure event, let μ(u; x, t) represents the mean of marker, Q^B(u), given covariate value x and failure time t, that is μ(u; x, t) = E(Q^B(u)|X = x, T = t, M^B(du) = 1). In the CPCRA example, λ(u; x, t) is the average number of opportunistic infections per unit time at backward time u before death given covariates X = x and that death occurs at T = t, μ(u; x, t) is the mean severity score if an opportunistic infection occurs at backward time u before death. As the terminal behavior of processes occur within a rather short period of time before failure events, relevant scientific questions center on a short period τ₀ before death, where τ₀ is a pre-specified time period of interest which is usually much short than the maximum follow-up period τ₁. For example, Chan et al. (1995) studied the frequency of opportunistic infections within the last six months of life, and Chan and Wang (2010) studied the medical cost for cancer patients within the last year of life. While τ₀ is typically specified beforehand indicating a time period of scientific interest, it is ideal to pick τ₀ which is not too small and not too large. If τ₀ is small, only a small amount of backward data can be used and it will lead to a reduction of estimation efficiency. If τ₀ is large, as subjects with failure time less than τ₀ are not included in the model population, it implies that a good proportion of subjects will be excluded from the model population and this will subsequently affect the biomedical or public health interpretation of the analysis results.

We consider the following three-level model:

Level 1

Proportional hazard model of T conditioning on X = x,

h (t; x) = h_{0} (t) exp (ξ_{0}^{T} x)

Level 2

Proportional rate model of M^B(u) conditioning on (X = x, T = t),

λ (u; x, t) = λ_{0} (u) exp (f_{0} (t) + α_{0}^{T} x)

Level 3

Proportional mean model of Q^B(u) conditioning on dM^B(u) = 1 and (X = x, T = t),

μ (u; x, t) = μ_{0} (u) exp (g_{0} (t) + β_{0}^{T} x)

where t > 0, u ∈ [0, τ₀], and h₀(t), f₀(t), g₀(t), λ₀(u) and μ₀(u) are unspecified functions. Since the models at levels 2 or 3 each contains two unspecified functions, we normalize λ₀(u) and μ₀(u) for identifiability purposes. In particular, we assume $\int_{0}^{τ_{0}} λ_{0} (u) d u = 1$ and $\int_{0}^{τ_{0}} λ_{0} (u) μ_{0} (u) d u = 1$ . The parameters of main interests are ξ₀, α₀ and β₀, which have interpretations as the log relative hazards of the failure event, the log relative rate of the recurrent events before the failure event, and the log relative mean of the backward markers at an recurrent event before the failure event. The proposed three level models recognize the ordering of observation by sequential conditioning, and also allow the processes to be correlated with failure time. Moreover, the model parameters are estimable from follow-up data subject to incomplete follow-up, as will be discussed in section 3.

Note that the models at levels 2 and 3 together imply an integrated model for the recurrent marker process V^B(u),

ν (u; x, t) = ν_{0} (t) exp (l_{0} (t) + γ_{0}^{T} x)

where ν(u; x, t)du = μ(u; x, t) × λ(u; x, t)du = E(dV^B(u)|X = x, T = t) is called the backward generalized rate of a recurrent marker process, ν₀(u) = λ₀(u)×μ₀(u), l₀(t) = f₀(t)+g₀(t) and γ₀ = α₀ + β₀. The parameter γ₀ has the interpretation of log relative mean of the backward recurrent marker process per unit time. In the CPCRA example, α₀ is log rate ratio of the number of opportunistic infections per unit time before death, β₀ is log mean ratio of the severity score of an opportunistic infection before death and γ₀ = α₀ + β₀ is log mean ratio of the severity-weighted frequency of opportunistic infection per unit time before death.

In levels 2 and 3 models, we only specify the rate and the mean of the processes, but not the full distribution. While a hazard function completely specifies a survival distributions, the specification of the full distribution of marked point processes is very difficult in general (Cox and Isham, 1980). Although a Poisson process assumption and a parametric assumption for the mark distribution given the history of the processes can specify the full likelihood, such assumptions are usually considered very restrictive in the recurrent event literature (Lin et al. 2001; Cai et al., 2010; among others). In contrast with those fully parameterized models, our semiparametric rate and mean models possess some desirable features and therefore provide more flexibility in modeling and data analysis. Furthermore, while shared frailty models are viable alternatives, they often require full parametric specification which is unappealing in the analysis of recurrent events. As a comparison, through the target parameters and multiple nonparametric functional parameters, our joint models in levels 1 to 3 handle the dependence structure of the three outcome components flexibly, which is known to be notoriously difficult in the recurrent event literature.

3. ESTIMATION AND INFERENCE

3.1 Finite-dimensional target parameters

Let 𝒩_i(t) = I(Y_i ≤ t, Δ_i = 1), i = 1, …, n, be counting processes of the observed failure events and $N_{i}^{*} (t) = N_{i} (t) - \int_{0}^{t} I (Y_{i} \geq s) exp (ξ_{0}^{T} X_{i}) d H_{0} (s)$ be martingale residual processes, where $H_{0} (t) = \int_{0}^{t} h_{0} (s) d s$ . For notational convenience, we further define the following: For any real-valued q-dimensional vector c, let $S^{(k)} (t; c) = n^{- 1} \sum_{i = 1}^{n} I (Y_{i} \geq t) X_{i}^{\otimes k} exp (c^{T} X_{i})$ , k = 0, 1, 2 where a^⊗0 = 1, a^⊗1 = a and a^⊗2 = aa^T, and s⁽^k⁾(t; c) = E[I(Y ≥ t)X^⊗^k exp(c^TX)]. Also, let X̄(t; c) = S⁽¹⁾(t; c)/S⁽⁰⁾(t; c) and x̄(t; c) = s⁽¹⁾(t; c)/s⁽⁰⁾(t; c).

It is well known that the log relative hazard parameter ξ₀ in the level 1 model can be estimated by maximizing the partial likelihood (Cox 1972), and is equivalent to solving the partial score equation U₁(τ₁; ξ) = 0, where

U_{1} (t; ξ) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} (X_{i} - \bar{X} (s; ξ)) d N_{i} (t) = 0.

A difficulty for estimating α₀ and β₀ in levels 2 and 3 models is that there are four unspecified nuisance functions, λ₀(u), μ₀(u), f₀(t), g₀(t), and there are two different time scales, the forward time scale t and the backward time scale u. For proportional hazards model with two time scales, Efron (2002) argued that the nuisance functions in two time scales cannot be eliminated simultaneously in the estimation of finite-dimensional parameters. Under the proposed three-level model, we develop the following method to eliminate the nuisance functions in both time scales and to estimate the target parameters α₀ and β₀. The method naturally extends the partial likelihood methodology to marked counting processes, with a major difference that the estimators for the levels 2 and 3 models cannot be derived from a profile likelihood approach similar to the partial likelihood for the level 1 model, since the rate and mean models do not specify the full distribution of the backward processes.

Define a bivariate process to record information of $M_{i}^{B} (u)$ , u ∈ [0, τ₀], for a subject with an uncensored event in $[τ_{0}, t] : M_{i} (t, u) = M_{i}^{B} (u) I (τ_{0} \leq Y_{i} \leq t, Δ_{i} = 1)$ . For t ∈ [τ₀, τ₁] and u ∈ [0, τ₀], under the conditional independent censoring condition, observe that

\begin{array}{l} E (M_{i} (d t, d u) ∣ Y_{i} \geq t, X_{i}) = E (N_{i} (d t) ∣ T_{i} \geq t, C_{i} \geq t, X_{i}) \times E (M_{i} (d t, d u) ∣ N_{i} (d t) = 1, C_{i} \geq t, X_{i}) \\ = E (N_{i} (d t) ∣ T_{i} \geq t, X_{i}) \times E (M_{i}^{B} (d u) ∣ N_{i} (d t) = 1, X_{i}) \\ = λ_{0} (u) h_{0} (t) exp (f_{0} (t) + (α_{0} + ξ_{0}) X_{i}) dtdu . \end{array}

(1)

Consider the reparametrization θ = α + ξ. For t ∈ [τ₀, τ₁] and u ∈ [0, τ₀], (1) implies $E (M_{i}^{*} (t, u; B_{0}^{M}, θ_{0})) = 0$ where

M_{i}^{*} (t, u; B_{0}^{M}, θ_{0}) = M_{i} (t, u) - \int_{v = 0}^{u} \int_{s = τ_{0}}^{t} I (Y_{i} \geq s) λ_{0} (v) exp (θ_{0} X_{i}) {d B}_{0}^{M} (s) d v,

(2)

θ₀ = α₀+ξ₀ and $B_{0}^{M} (t) = \int_{τ_{0}}^{t} exp (f_{0} (s)) d H_{0} (s)$ . Furthermore, $M_{i}^{*} (t, τ_{0}; B_{0}^{M}, θ_{0}) = M_{i} (t, τ_{0}) - \int_{s = τ_{0}}^{t} I (Y_{i} \geq s) exp (θ_{0} X_{i}) {d B}_{0}^{M} (s)$ since $\int_{0}^{τ_{0}} λ_{0} (u) d u = 1$ . This motivates us to consider the following set of estimating equations:

\frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{τ_{1}} M_{i}^{*} (d t, d u; B^{M}, θ) = 0

(3)

\frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{τ_{1}} X_{i} M_{i}^{*} (d t, d u; B^{M}, θ) = 0.

(4)

For a given θ, the solution to (3) is

{\hat{B}}^{M} (t; θ) = \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i} (d s, d u)}{\sum_{j = 1}^{n} \int_{0}^{τ_{0}} I (Y_{i} \geq s) λ_{0} (v) e^{θ X_{j}} d v} = \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i} (d s, d u)}{S^{(0)} (s; θ)},

where the last equality holds because $\int_{0}^{τ_{0}} λ_{0} (u) d u = 1$ . Replacing B^M with B̂^M, (4) becomes U₂(τ₁; θ) = 0 where

\begin{array}{l} U_{2} (t; θ) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} (X_{i} - \bar{X} (s; θ)) M_{i} (d s, d v) \\ = \frac{1}{n} \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} (X_{i} - \bar{X} (s; θ)) M_{i}^{B} (τ_{0}) N_{i} (d s) . \end{array}

Note that U₂(t; θ) only involves the target parameter θ, but not the nuisance functions λ₀(u) and $B_{0}^{M} (t)$ . Denote the solution of U₂(τ₁; θ) = 0 by θ̂, α₀ can be estimated by α̂ = θ̂ − ξ̂.

To estimate β₀, we consider a marked counting process $V_{i} (t, u) = V_{i}^{B} (u) I (τ_{0} \leq Y_{i} \leq t, Δ_{i} = 1)$ . Define ϕ₀ = α₀ + β₀ + ξ₀, $B_{0}^{V} (t) = \int_{τ_{0}}^{t} exp (f_{0} (s) + g_{0} (s)) d H_{0} (s)$ . Following similar arguments as above, we have $E (V_{i}^{*} (t, u; B_{0}^{V}, ϕ_{0})) = 0$ where

V_{i}^{*} (t, u; B_{0}^{V}, ϕ_{0}) = V_{i} (t, u) - \int_{v = 0}^{u} \int_{s = τ_{0}}^{t} I (Y_{i} \geq s) λ_{0} (v) μ_{0} (v) exp (ϕ_{0} X_{i}) {d B}_{0}^{V} (s) d v .

Furthermore, we can estimate ϕ₀ by solving the estimating equation U₃(τ₁; ϕ) = 0, where

\begin{array}{l} U_{3} (t; ϕ) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} (X_{i} - \bar{X} (s; ϕ)) V_{i} (d s, d u) \\ = \frac{1}{n} \sum_{i = 1}^{n} \int_{τ_{0}}^{t} (X_{i} - \bar{X} (s; ϕ)) V_{i}^{B} (τ_{0}) N_{i} (d s) . \end{array}

Note that U₃(t; ϕ) only involves the target parameter ϕ, but not other nuisance functions. Denote the solution of U₃(t; ϕ) = 0 by ϕ̂, β₀ can be estimated by β̂ = ϕ̂ − α̂ − ξ̂ = ϕ̂ − θ̂.

To study the asymptotic results, we introduce the following notations: ψ̂= (ξ̂^T, θ̂^T, ϕ̂^T)^T, $ψ_{0} = {(ξ_{0}^{T}, θ_{0}^{T}, ϕ_{0}^{T})}^{T}, η_{i 1} = \int_{0}^{τ_{1}} (X_{i} - \bar{x} (t; ξ_{0})) N_{i}^{*} (d t), η_{i 2} = \int_{0}^{τ_{0}} \int_{τ_{0}}^{τ_{1}} (X_{i} - \bar{x} (t; θ_{0})) M_{i}^{*} (d t, d u)$ and $η_{i 3} = \int_{0}^{τ_{0}} \int_{τ_{0}}^{τ_{1}} (X_{i} - \bar{x} (t; ϕ_{0})) V_{i}^{*} (d t, d u), η_{i} = {(η_{i 1}^{T}, η_{i 2}^{T}, η_{i 3}^{T})}^{T}, \sum = E (η_{1} η_{1}^{T}), A_{1} = E (\int_{0}^{τ_{1}} {(X - \bar{x} (t; ξ_{0}))}^{\otimes 2} I (Y \geq t) e^{ξ_{0} X} d H_{0} (t)), A_{2} = E (\int_{τ_{0}}^{τ_{1}} {(X - \bar{x} (t; θ_{0}))}^{\otimes 2} I (Y \geq t) e^{θ_{0} X} {d B}_{0}^{M} (t)), A_{3} = E (\int_{τ_{0}}^{τ_{1}} {(X - \bar{x} (t; ϕ_{0}))}^{\otimes 2} I (Y \geq t) e^{ϕ_{0} X} {d B}_{0}^{V} (t))$ , and A = A₁ ⊗ A₂ ⊗ A₃ where ⊗ denotes direct sum of matrices.

The following theorem indicates that the root-n scaled and centered versions of the parameter estimators of the three level models will jointly converge in distribution to a zero-mean 3 × q-variate normal distribution.

Theorem 1

Under the regularity conditions stated in the appendix, $\hat{ψ} \overset{a . s .}{\to} ψ_{0}$ and $\sqrt{n} (\hat{ψ} - ψ_{0}) \overset{d}{\to} N (0, V)$ where V = A⁻¹Σ(A⁻¹)^T.

The proof is given in Appendix A.1. As apparent in the definition of A and, Σ the estimation the asymptotic variance matrix V requires consistent estimation of certain cumulative versions of the baseline functions H₀(t), $B_{0}^{M} (t)$ and $B_{0}^{V} (t)$ , for which we discuss next.

3.2 Functional parameters

It is well known that H₀(t) can be estimated by the Breslow (1974) estimator,

\hat{H} (t; \hat{ξ}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{t} \frac{N_{i} (d s)}{S^{(0)} (s; \hat{ξ})} .

We could also estimate $B_{0}^{M} (t)$ and $B_{0}^{V} (t)$ by Breslow-type estimators

{\hat{B}}^{M} (t; \hat{θ}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i} (d s, d u)}{S^{(0)} (s; \hat{θ})}

and

{\hat{B}}^{V} (t; \hat{ϕ}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{V_{i} (d s, d u)}{S^{(0)} (s; \hat{ϕ})}

respectively. Note that the estimates B̂^M and B̂^V can be used for the estimation of the asymptotic covariance matrix of the finite-dimensional parameter, as well as for the model diagnostic procedure given in Section 3.3. A remark on the types of functional parameters considered in the literature is given in Section 5. By plugging in unknown parameters (ξ₀, θ₀, ϕ₀, H₀(t), $B_{0}^{M} (t), B_{0}^{V} (t)$ ) in the definition of A and Σ by their estimates, we obtain Â and Σ̂. Therefore, we can estimate the asymptotic variance matrix V by V̂ = Â⁻¹Σ̂(Â⁻¹)^T, which is shown to be a consistent estimator of V in Appendix A.2.

Furthermore, we can estimate $L_{0} (u) = \int_{0}^{u} λ_{0} (v) d v$ and $N_{0} (u) = \int_{0}^{u} ν_{0} (v) d v = \int_{0}^{u} λ_{0} (v) μ_{0} (v) d v$ by

{\hat{L}}_{0} (u) = \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} \frac{M_{i} (d s, u)}{\sum_{j = 1}^{n} I (Y_{j} \geq s) e^{\hat{θ} X_{j}}} / \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} \frac{M_{i} (d s, τ_{0})}{\sum_{j = 1}^{n} I (Y_{j} \geq s) e^{\hat{θ} X_{j}}}

and

{\hat{N}}_{0} (u) = \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} \frac{V_{i} (d s, u)}{\sum_{j} I (Y_{j} \geq s) e^{\hat{ϕ} X_{j}}} / \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} \frac{V_{i} (d s, τ_{0})}{\sum_{j = 1}^{n} I (Y_{j} \geq s) e^{\hat{ϕ} X_{j}}} .

To identify the limiting processes of the functional estimators, we define the following notations: $b_{0}^{M} (t; θ_{0}) = \int_{τ_{0}}^{t} \bar{x} (s; θ_{0}) {d B}_{0}^{M} (s), b_{0}^{V} (t; ϕ_{0}) = \int_{τ_{0}}^{t} \bar{x} (s; θ_{0}) {d B}_{0}^{V} (s)$ ,

\begin{matrix} Π_{i}^{M} (t) = \int_{τ_{0}}^{t} \frac{M_{i}^{*} (d s, τ_{0})}{s^{(0)} (s; θ_{0})} - b_{0}^{M} (t; θ_{0}) A_{2}^{- 1} η_{2 i}, \\ Π_{i}^{V} (t) = \int_{τ_{0}}^{t} \frac{V_{i}^{*} (d s, τ_{0})}{s^{(0)} (s; ϕ_{0})} - b_{0}^{V} (t; ϕ_{0}) A_{3}^{- 1} η_{3 i}, \\ Λ_{i}^{M} (u) = \int_{τ_{0}}^{τ_{1}} \frac{M_{i}^{*} (d s, u)}{s^{(0)} (s; θ_{0})} - L_{0} (u) b_{0}^{M} (τ_{1}; θ_{0}) A_{2}^{- 1} η_{2 i}, \\ Λ_{i}^{V} (u) = \int_{τ_{0}}^{τ_{1}} \frac{V_{i}^{*} (d s, u)}{s^{(0)} (s; ϕ_{0})} - N_{0} (u) b_{0}^{V} (τ_{1}; ϕ_{0}) A_{3}^{- 1} η_{3 i}, \\ Γ_{i}^{M} (u) = \frac{Λ_{i}^{M} (u) - L_{0} (u) Λ_{i}^{M} (τ_{0})}{B_{0}^{M} (τ_{1})}, \end{matrix}

and

Γ_{i}^{V} (u) = \frac{Λ_{i}^{V} (u) - N_{0} (u) Λ_{i}^{V} (τ_{0})}{B_{0}^{V} (τ_{1})} .

The following theorem summarizes the large sample properties of the functional estimators.

Theorem 2

Under the regularity conditions stated in the appendix,

$\sqrt{n} ({\hat{B}}^{M} (t; \hat{θ}) - B_{0}^{M} (t))$ converges to a mean-zero Gaussian process with a covariance function $ρ^{M} (t_{1}, t_{2}) = E (Π_{1}^{M} (t_{1}) Π_{1}^{M} (t_{2}))$ for t₁, t₂ ∈ [τ₀, τ₁].
$\sqrt{n} ({\hat{B}}^{V} (t; \hat{ϕ}) - B_{0}^{V} (t))$ converges to a mean-zero Gaussian process with a covariance function $ρ^{V} (t_{1}, t_{2}) = E (Π_{1}^{V} (t_{1}) Π_{1}^{V} (t_{2}))$ for t₁, t₂ ∈ [τ₀, τ₁].
$\sqrt{n} (\hat{L} (u) - L_{0} (u))$ converges to a mean-zero Gaussian process with a covariance function $ϱ^{M} (u_{1}, u_{2}) = E (Γ_{1}^{M} (u_{1}) Γ_{1}^{M} (u_{2}))$ for u₁, u₂ ∈ [0, τ₀].
$\sqrt{n} (\hat{N} (u) - N_{0} (u))$ converges to a mean-zero Gaussian process with a covariance function $ϱ^{V} (u_{1}, u_{2}) = E (Γ_{1}^{V} (u_{1}) Γ_{1}^{V} (u_{2}))$ for u₁, u₂ ∈ [0, τ₀].

Inference for the functional parameters can be conducted by the following resampling method. For simplicity, we only discuss the construction of simultaneous confidence bands for L₀(u), u ∈ [0, τ₀] in detail and similar procedures can be applied to construct confidence bands for N₀(u), $B_{0}^{M} (t)$ and $B_{0}^{V} (t)$ . Let ${\hat{Γ}}_{i}^{M} (u)$ be an estimator of $Γ_{i}^{M} (u)$ by plugging in unknown quantities by consistent estimators defined earlier. The limiting process of $\sqrt{n} (\hat{L} (u) - L_{0} (u))$ can be approximated by ${\tilde{ϱ}}^{M} (u) = n^{- 1 / 2} \sum_{i = 1}^{n} {\hat{Γ}}_{i}^{M} (u) G_{i}$ where G = (G₁, . . . , G_n) are independent standard normal variables which are generated independent of the observed data. Conditioning on the data, the only randomness in ϱ̃^M comes from (G₁, . . . , G_n), and we can show as in Appendix A.4 in Lin et al. (2000) that ϱ̃^M(u) and $\sqrt{n} (\hat{L} (u) - L_{0} (u))$ have the same limiting process. By simulating many realizations of G, we can obtain c_α/₂ which is the (1 − α) × 100 percentile of ${sup}_{u \in [0, τ_{0}]} ∣ {\tilde{ϱ}}^{M} (u) / \sqrt{{\tilde{ϱ}}^{M} (u, u)} ∣$ , where ${\hat{ϱ}}^{M} (u, u) = n^{- 1} \sum_{i = 1}^{n} {({\hat{Γ}}_{i}^{M} (u))}^{2}$ . Since L₀(u) is positive, we can construct confidence bands using the logarithmic transformation, and the values are given by $\hat{L} (u) exp {\pm n^{- 1 / 2} c_{α / 2} \sqrt{{\hat{ϱ}}^{M} (u, u)} / \hat{L} (u)}$ , u ∈ [0, τ₀]. Pointwise confidence intervals can be constructed by replacing c_α/₂ with the standard normal critical value z_α/₂.

3.3 Goodness-of-fit test

When the proposed model is true, residuals 𝒩*, ℳ* and 𝒱* are unbiased. Therefore, residual-based diagnostic test statistics can be constructed to examine potential model violation. We focus on testing against violation of the exponential link functions, since the parameter interpretation and the validity of reparametrizations θ and ϕ depends critically on the exponential link. For the proportional hazards model, Lin et al. (1993) considered a goodness-of-fit test statistic based on the following process:

T_{1} (x) = n^{- 1} \sum_{i = 1}^{n} I ({\hat{ξ}}^{T} X_{i} \leq x) {\hat{N}}_{i}^{*} (τ_{1}) .

They proposed a test based on sup_x |𝒯₁(x)| and showed that the test is consistent against incorrect link functions in the form of g(ξ*^TX), where g is not the exponential function. Inspired by this idea, we consider two additional processes for the recurrent events and the marker processes:

T_{2} (y) = n^{- 1} \sum_{i = 1}^{n} I ({\hat{θ}}^{T} X_{i} \leq y) {\hat{M}}_{i}^{*} (τ_{1}, τ_{0})

and

T_{3} (z) = n^{- 1} \sum_{i = 1}^{n} I ({\hat{ϕ}}^{T} X_{i} \leq z) {\hat{V}}_{i}^{*} (τ_{1}, τ_{0}) .

Let a = (x, y, z) and 𝒯 (a) = (𝒯₁(x), 𝒯₂(y), 𝒯₃(z))^T. A test statistic is given by 𝒮 = sup_a |𝒯(a)|. We use the following simulation method to approximate the null distribution of 𝒮. We show in Appendix A.4 that $\sqrt{n} T (a)$ converges to a mean-zero Gaussian process under the null hypothesis that the model is correctly specified, and $\sqrt{n} T (a) = n^{- 1 / 2} \sum_{i = 1}^{n} T_{i}^{†} (a) + o_{p} (1)$ . The limiting distribution of $\sqrt{n} T (a)$ can be approximated by $n^{- 1 / 2} \sum_{i = 1}^{n} {\hat{T}}_{i}^{†} (a) G_{i}$ , where ${\hat{T}}_{i}^{†} (a)$ replaces unknown functions in 𝒯^†(a) by their consistent estimators given in Appendix A.4 and G = (G₁, . . . , G_n) are independent normal variables generated independent of the data. By generating many realizations of G, the null distribution of $\sqrt{n} S$ is approximated by the distribution of ${sup}_{a} ∣ n^{- 1 / 2} \sum_{i = 1}^{n} {\hat{T}}_{i}^{†} (a) G_{i} ∣$ . Conditioning on observed data, the only randomness comes from G and we can use the arguments in Appendix A.5 of Lin et al. (2000) to show that the limiting processes of $\sqrt{n} T (a)$ and $n^{- 1 / 2} \sum_{i = 1}^{n} {\hat{T}}_{i}^{†} (a) G_{i}$ are the same.

4. NUMERICAL STUDIES

We studied the finite sample performance of the proposed estimators by Monte Carlo simulations. Data were generated 5000 times in each simulation, and each simulated data set consisted of 100, 200 or 400 observations. We simulated the data as follows. Two covariates were included in the model: a Bernoulli(0.5) covariate, and a standard Gaussian covariate. Failure time was generated by T = 0.75 + T′ where T′ is Weibull distributed with shape parameter 2 and scale parameter $1 / exp (0.5 \times ξ_{0}^{T} X)$ , where ξ₀ = (0.5, 1). Censoring time was generated from a uniform (0, 5) distribution independent of the failure time. Given T and X, the recurrent event processes at time u before death were generated from a non-homogeneous Poisson process with rate $2 (1 - exp (- u / 2)) exp (- log T + α_{0}^{T} X)$ where α₀ = (0.75, 0.25). Given T, X and occurrence of recurrent event at time u before death, the associated marks were generated from a gamma distribution with shape parameter 1 and scale parameter $exp (- 0.5 u - 0.5 log T + β_{0}^{T} X)$ where β₀ = (−0.5, −0.25) and u ∈ [0, 1]. The frequency of recurrent events and marker values were negatively correlated with survival time.

The simulation results are summarized in Table 1. In each case, the estimators for ξ₀, α₀, β₀ had small biases, the variance estimates were close to the sampling variation of the parameter estimates and the empirical coverage of the 95% confidence intervals based on the proposed sandwich variance estimate was close to the nominal value. We conducted additional simulations to study the performance of the proposed model diagnostic tests for the exponential link function. To approximate the null distributions, the resampling procedure described in Section 3.3 was performed 1000 times for each simulation data set. Under exponential link functions, the tests at 5% significance level had an empirical rejection proportion of 5.3% and 5.1% for n = 200 and n = 500, respectively. Under a misspecified linear link function, the simulated powers were 73.1% and 96.3% for n = 200 and n = 500, respectively. Under a logarithmic link function, the empirical powers were 82.2% and 99.2% for n = 200 and n = 500, respectively.

Table 1.

Summary of simulation study: Bias, standard error and coverage probabilities.

(a) Sample size=100

Truth

Bias

SSE

SEE CP(%)

ξ₀

0.50

0.015

0.279

0.274

94.6

1.00

0.033

0.297

0.287

92.8

α₀

0.75

0.013

0.245

0.233

93.7

0.25

0.017

0.282

0.268

92.9

β₀

−0.5

0.011

0.216

0.199

91.7

−0.25

0.001

0.237

0.216

9.13

L₀(0.5)

0.27

0.002

0.053

0.048

9.17

N₀(0.5)

0.316

0.001

0.078

0.069

90.6

B_{0}^{M} (1.5)

0.835

0.005

0.266

0.258

94.4

B_{0}^{V} (1.5)

0.539

0.006

0.199

0.189

93.5

(b) Sample size=200

Truth

Bias

SSE

SEE CP(%)

ξ₀

0.50

0.004

0.190

0.189

94.9

1.00

0.015

0.204

0.198

93.2

α₀

0.75

0.002

0.166

0.161

94.2

0.25

0.008

0.189

0.184

93.8

β₀

−0.5

0.002

0.149

0.143

93.7

−0.25

0.003

0.162

0.153

92.6

L₀(0.5)

0.27

0.001

0.036

0.035

93.4

N₀(0.5)

0.316

0.001

0.054

0.051

93.6

B_{0}^{M} (1.5)

0.835

0.001

0.186

0.182

94.6

B_{0}^{V} (1.5)

0.539

0.001

0.139

0.136

94.3

Truth

Bias

SSE

SEE

CP(%)

ξ₀

0.50

0.004

0.134

0.132

94.7

1.00

0.004

0.139

94.7

α₀

0/75

0.003

0.114

0.113

95.2

0.25

0.006

0.133

0.129

94.0

β₀

−0.5

0.004

0.103

0.101

94.4

−0.25

0.003

0.111

0.108

93.8

L₀(0.5)

0.27

0.001

0.026

0.025

94.2

N₀(0.5)

0.316

0.001

0.039

0.037

93.6

B_{0}^{M} (1.5) β_{0}

0.835

0.002

0.132

0.128

94.3

M_{0}^{V} (1.5)

0.539

0.002

0.100

0.096

93.6

Open in a new tab

Bias and SSE are the sampling bias and sampling standard error respectively. SEE is the sample average of the standard error estimator, and CP is the empirical coverage probability of the 95% confidence intervals.

The proposed methods were also applied to analyze the recurrent marker data collected from the ddC/ddI trials of the Terry Beirn Community Programs for Clinical Research on AIDS (CPCRA) study. The CPCRA study is a randomized trial comparing didanosine (ddI) and zalcitabine (ddC) as treatments for HIV-infected patients. Recurrent opportunistic infections are common to HIV-infected individuals because of their compromised immune system. The trial collected the time from randomization to death or censoring, along with the occurrence of recurrent opportunistic diseases and the severity of each infection. Each individual was observed to experience between 0 and 5 recurrent infections, and a severity score is assessed by ten physicians at each event (Neaton et al, 1994). Table 2 shows the results from the proposed regression model with ddI/ddC treatment arm, age, gender and race (African American vs. non African American) as explanatory variables. The diagnostic tests for exponential link functions gave p-values of 0.43, 0.82 and 0.96 for models with τ₀ = 4, 5, 6 months. Results show that there is a significant gender and race effect on the frequency or severity of opportunistic infections up to six months before death. In particular, female had 76% less opportunistic infections in the last six months of life than male after controlling for other explanatory variables and failure time (95% C.I.: 93% less to 10% less). It is estimated that African Americans had 55% less opportunistic infections in the last six months of life compare to non-African Americans (95% C.I.: 75% less to 18% less). In contrast, using a forward proportional rate model as in Lin et al. (2000) and Cai et al. (2010), it is estimated that females had 23% less opportunistic infections (95% C.I.: 82% less to 220% more) than males and African Americans had 36% less opportunistic infections (95% C.I.: 60% less to 4% more) than non-African Americans. The estimated baseline backward cumulative functions L̂₀(u) and N̂₀(u) are given in Figure 1, which shows that the frequency and severity of opportunistic infections had a sharp increase between two to four weeks prior to death.

Table 2.

Regression Analysis of the CPCRA data with four explanatory variables: Trt(1=ddC,0=ddI), Age, Sex(1=female, 0=male), Race(1=African American,0=others)

		τ₀ = 4 months		τ₀ = 5 months		τ₀ = 6 months
		Est	95%C.I.	Est	95%C.I.	Est	95%C.I.
ξ₀	Trt	−0.21	(−0.49,0.08)
	Age	0.01	(−0.01,0.03)
	Sex	0.18	(−0.32,0.68)
	Race	0.05	(−0.30,0.40)
α₀	Trt	−0.03	(−0.48,0.42)	0.02	(−0.40,0.44)	0.09	(−0.36,0.54)
	Age	0.01	(−0.02,0.03)	0.01	(−0.02,0.03)	0.01	(−0.02,0.04)
	Sex	−0.76	(−1.82,0.31)	−1.40	(−2.71,−0.76)	−1.42	(−2.73,−0.10)
	Race	−1.19	(−1.96,−0.44)	−0.76	(−1.35,−0.18)	−0.80	(−1.39,−0.20)
β₀	Trt	−0.07	(−0.24,0.11)	−0.02	(−0.17,0.13)	−0.01	(−0.16,0.14)
	Age	0.00	(−0.01,0.01)	0.00	(−0.01,0.01)	0.00	(−0.01,0.01)
	Sex	−0.58	(−1.43,0.27)	−0.85	(−2.28,0.58)	−0.85	(−2.29,0.58)
	Race	0.09	(−0.14,0.33)	0.14	(−0.02,0.29)	0.07	(−0.16,0.31)
γ₀	Trt	−0.10	(−0.56,0.37)	−0.03	(−0.45,0.44)	0.08	(−0.38,0.55)
	Age	0.01	(−0.02,0.04)	0.01	(−0.02,0.03)	0.01	(−0.02,0.03)
	Sex	−1.34	(−2.73,0.05)	−2.25	(−4.21,−0.29)	−2.26	(−4.24,−0.29)
	Race	−1.09	(−1.09,−0.28)	−0.63	(−1.23,−0.03)	−0.72	(−1.36,−0.09)

Open in a new tab

Estimates of baseline backward cumulative functions and 95% confidence bands for CPCRA data. (a) L̂₀(u), (b) N̂₀(u).

5. CONCLUDING REMARKS

In this paper, we proposed a semiparametric regression model for studying the terminal behavior of recurrent marker processes. The model consists of three levels, the first level is a proportional hazards model for a failure time, the second level is a proportional rate model for a backward recurrent event process and the third level is a proportional mean model for a backward marker given a recurrent event occurrence before death. The combined model involves three-level modeling in which the backward process models are constructed with failure time included as a covariate, and the model for backward marker measurement conditions on both the failure time and the occurrence of a recurrent event. This three-level conditional model takes into account of the order of data observation and dependence is allowed among the three outcome variables. For this complicated problem, we proposed a conceptually sophisticated yet analytically simple procedure for estimation and inference. Specifically, by carefully constructing the flexible model structure in Section 3, we end up with multiple cancellation of nuisance functions in levels 1 to 3 models from the proposed estimating equations, which we consider a novel procedure for estimation. Alternative models can be developed in the future, for example, using subject-specific latent variables to model the dependence among the failure time, the backward recurrent event process and the backward marker process.

A referee indicated that Liu et al. (2007) proposed a turn-back-time method to negate the impact of recurrent events immediately prior to death. In particular, they modeled longitudinal costs over discrete times from an initial event, and assumed that there is a uniform increase in costs for the last b months, which has a similar flavor as the approach of Wulfsohn and Tsiatis (1997). While we agree that the model of Liu et al. is novel and interesting, it is important to point out that our model and analytical approach are, in fact, quite different from the one in Liu et al. (2007), whose main interest was to model the association of the risk factors with monthly medical costs. The approach of Liu et al. did not consider modeling recurrent events (such as repeated hospitalizations) which in fact have an important role to determine the medical cost. Also, the regression parameters for covariates are assumed to be the same for the initial period and the final period before death. Since only monthly costs are modeled, their approach does not serve to answer questions such as whether a certain treatment is associated with more frequent hospitalizations. Our model focused on random recurrent events where markers are observed only at recurrent events but not on regular intervals. Another important distinction is that they artificially censor the cost information before failure or censoring events in order to focus on estimation of the initial periods. Censoring of cost information before failure events is also considered by Huang and Wang (2003). Our focus is entirely in the opposite direction: instead of discarding information from the last period before the events, we only use those information to answer important scientific questions relating to this period.

Similar to the difficulty encountered in the existing literature of flexible semiparametric joint modeling (Lin and Ying, 2001, among others), the de-convolution of mixed functions is generally hard and tedious, if not impossible, and in this paper we only estimate mixed functional parameters such as $B_{0}^{M} (t)$ and $B_{0}^{V} (t)$ . While non-mixed cumulative functions can be estimated when failure time is assumed to be independent of the processes given covariates (Lin et al, 2000, Cai et al., 2010), our model is more general and can handle informative survival and recurrent events which is known to be notoriously difficult in the recurrent event literature. Estimation of cumulative mixed function is a tradeoff for a more general model, but the functional estimates can still be interpreted as cumulative mark-specific hazard functions (Huang and Louis, 1998), and are essential for the estimation of the asymptotic covariance matrix of the finite-dimensional parameter (as shown in Appendix A.2), and for the model diagnostic procedure given in Section 3.3.

An associate editor suggested us to consider an extension to time-varying covariates, which requires careful thoughts as there are two time scales in the models (forward and backward times). Let X(t) be a vector of time-varying covariates and 𝒳(t) the history of the time-varying process X(·) up to t. Suppose we consider the following models:

Level 1

Proportional hazard model of T conditioning on 𝒳(t), depends only on X(t) = x(t),

h (t; x) = h_{0} (t) exp (ξ_{0}^{T} x (t))

Level 2^†

Proportional rate model of M^B(u) conditioning on (T = t, 𝒳(t)), depends only on (T = t, X(t − u) = x(t − u)),

λ (u; x (t - u), t) = λ_{0} (u) exp (f_{0} (t) + α_{0}^{T} x (t - u))

Level 3^†

Proportional mean model of Q^B(u) conditioning on T = t, 𝒳(t) and dM^B(u) = 1 only depends on (T = t, X(t − u) = x(t − u)):

μ (u; x (t - u), t) = μ_{0} (u) exp (g_{0} (t) + β_{0}^{T} x (t - u)) .

For t ≥ τ₀ and u ∈ [0, τ₀], we observe analogous to (1) that

E (M_{i} (d t, d u) ∣ Y_{i} \geq t, X_{i} (t)) = λ_{0} (u) h_{0} (t) exp (f_{0} (t) + α_{0} X_{i} (t - u) + ξ_{0} X_{i} (t)) dtdu,

and define

M_{i}^{†} (t, u; B_{0}^{M}, α_{0}, ξ_{0}) = M_{i} (t, u) - \int_{v = 0}^{u} \int_{s = τ_{0}}^{t} I (Y_{i} \geq s) λ_{0} (v) exp (α_{0} X_{i} (s - v) + ξ_{0} X_{i} (s)) {d B}_{0}^{M} (s) d v,

then $E (M_{i}^{†} (t, u; B_{0}^{M}, θ_{0})) = 0$ .

Following previous arguments, we derive

{\hat{B}}^{M} (t; α, ξ) = \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i} (d s, d u)}{\sum_{j = 1}^{n} \int_{0}^{τ_{0}} I (Y_{i} \geq s) λ_{0} (v) e^{α^{T} X_{j} (s - v) + ξ^{T} X_{j} (s)} d v} .

Note that λ₀(v) is not eliminated through integration as in the case of time-independent covariates. In fact, it is typical that one of the nuisance parameters cannot be eliminated under a proportional hazard model with two time scales (Efron, 2002). As discussed in Efron (2002), we can use a cubic regression spline for approximating λ₀(u) for u ∈ [0, τ₀]. Since we assume that $\int_{0}^{τ_{0}} λ_{0} (u) d u = 1$ , λ₀(u) is analogous to a density function on [0, τ₀], and we can use the log-spline density approximation as in Kooperberg and Stone (1991): $λ_{0} (u) = exp (κ_{0}^{T} B (u) - c (κ_{0}))$ where B(u) = (B₁(u), …, B_p(u))^T are spline basis functions, κ₀ is a p-dimensional vector of unknown coefficients and $c (κ_{0}) = log (\int exp (κ_{0}^{T} B (u)) d u)$ is a normalizing constant. Let $X_{i}^{†} (t, u) = {(B (u), X_{i} (t - u))}^{T}$ , a system of estimating equations can be constructed as

\int_{0}^{τ_{0}} \int_{τ_{0}}^{τ_{1}} (X_{i}^{†} (s, u) - {\bar{X}}_{i}^{†} (s)) M_{i} (d t, d u) = 0

where

{\bar{X}}_{i}^{†} (t) = \frac{\sum_{j = 1}^{N} I (Y_{j} \geq t) \int_{0}^{τ_{0}} X_{j} (t, u) exp (κ^{T} B (u) + α^{T} X_{j} (t - u) + {\hat{ξ}}^{T} X_{j} (t)) d u}{\sum_{j = 1}^{N} I (Y_{j} \geq t) \int_{0}^{τ_{0}} exp (κ^{T} B (u) + α^{T} X_{j} (t - u) + {\hat{ξ}}^{T} X_{j} (t)) d u} .

The above estimating equation is p + q dimensional with p + q unknown parameters. A similar estimating equation can be derived for estimating γ₀ = α₀ + β₀. Alternatively, when the level 2 and 3 models are formulated as

Level 2*

Proportional rate model of M^B(u) conditioning on (T = t, 𝒳(t)), depends only on (T = t, X(t) = x(t)),

λ (u; x (t), t) = λ_{0} (u) exp (f_{0} (t) + α_{0}^{T} x (t))

Level 3*

Proportional mean model of Q^B(u) conditioning on T = t, 𝒳(t) and dM^B(u) = 1 only depends on (T = t, X(t) = x(t)):

μ (u; x (t), t) = μ_{0} (u) exp (g_{0} (t) + β_{0}^{T} x (t)) .

Then,

E (M_{i} (d t, d u) ∣ Y_{i} \geq t, X_{i} (t)) = λ_{0} (u) h_{0} (t) exp (f_{0} (t) + (α_{0} + ξ_{0}) X_{i} (t)) dtdu,

and

\begin{array}{l} {\hat{B}}^{M} (t; α, ξ) = \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i} (d s, d u)}{\sum_{j = 1}^{n} \int_{0}^{τ_{0}} I (Y_{i} \geq s) λ_{0} (v) e^{(α + ξ) X_{j} (s)} d v} \\ = \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i} (d s, d u)}{\sum_{j = 1}^{n} I (Y_{i} \geq s) λ_{0} (v) e^{(α + ξ) X_{j} (s)}} . \end{array}

In this case, λ₀(u) can be eliminated from the estimation of finite-dimensional parameters, and the method proposed for time-independent covariates can be directly extended. For this model, time-varying covariates are assumed observable at the failure event, but we can use time-varying covariates which are defined as summary measures of the end-of-life history of certain processes. For example, with a time-varying treatment status Z(t), X(t) can be defined as the fraction of years in which an individual takes the treatment between time t − τ₀ and t, that is $X (t) = \int_{t - τ_{0}}^{t} Z (t) d t / τ_{0}$ . Using this definition, X(t) is well defined for individuals with Y ≥ τ₀, which are individuals being included in the estimation of level 2 and 3 models. We can also change the domain of integration from [0, τ₁] to [τ₀, τ₁] for proportional hazards model under this definition of X(t).

In this paper we study the terminal behavior of recurrent marker processes before failure events. In applications, it could be clinically more relevant to consider the complete history of recurrent events for evaluating treatments on disease progression. Nevertheless, studying the full history of recurrent marker processes is not straightforward, particularly when it is complicated by the presence of terminal behavior before failure event of interest. In this paper we focus on modeling the terminal behavior of processes as it is often an important scientific question. A promising future research direction is to jointly model the forward and backward processes to study the complete history of processes, for which one has to pay extra attention to model consistency in joint modeling.

Acknowledgments

This research was partially supported by the National Institutes of Health grants P01 CA 098252, R01 AI 089341 and R01 HL 122212.

APPENDIX

A.1 Asymptotic properties of finite-dimensional parameters

We assume the following regularity conditions:

Parameters (ξ, θ, ϕ) ∈ Ξ × Θ × Φ, where Ξ, Θ and Φ are compact subspaces of ℝ^q.
P(C ≥ τ₁) > 0.
The support for M^B(τ₀), V^B(τ₀) and X are bounded.
The matrix A is positive definite.

These conditions are analogous to those of Andersen and Gill (1982) for the proportional hazards model and Lin et al. (2000) for the proportional rate model for recurrent events.

We first prove the consistency of the proposed estimators. It follows from Andersen and Gill (1982) that the maximum partial likelihood estimator ξ̂ is a consistent estimator of ξ₀. To show the consistency of θ̂, we first consider

L_{2} (θ) = \frac{1}{n} [\sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} {(θ - θ_{0})}^{T} X_{i} M_{i} (d t, τ_{0}) - \int_{τ_{0}}^{τ_{1}} log {\frac{S^{(0)} (t; θ)}{S^{(0)} (t; θ_{0})}} M (d t, τ_{0})] .

where $M (t, u) = n^{- 1} \times \sum_{i = 1}^{n} M_{i} (t, u)$ . Since ℳ(t, τ₀) and S⁽⁰⁾(t; θ) have bounded variations, it follows from strong law of large numbers that L₂(θ) converges almost surely to

L_{2} (θ) = E [\int_{τ_{0}}^{τ_{1}} {(θ - θ_{0})}^{T} X_{1} M_{1} (d t, τ_{0}) - \int_{τ_{0}}^{τ_{1}} log {\frac{s^{(0)} (t; θ)}{s^{(0)} (t; θ_{0})}} M_{1} (d t, τ_{0})]

for every θ. Note that

\frac{\partial^{2} L_{2} (θ)}{\partial θ^{2}} = - \frac{1}{n} \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} {[X_{i} - \bar{X} (t; θ)]}^{\otimes 2} I (Y_{i} \geq t) exp (θ^{T} X_{i}) \frac{M_{i} (d t, τ_{0})}{S^{(0)} (t; θ)}

which is negative semidefinite and ∂L₂/∂θ is U₂(τ₁; θ). Also, it can be seen that ∂ℒ₂(θ₀)/∂θ = 0 and $\partial^{2} L_{2} (θ_{0}) / \partial θ^{2} = - E (η_{21} η_{21}^{T})$ under the model assumptions. Since L₂(θ) and ℒ₂(θ) are both concave functions and the parameter space is compact, it follows that ${sup}_{θ \in Θ} ∣ L_{2} (θ) - L_{2} (θ) ∣ \overset{a . s .}{\to} 0$ (Rockafellar, 1970). Note that θ̂ is the unique maximizer for L₂ and θ₀ is the unique maximizer of ℒ₂. Therefore, following the arguments in Appendix A.1 of Lin et al. (2000), we can show that θ̂ converges almost surely to θ₀. Strong consistency of ϕ̂ can be proven using similar arguments.

We will then prove the weak convergence results. Let U*(t; ψ) = (U₁(t; ξ)^T, U₂(t, θ)^T, U₃(t, ϕ)^T)^T. It can be seen that

U^{*} (t; ψ_{0}) = {\bar{D}}_{X}^{*} (t) - \int_{0}^{t} {\bar{X}}^{*} (s; ψ_{0}) d {\bar{D}}^{*} (s)

where ${\bar{D}}^{*} (t) = n^{- 1} \times {(\sum N_{i}^{*} {(t)}^{T}, \sum M_{i}^{*} {(t, τ_{0})}^{T}, \sum V_{i}^{*} {(t, τ_{0})}^{T})}^{T}$ and ${\bar{D}}_{X}^{*} (t) = n^{- 1} \times {(\sum X_{i} N_{i}^{*} {(t)}^{T}, \sum X_{i} M_{i}^{*} {(t, τ_{0})}^{T}, \sum X_{i} V_{i}^{*} {(t, τ_{0})}^{T})}^{T}$ . Since $N_{i}^{*} (t), M_{i}^{*} (t, τ_{0})$ and $V_{i}^{*} (t, τ_{0})$ can be expressed as sums and products of monotone functions in t with bounded second moments, which implies that 𝒟̄*(t) and ${\bar{D}}_{X}^{*} (t)$ are Donsker (van der Vaart and Wellner (1997), p. 215). Then it follows from functional central limit theorem (Pollard (1990), p.53; van der Vaart and Wellner (1997), p. 211) that ( $\sqrt{n} \bar{D} (t), \sqrt{n} {\bar{D}}_{X} (t)$ ) converges weakly to zero-mean Gaussian processes. Also, $\sqrt{n} {S^{(0)} (t; c) - s^{(0)} (t; c)}$ and $\sqrt{n} {S^{(1)} (t; c) - s^{(1)} (t; c)}$ converges weakly to zero-mean Gaussian processes as shown in Lemma 5.1 in Tsiatis (1981). Using Lemma 3 of Gill (1989), one can establish that the mapping U*(·, ϕ) is compactly differentiable with respect to the supremum norm, and by the functional delta method, one can conclude that

\sqrt{n} U^{*} (τ; ψ_{0}) = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} η_{i} + o_{p} (1)

where $η_{i} = {(η_{i 1}^{T}, η_{i 2}^{T}, η_{i 3}^{T})}^{T}, η_{i 1} = \int_{0}^{τ_{1}} (X_{i} - \bar{x} (t; ξ_{0})) N_{i}^{*} (d t), η_{i 2} = \int_{0}^{τ_{0}} \int_{τ_{0}}^{τ_{1}} (X_{i} - \bar{x} (t; θ_{0})) M_{i}^{*} (d t, d u)$ and $η_{i 3} = \int_{0}^{τ_{0}} \int_{τ_{0}}^{τ_{1}} (X_{i} - \bar{x} (t; ϕ_{0})) V_{i}^{*} (d t, d u)$ . It follows that the limiting variance of $\sqrt{n} U^{*} (τ_{1}; ψ_{0})$ is Σ.

By Taylor series expansion,

\sqrt{n} (\hat{ψ} - ψ_{0}) = \sqrt{n} {\hat{A}}^{- 1} (ψ^{*}) U^{*} (τ_{1}; ψ_{0})

where Â(ψ) = −∂U*(τ₁; ψ)/∂ψ and ψ* is on the line segment joining ψ̂ and ψ₀. Let A(ψ) = E[−∂U* (τ₁; ψ)/∂ψ]. By similar arguments as in the proof of consistency, one can show that ${sup}_{ψ \in Ψ} ∣ \hat{A} (ψ) - A (ψ) ∣ \overset{a . s .}{\to} 0$ where Ψ = Ξ × Θ × Φ. Together with the fact that ψ̂ is strongly consistent for ψ₀, A(ψ) is continuous at ψ₀ and A(ψ₀) = A, we have Â(ψ*) converges almost surely to A. Therefore,

\sqrt{n} (\hat{ψ} - ψ_{0}) = \sqrt{n} A^{- 1} U^{*} (τ_{1}; ψ_{0}) + o_{p} (1)

and $\sqrt{n} (\hat{ψ} - ψ_{0})$ converges in distribution to a multivariate normal distribution with mean 0 and variance V = A⁻¹ΣA⁻¹.

A.2 Consistency of B̂^M(t; θ̂), B̂^V (t; ϕ̂) and V̂

Since {V(τ₀)N(t), t ∈ [τ₀, τ₁]} is a Donsker class, it is also Glivenko-Cantelli. Combined with the Glivenko-Cantelli property of S⁽⁰⁾(t; θ), we have

{\hat{B}}^{M} (t; θ) \overset{a . s .}{\to} \int_{τ_{0}}^{t} {\frac{s^{(0)} (u; θ_{0})}{s^{(0)} (u; θ)}} {d B}_{0}^{M} (u)

uniformly for t ∈ [τ₀, τ₁] and θ in a neighborhood of θ₀. Also the derivative of B̂^M(t; θ) with respect to θ is uniformly bounded for all large n and for θ in a neighborhood of θ₀, strong consistency of θ̂ implies B̂^M(t; θ̂) converges almost surely to $B_{0}^{M} (t)$ uniformly in t ∈ [τ₀, τ₁]. Similarly, B̂^V(t; ϕ̂) converges almost surely to $B_{0}^{V} (t)$ uniformly in t ∈ [τ₀, τ₁]. These results, together with the almost sure convergence of ϕ̂, X̄ (t; ξ̂), X̄(t; θ̂) and X̄(t; ϕ̂) to ϕ₀, x̄(t, ξ₀), x̄(t, θ₀) and x̄(t, ϕ₀) respectively, implies that $n^{- 1} \sum_{i = 1}^{n} {| | {\hat{η}}_{i} - η_{i} | |}_{F}^{2} \overset{a . s .}{\to} 0$ , where ||·||_F is the matrix Frobenius norm. In addition, $n^{- 1} \sum_{i = 1}^{n} η_{i} η_{i}^{T} \overset{a . s .}{\to} \sum$ by the strong law of large numbers. Therefore, $\sum^{^} \overset{a . s .}{\to} \sum$ . Furthermore, ψ̂ and Â (ψ₀) converges almost surely to ψ₀ and A and this implies the almost sure convergence of Â to A. Therefore, $\hat{V} = {\hat{A}}^{- 1} \sum^{^} {({\hat{A}}^{- 1})}^{T} \overset{a . s .}{\to} A^{- 1} \sum {(A^{- 1})}^{T} = V$ .

A.3 Weak convergence of functional parameters

Consider the decomposition

\sqrt{n} [{\hat{B}}^{M} (t; \hat{θ}) - B_{0}^{M} (t)] = \sqrt{n} [{\hat{B}}^{M} (t; θ_{0}) - B_{0}^{M} (t)] + \sqrt{n} [{\hat{B}}^{M} (t; \hat{θ}) - {\hat{B}}^{M} (t; θ_{0})] .

(5)

The first term on the right-hand side of (5) is

n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i}^{*} (d s, d u)}{S^{(0)} (s; θ_{0})} = n^{- 1 / 2} \sum_{i = 1}^{n} \int_{0}^{τ_{0}} \int_{τ_{0}}^{t} \frac{M_{i}^{*} (d s, d u)}{s^{(0)} (s; θ_{0})} + o_{p} (1),

which can be shown by arguments as in Appendix A.1. Furthermore, Taylor series expansion shows that the second term on the right-hand side of equation (5) equals $- \sqrt{n} b^{M} (t; θ^{*}) (\hat{θ} - θ_{0})$ , and b^M(t; θ) converges uniformly to

b_{0}^{M} (t; θ) = \int_{τ_{0}}^{t} \bar{x} (u; θ) {d B}_{0}^{M} (u) .

Combine with the results we established earlier that $\sqrt{n} (\hat{θ} - θ_{0}) = n^{- 1 / 2} \sum_{i = 1}^{n} η_{i 2} + o_{p} (1)$ , the second term on the right-hand side of (5) equals

- n^{- 1 / 2} \sum_{i = 1}^{n} b_{0}^{M} (t; θ_{0}) A_{2}^{- 1} η_{i 2} + o_{p} (1) .

Hence, $\sqrt{n} [{\hat{B}}^{M} (t; \hat{θ}) - B_{0}^{M} (t)] = n^{- 1 / 2} \sum_{i = 1}^{n} Π_{i}^{M} (t) + o_{p} (1)$ . The desired result follows from the functional central limit theorem.

To show the weak convergence of $\sqrt{n} (\hat{L} (u) - L_{0} (u))$ , we first define

{\hat{B}}^{M} (u; \hat{θ}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{u} \int_{τ_{0}}^{τ_{1}} \frac{M_{i} (d s, d v)}{S^{(0)} (s; \hat{θ})} .

Since ℳ(t, u) is increasing in t and u, the collection {ℳ(t, u), t ∈ [τ₀, τ₁], u ∈ [0, τ₀]} is Donsker and is also Glivenko-Cantelli. Using arguments as in Appendix A.2, we can show that ℬ̂^M(u; θ̂) converges almost surely to $L_{0} (u) B_{0}^{M} (τ_{1})$ uniformly in u ∈ [0, τ₀]. Note that ℬ̂^M(τ₀, θ) = B̂^M(τ₁; θ), and ℬ̂^M(τ₀, θ̂) converges almost surely to $L_{0} (τ_{0}) B_{0}^{M} (τ_{1}) = B_{0}^{M} (τ_{1})$ . Therefore, L̂₀(u) = ℬ̂^M(u; θ̂)/ℬ̂^M(τ₀; θ̂) converges almost surely to L₀(u) uniformly for u ∈ [0, τ₀]. Using similar arguments, we can show that N̂₀(u) converges almost surely to N₀(u) uniformly for u ∈ [0, τ₀]. Using similar arguments as above, we can show that $\sqrt{n} ({\hat{B}}^{M} (u; \hat{θ}) - L_{0} (u) B_{0}^{M} (τ_{1})) = n^{- 1 / 2} \sum_{i = 1}^{n} Λ_{i}^{M} (u) + o_{p} (1)$ where

Λ_{i}^{M} (u) = \int_{0}^{u} \int_{τ_{0}}^{τ_{1}} \frac{M_{i}^{*} (d s, d v)}{s^{(0)} (s; θ)} - L_{0} (u) b_{0}^{M} (τ_{1}; θ_{0}) A_{2}^{- 1} η_{2 i} .

By functional delta method, we can show that

\sqrt{n} (\hat{L} (u) - L_{0} (u)) = n^{- 1 / 2} \sum_{i = 1}^{n} [\frac{Λ_{i}^{M} (u) - L_{0} (u) Λ_{i}^{M} (τ_{0})}{B_{0}^{M} (τ_{1})}] + o_{p} (1) .

A.4 The null distribution of 𝒮

Let a = (x, y, z) ∈ ℝ³, 𝒯(a) = (𝒯₁(x), 𝒯₂(y), 𝒯₃(z))^T. By arguments as in Appendix A.3 and in the Appendices of Lin and Ying (1993) and Lin et al. (2000), we can show that under the null hypothesis that the model assumption is correct, $\sqrt{n} T (a) = n^{- 1 / 2} \sum_{i = 1}^{n} T_{i}^{*} (a) + o_{p} (1)$ , where $T_{i}^{*} (a) = {(T_{1 i}^{*} (x), T_{2 i}^{*} (y), T_{3 i}^{*} (z))}^{T}$ ,

\begin{matrix} T_{1 i}^{*} (x) = \int_{0}^{τ_{1}} [I (ξ_{0}^{T} X \leq x) - \frac{s_{r} (t, x; ξ_{0})}{s^{(0)} (t; ξ_{0})} - b_{r} (x; ξ_{0}) A_{1}^{- 1} {X_{i} - \bar{x} (t; ξ_{0})}] N_{i}^{*} (d t) \\ T_{2 i}^{*} (y) = \int_{0}^{τ_{1}} [I (θ_{0}^{T} X \leq y) - \frac{s_{r} (t, y; θ_{0})}{s^{(0)} (t; θ_{0})} - b_{r}^{M} (y; θ_{0}) A_{2}^{- 1} {X_{i} - \bar{x} (t; θ_{0})}] M_{i}^{*} (d t, τ_{0}), \\ T_{31}^{*} (z) = \int_{τ_{0}}^{τ_{1}} [I (ϕ_{0}^{T} X \leq z) - \frac{s_{r} (t, z; ϕ_{0})}{s^{(0)} (t; ϕ_{0})} - b_{r}^{V} (z; ϕ_{0}) A_{3}^{- 1} {X_{i} - \bar{x} (t; ϕ_{0})}] V_{i}^{*} (d t, τ_{0}), \\ s_{r} (t, x, c) = E [I (Y \geq t, c^{T} X \leq x) exp (c^{T} X)], \\ b_{r} (x; ξ_{0}) = E [\int_{0}^{τ_{1}} I (Y \geq t, ξ_{0}^{T} X \leq x) exp (ξ_{0}^{T} X) (X - \bar{x} (t; ξ_{0})) d H_{0} (t)], \\ b_{r}^{M} (y; θ_{0}) = E [\int_{τ_{0}}^{τ_{1}} I (Y \geq t, θ_{0}^{T} X \leq y) exp (θ_{0}^{T} X) (X - \bar{x} (t; θ_{0})) {d B}_{0}^{M} (t)], \\ b_{r}^{V} (z; ϕ_{0}) = E [\int_{τ_{0}}^{τ_{1}} I (Y \geq t, ϕ_{0}^{T} X \leq z) exp (ϕ_{0}^{T} X) (X - \bar{x} (t; θ_{0})) {d B}_{0}^{V} (t)] . \end{matrix}

Consistent estimators of these functions are given by

\begin{matrix} S_{r} (t, x, c) = \frac{1}{n} \sum_{i = 1}^{n} I (Y_{i} \geq t, c^{T} X_{i} \leq x) exp (c^{T} X_{i}), \\ B_{r} (x; {\hat{ξ}}_{0}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{0}^{τ_{1}} I ({\hat{ξ}}^{T} X_{i} \leq x) (X_{i} - \bar{X} (s; \hat{ξ})) N_{i} (d s), \\ B_{r}^{M} (y; {\hat{θ}}_{0}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} I ({\hat{θ}}^{T} X_{i} \leq y) (X_{i} - \bar{X} (s; \hat{θ})) M_{i} (d s, τ_{0}), \\ B_{r}^{V} (z; {\hat{ϕ}}_{0}) = \frac{1}{n} \sum_{i = 1}^{n} \int_{τ_{0}}^{τ_{1}} I ({\hat{ϕ}}^{T} X_{i} \leq z) (X_{i} - \bar{X} (s; \hat{ϕ})) V_{i} (d s, τ_{0}) . \end{matrix}

By the strong consistency of ξ̂, θ̂, ϕ̂ and the uniform strong law of large numbers, S_r(u, x, c), B_r(x; ξ̂), $B_{r}^{M} (y; \hat{θ}), B_{r}^{V} (z; \hat{ϕ})$ converge almost surely to s_r(u, x, c), b_r(x; ξ₀), $b_{r}^{M} (y; θ_{0}), b_{r}^{V} (z; ϕ_{0})$ .

Contributor Information

Kwun Chuen Gary Chan, Department of Biostatistics and Department of Health Services, University of Washington, Seattle, Washington 98105, U.S.A.

Mei-Cheng Wang, Department of Biostatistics, Johns Hopkins University, Baltimore, Maryland 21205, U.S.A.

References

Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. The Annals of Statistics. 1982;10(4):1100–1120. [Google Scholar]
Breslow N. Contribution to the discussion of the paper by DR Cox. Journal of the Royal Statistical Society, Series B. 1972;34(2):216–217. [Google Scholar]
Cai J, Zeng D, Pan W. Semiparametric proportional means model for marker data contingent on recurrent event. Lifetime data analysis. 2010;16(2):250–270. doi: 10.1007/s10985-009-9146-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chan IS, Neaton JD, Saravolatz LD, Crane LR, Osterberger J. Frequencies of opportunistic diseases prior to death among HIV-infected persons. AIDS. 1995;9:1145–1152. doi: 10.1097/00002030-199510000-00005. [DOI] [PubMed] [Google Scholar]
Chan KCG, Wang M-C. Backward estimation of stochastic processes with failure events as time origins. The Annals of Applied Statistics. 2010;4(3):1602–1620. doi: 10.1214/09-AOAS319. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cook R, Lawless J. Marginal analysis of recurrent events and a terminating event. Statistics in Medicine. 1997;16(8):911–924. doi: 10.1002/(sici)1097-0258(19970430)16:8<911::aid-sim544>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]
Cox DR. Regression models and life-tables. Journal of the Royal Statistical Society Series B. 1972;34(2):187–220. [Google Scholar]
Cox DR, Isham V. Point processes. CRC Press; London: 1980. [Google Scholar]
Efron B. The two-way proportional hazards model. Journal of the Royal Statistical Society: Series B. 2002;64(4):899–909. doi: 10.1111/rssc.12098. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gill RD. Non-and semi-parametric maximum likelihood estimators and the von mises method (part 1) Scandinavian Journal of Statistics. 1989;16(2):97–128. [Google Scholar]
Hall CB, Lipton RB, Sliwinski M, Stewart WF. A change point model for estimating the onset of cognitive decline in preclinical Alzheimer’s disease. Statistics in medicine. 2000;19(11–12):1555–1566. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1555::aid-sim445>3.0.co;2-3. [DOI] [PubMed] [Google Scholar]
Huang C-Y, Qin J, Wang M-C. Semiparametric Analysis for Recurrent Event Data with Time-Dependent Covariates and Informative Censoring. Biometrics. 2010;66(1):39–49. doi: 10.1111/j.1541-0420.2009.01266.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Huang C-Y, Wang M-C. Joint Modeling and Estimation for Recurrent Event Processes and Failure Time Data. Journal of the American Statistical Association. 2004;99(468):1153–1165. doi: 10.1198/016214504000001033. [DOI] [PMC free article] [PubMed] [Google Scholar]
Huang Y. Calibration regression of censored lifetime medical cost. Journal of the American Statistical Association. 2002;97(457):318–327. [Google Scholar]
Huang Y, Louis TA. Nonparametric estimation of the joint distribution of survival time and mark variables. Biometrika. 1998;85(4):785–798. [Google Scholar]
Huang Y, Wang M-C. Frequency of Recurrent Events at Failure Time. Journal of the American Statistical Association. 2003;98(463):663–670. doi: 10.1080/01621459.2016.1173557. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kalbfleisch JD, Schaubel DE, Ye Y, Gong Q. An Estimating Function Approach to the Analysis of Recurrent and Terminal Events. Biometrics. 2013;69(2):366–374. doi: 10.1111/biom.12025. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kooperberg C, Stone CJ. A study of logspline density estimation. Computational Statistics & Data Analysis. 1991;12(3):327–347. [Google Scholar]
Lawless J, Nadeau C. Some simple robust methods for the analysis of recurrent events. Technometrics. 1995;37(2):158–168. [Google Scholar]
Lin DY. Proportional means regression for censored medical costs. Biometrics. 2000;56(3):775–778. doi: 10.1111/j.0006-341x.2000.00775.x. [DOI] [PubMed] [Google Scholar]
Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society: Series B. 2000;62(4):711–730. [Google Scholar]
Lin DY, Wei LJ, Ying Z. Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika. 1993;80(3):557–572. [Google Scholar]
Lin DY, Ying Z. Semiparametric and nonparametric regression analysis of longitudinal data. Journal of the American Statistical Association. 2001;96(453):103–126. [Google Scholar]
Liu D, Schaubel DE, Kalbfleisch JD. Computationally efficient marginal models for clustered recurrent event data. Biometrics. 2012;68(2):637–647. doi: 10.1111/j.1541-0420.2011.01676.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu L, Wolfe RA, Huang X. Shared frailty models for recurrent events and a terminal event. Biometrics. 2004;60(3):747–756. doi: 10.1111/j.0006-341X.2004.00225.x. [DOI] [PubMed] [Google Scholar]
Liu L, Wolfe RA, Kalbfleisch JD. A shared random effects model for censored medical costs and mortality. Statistics in medicine. 2007;26(1):139–155. doi: 10.1002/sim.2535. [DOI] [PubMed] [Google Scholar]
Lunney JR, Lynn J, Foley DJ, Lipson S, Guralnik JM. Patterns of functional decline at the end of life. Journal of the American Medical Association. 2003;289(18):2387–2392. doi: 10.1001/jama.289.18.2387. [DOI] [PubMed] [Google Scholar]
Neaton J, Wentworth D, Rhame F, Hogan C, Abrams D, Deyton L. Considerations in choice of a clinical endpoint for AIDS clinical trials. Terry Beirn Community Programs for Clinical Research on AIDS (CPCRA) Statistics in medicine. 1994;13(19–20):2107–2125. doi: 10.1002/sim.4780131919. [DOI] [PubMed] [Google Scholar]
Nelson W. Graphical analysis of system repair data. Journal of Quality Technology. 1988;20(1):24–35. [Google Scholar]
Pawitan Y, Self S. Modeling disease marker processes in AIDS. Journal of the American Statistical Association. 1993;88(423):719–726. [Google Scholar]
Pepe M, Cai J. Some graphical displays and marginal regression analyses for recurrent failure times and time dependent covariates. Journal of the American Statistical Association. 1993;88(423):811–820. [Google Scholar]
Pollard D. Empirical processes: Theory and applications. Institute of Mathematical Statistics; Hayward, CA: 1990. [Google Scholar]
Prentice RL, Kalbfleisch JD, Peterson AV, Jr, Flournoy N, Farewell V, Breslow N. The analysis of failure times in the presence of competing risks. Biometrics. 1978;34:541–554. [PubMed] [Google Scholar]
Rockafellar R. Convex analysis. Princeton; Princeton NJ: 1970. [Google Scholar]
Sun Y, Gilbert PB, McKeague IW. Proportional hazards models with continuous marks. The Annals of statistics. 2009;37(1):394–426. doi: 10.1214/07-AOS554. [DOI] [PMC free article] [PubMed] [Google Scholar]
Therneau TM, Grambsch PM, Fleming TR. Martingale-based residuals for survival models. Biometrika. 1990;77(1):147–160. [Google Scholar]
Tsiatis AA. A large sample study of Cox’s regression model. The Annals of Statistics. 1981;9(1):93–108. [Google Scholar]
Usvyat LA, Barth C, Bayh I, Etter M, von Gersdorff GD, Grassmann A, Guinsburg AM, Lam M, Marcelli D, Marelli C, et al. Interdialytic weight gain, systolic blood pressure, serum albumin, and C-reactive protein levels change in chronic dialysis patients prior to death. Kidney International. 2013;84:149–157. doi: 10.1038/ki.2013.73. [DOI] [PMC free article] [PubMed] [Google Scholar]
Van der Vaart AW, Wellner JA. Weak Convergence. Springer; New York NY: 1996. [Google Scholar]
Wang M-C, Qin J, Chiang C. Analyzing recurrent event data with informative censoring. Journal of the American Statistical Association. 2001;96(455):1057–1065. doi: 10.1198/016214501753209031. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wilson RS, Beck TL, Bienias JL, Bennett DA. Terminal cognitive decline: accelerated loss of cognition in the last years of life. Psychosomatic Medicine. 2007;69(2):131–137. doi: 10.1097/PSY.0b013e31803130ae. [DOI] [PubMed] [Google Scholar]
Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53(1):330. [PubMed] [Google Scholar]
Ye Y, Kalbfleisch JD, Schaubel DE. Semiparametric analysis of correlated recurrent and terminal events. Biometrics. 2007;63(1):78–87. doi: 10.1111/j.1541-0420.2006.00677.x. [DOI] [PubMed] [Google Scholar]
Zeng D, Lin DY. Semiparametric transformation models with random effects for recurrent events. Journal of the American Statistical Association. 2007;102(477):167–180. [Google Scholar]
Zhao X, Zhou J, Sun L. Semiparametric Transformation Models with Time-Varying Coefficients for Recurrent and Terminal Events. Biometrics. 2011;67(2):404–414. doi: 10.1111/j.1541-0420.2010.01458.x. [DOI] [PubMed] [Google Scholar]

[R1] Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. The Annals of Statistics. 1982;10(4):1100–1120. [Google Scholar]

[R2] Breslow N. Contribution to the discussion of the paper by DR Cox. Journal of the Royal Statistical Society, Series B. 1972;34(2):216–217. [Google Scholar]

[R3] Cai J, Zeng D, Pan W. Semiparametric proportional means model for marker data contingent on recurrent event. Lifetime data analysis. 2010;16(2):250–270. doi: 10.1007/s10985-009-9146-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] Chan IS, Neaton JD, Saravolatz LD, Crane LR, Osterberger J. Frequencies of opportunistic diseases prior to death among HIV-infected persons. AIDS. 1995;9:1145–1152. doi: 10.1097/00002030-199510000-00005. [DOI] [PubMed] [Google Scholar]

[R5] Chan KCG, Wang M-C. Backward estimation of stochastic processes with failure events as time origins. The Annals of Applied Statistics. 2010;4(3):1602–1620. doi: 10.1214/09-AOAS319. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Cook R, Lawless J. Marginal analysis of recurrent events and a terminating event. Statistics in Medicine. 1997;16(8):911–924. doi: 10.1002/(sici)1097-0258(19970430)16:8<911::aid-sim544>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]

[R7] Cox DR. Regression models and life-tables. Journal of the Royal Statistical Society Series B. 1972;34(2):187–220. [Google Scholar]

[R8] Cox DR, Isham V. Point processes. CRC Press; London: 1980. [Google Scholar]

[R9] Efron B. The two-way proportional hazards model. Journal of the Royal Statistical Society: Series B. 2002;64(4):899–909. doi: 10.1111/rssc.12098. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] Gill RD. Non-and semi-parametric maximum likelihood estimators and the von mises method (part 1) Scandinavian Journal of Statistics. 1989;16(2):97–128. [Google Scholar]

[R11] Hall CB, Lipton RB, Sliwinski M, Stewart WF. A change point model for estimating the onset of cognitive decline in preclinical Alzheimer’s disease. Statistics in medicine. 2000;19(11–12):1555–1566. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1555::aid-sim445>3.0.co;2-3. [DOI] [PubMed] [Google Scholar]

[R12] Huang C-Y, Qin J, Wang M-C. Semiparametric Analysis for Recurrent Event Data with Time-Dependent Covariates and Informative Censoring. Biometrics. 2010;66(1):39–49. doi: 10.1111/j.1541-0420.2009.01266.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] Huang C-Y, Wang M-C. Joint Modeling and Estimation for Recurrent Event Processes and Failure Time Data. Journal of the American Statistical Association. 2004;99(468):1153–1165. doi: 10.1198/016214504000001033. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] Huang Y. Calibration regression of censored lifetime medical cost. Journal of the American Statistical Association. 2002;97(457):318–327. [Google Scholar]

[R15] Huang Y, Louis TA. Nonparametric estimation of the joint distribution of survival time and mark variables. Biometrika. 1998;85(4):785–798. [Google Scholar]

[R16] Huang Y, Wang M-C. Frequency of Recurrent Events at Failure Time. Journal of the American Statistical Association. 2003;98(463):663–670. doi: 10.1080/01621459.2016.1173557. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] Kalbfleisch JD, Schaubel DE, Ye Y, Gong Q. An Estimating Function Approach to the Analysis of Recurrent and Terminal Events. Biometrics. 2013;69(2):366–374. doi: 10.1111/biom.12025. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] Kooperberg C, Stone CJ. A study of logspline density estimation. Computational Statistics & Data Analysis. 1991;12(3):327–347. [Google Scholar]

[R19] Lawless J, Nadeau C. Some simple robust methods for the analysis of recurrent events. Technometrics. 1995;37(2):158–168. [Google Scholar]

[R20] Lin DY. Proportional means regression for censored medical costs. Biometrics. 2000;56(3):775–778. doi: 10.1111/j.0006-341x.2000.00775.x. [DOI] [PubMed] [Google Scholar]

[R21] Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society: Series B. 2000;62(4):711–730. [Google Scholar]

[R22] Lin DY, Wei LJ, Ying Z. Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika. 1993;80(3):557–572. [Google Scholar]

[R23] Lin DY, Ying Z. Semiparametric and nonparametric regression analysis of longitudinal data. Journal of the American Statistical Association. 2001;96(453):103–126. [Google Scholar]

[R24] Liu D, Schaubel DE, Kalbfleisch JD. Computationally efficient marginal models for clustered recurrent event data. Biometrics. 2012;68(2):637–647. doi: 10.1111/j.1541-0420.2011.01676.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] Liu L, Wolfe RA, Huang X. Shared frailty models for recurrent events and a terminal event. Biometrics. 2004;60(3):747–756. doi: 10.1111/j.0006-341X.2004.00225.x. [DOI] [PubMed] [Google Scholar]

[R26] Liu L, Wolfe RA, Kalbfleisch JD. A shared random effects model for censored medical costs and mortality. Statistics in medicine. 2007;26(1):139–155. doi: 10.1002/sim.2535. [DOI] [PubMed] [Google Scholar]

[R27] Lunney JR, Lynn J, Foley DJ, Lipson S, Guralnik JM. Patterns of functional decline at the end of life. Journal of the American Medical Association. 2003;289(18):2387–2392. doi: 10.1001/jama.289.18.2387. [DOI] [PubMed] [Google Scholar]

[R28] Neaton J, Wentworth D, Rhame F, Hogan C, Abrams D, Deyton L. Considerations in choice of a clinical endpoint for AIDS clinical trials. Terry Beirn Community Programs for Clinical Research on AIDS (CPCRA) Statistics in medicine. 1994;13(19–20):2107–2125. doi: 10.1002/sim.4780131919. [DOI] [PubMed] [Google Scholar]

[R29] Nelson W. Graphical analysis of system repair data. Journal of Quality Technology. 1988;20(1):24–35. [Google Scholar]

[R30] Pawitan Y, Self S. Modeling disease marker processes in AIDS. Journal of the American Statistical Association. 1993;88(423):719–726. [Google Scholar]

[R31] Pepe M, Cai J. Some graphical displays and marginal regression analyses for recurrent failure times and time dependent covariates. Journal of the American Statistical Association. 1993;88(423):811–820. [Google Scholar]

[R32] Pollard D. Empirical processes: Theory and applications. Institute of Mathematical Statistics; Hayward, CA: 1990. [Google Scholar]

[R33] Prentice RL, Kalbfleisch JD, Peterson AV, Jr, Flournoy N, Farewell V, Breslow N. The analysis of failure times in the presence of competing risks. Biometrics. 1978;34:541–554. [PubMed] [Google Scholar]

[R34] Rockafellar R. Convex analysis. Princeton; Princeton NJ: 1970. [Google Scholar]

[R35] Sun Y, Gilbert PB, McKeague IW. Proportional hazards models with continuous marks. The Annals of statistics. 2009;37(1):394–426. doi: 10.1214/07-AOS554. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] Therneau TM, Grambsch PM, Fleming TR. Martingale-based residuals for survival models. Biometrika. 1990;77(1):147–160. [Google Scholar]

[R37] Tsiatis AA. A large sample study of Cox’s regression model. The Annals of Statistics. 1981;9(1):93–108. [Google Scholar]

[R38] Usvyat LA, Barth C, Bayh I, Etter M, von Gersdorff GD, Grassmann A, Guinsburg AM, Lam M, Marcelli D, Marelli C, et al. Interdialytic weight gain, systolic blood pressure, serum albumin, and C-reactive protein levels change in chronic dialysis patients prior to death. Kidney International. 2013;84:149–157. doi: 10.1038/ki.2013.73. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] Van der Vaart AW, Wellner JA. Weak Convergence. Springer; New York NY: 1996. [Google Scholar]

[R40] Wang M-C, Qin J, Chiang C. Analyzing recurrent event data with informative censoring. Journal of the American Statistical Association. 2001;96(455):1057–1065. doi: 10.1198/016214501753209031. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] Wilson RS, Beck TL, Bienias JL, Bennett DA. Terminal cognitive decline: accelerated loss of cognition in the last years of life. Psychosomatic Medicine. 2007;69(2):131–137. doi: 10.1097/PSY.0b013e31803130ae. [DOI] [PubMed] [Google Scholar]

[R42] Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53(1):330. [PubMed] [Google Scholar]

[R43] Ye Y, Kalbfleisch JD, Schaubel DE. Semiparametric analysis of correlated recurrent and terminal events. Biometrics. 2007;63(1):78–87. doi: 10.1111/j.1541-0420.2006.00677.x. [DOI] [PubMed] [Google Scholar]

[R44] Zeng D, Lin DY. Semiparametric transformation models with random effects for recurrent events. Journal of the American Statistical Association. 2007;102(477):167–180. [Google Scholar]

[R45] Zhao X, Zhou J, Sun L. Semiparametric Transformation Models with Time-Varying Coefficients for Recurrent and Terminal Events. Biometrics. 2011;67(2):404–414. doi: 10.1111/j.1541-0420.2010.01458.x. [DOI] [PubMed] [Google Scholar]

PERMALINK

Semiparametric modeling and estimation of the terminal behavior of recurrent marker processes before failure events

Kwun Chuen Gary Chan

Mei-Cheng Wang

Abstract

1. INTRODUCTION

2. A THREE-LEVEL SEMIPARAMETRIC MODEL

Level 1

Level 2

Level 3

3. ESTIMATION AND INFERENCE

3.1 Finite-dimensional target parameters

Theorem 1

3.2 Functional parameters

Theorem 2

3.3 Goodness-of-fit test

4. NUMERICAL STUDIES

Table 1.

Table 2.

Figure 1.

5. CONCLUDING REMARKS

Level 1

Level 2^†

Level 3^†

Level 2*

Level 3*

Acknowledgments

APPENDIX

A.1 Asymptotic properties of finite-dimensional parameters

A.2 Consistency of B̂^M(t; θ̂), B̂^V (t; ϕ̂) and V̂

A.3 Weak convergence of functional parameters

A.4 The null distribution of 𝒮

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Semiparametric modeling and estimation of the terminal behavior of recurrent marker processes before failure events

Kwun Chuen Gary Chan

Mei-Cheng Wang

Abstract

1. INTRODUCTION

2. A THREE-LEVEL SEMIPARAMETRIC MODEL

Level 1

Level 2

Level 3

3. ESTIMATION AND INFERENCE

3.1 Finite-dimensional target parameters

Theorem 1

3.2 Functional parameters

Theorem 2

3.3 Goodness-of-fit test

4. NUMERICAL STUDIES

Table 1.

Table 2.

Figure 1.

5. CONCLUDING REMARKS

Level 1

Level 2†

Level 3†

Level 2*

Level 3*

Acknowledgments

APPENDIX

A.1 Asymptotic properties of finite-dimensional parameters

A.2 Consistency of B̂M(t; θ̂), B̂V (t; ϕ̂) and V̂

A.3 Weak convergence of functional parameters

A.4 The null distribution of 𝒮

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Level 2^†

Level 3^†

A.2 Consistency of B̂^M(t; θ̂), B̂^V (t; ϕ̂) and V̂