Semiparametric Inference for a General Class of Models for Recurrent Events

Edsel A Peña; Elizabeth H Slate; Juan R González

doi:10.1016/j.jspi.2006.05.004

. Author manuscript; available in PMC: 2009 Oct 9.

Published in final edited form as: J Stat Plan Inference. 2007 Jun 1;137(6):1727–1747. doi: 10.1016/j.jspi.2006.05.004

Semiparametric Inference for a General Class of Models for Recurrent Events

Edsel A Peña ^a,^*, Elizabeth H Slate ^b, Juan R González ^c

PMCID: PMC2759672 NIHMSID: NIHMS10128 PMID: 19823592

Abstract

Procedures for estimating the parameters of the general class of semiparametric models for recurrent events proposed by Peña and Hollander (2004) are developed. This class of models incorporates an effective age function encoding the effect of changes after each event occurrence such as the impact of an intervention, it models the impact of accumulating event occurrences on the unit, it admits a link function in which the effect of possibly time-dependent covariates are incorporated, and it allows the incorporation of unobservable frailty components which induce dependencies among the inter-event times for each unit. The estimation procedures are semiparametric in that a baseline hazard function is nonparametrically specified. The sampling distribution properties of the estimators are examined through a simulation study, and the consequences of mis-specifying the model are analyzed. The results indicate that the flexibility of this general class of models provides a safeguard for analyzing recurrent event data, even data possibly arising from a frailtyless mechanism. The estimation procedures are applied to real data sets arising in the biomedical and public health settings, as well as from reliability and engineering situations. In particular, the procedures are applied to a data set pertaining to times to recurrence of bladder cancer and the results of the analysis are compared to those obtained using three methods of analyzing recurrent event data.

Keywords: Correlated inter-event times, counting process, effective age process, EM algorithm, frailty; intensity models, model mis-specification, sum-quota accrual scheme

1 Introduction

Recurrent events occur in many settings such as in biomedicine, public health, clinical trials, engineering and reliability studies, politics, economics, sociology, actuarial science, among others. Examples of recurrent events in the biomedical and public health settings are the re-occurrence of a tumor after surgical removal in cancer studies, epileptic seizures, drug or alcohol abuse of adolescents, outbreak of a disease such as encephalitis, recurring migraines, hospitalization, movement in the small bowel during fasting state, onset of depression, nauseous feeling when taking drugs for the dissolution of cholesterol gallstones, recurrence of caries, ulcers or inflammation in an oral health study, and angina pectoris for patients with coronary disease. Some other specific biomedical examples of recurrent events are described in Cook and Lawless (2002). In the engineering and reliability settings, recurrent events could be the breakdown or failure of a mechanical or electronic system, the discovery of a bug in an operating system software, the occurrence of a crack in concrete structures, the breakdown of a fiber in fibrous composites, among others. Non-life insurance claims, traffic accidents, terrorist attacks, the Dow Jones Industrial Average decreasing by more than 200 points on a trading day, change of employment, among many others, are but a few examples of recurrent phenomena in other settings.

There are several models and methods of analysis used for recurrent event data. See for example Hougaard (2000), Therneau and Hamilton (1997), and Therneau and Grambsch (2000) for some current approaches to analyzing recurrent event data. However, as pointed out in Peña and Hollander (2004), there is still a need for a general and flexible class of models that simultaneously incorporates the effects of covariates or concomitant variables, the impact on the unit of accumulating event occurrences, the effect of performed interventions after each event occurrence, as well as the effect of latent or unobserved variables which, for each unit, endow correlation among the inter-event times. In recognition of this need, Peña and Hollander (2004) proposed a general class of models for recurrent events which satisfies the above requirements. This class of models will be described in Section 2. The current paper deals with inference issues, specifically the estimation of parameters, for this new class of models. However, we limit the scope of this paper to examining the finite-sample properties through simulation studies of the resulting estimators and defer the analytical and asymptotic analysis of their properties to a forthcoming paper.

We consider an observational unit (e.g., a patient in a biomedical setting, an electronic system in a reliability setting) that is being monitored for the occurrence of a recurrent event over a study period $[0, τ]$ , where $τ$ may represent an administrative time, time of study termination, or some other right-censoring variable. The time $τ$ could be a random time governed by an unknown probability distribution function $G (t) = P (τ \leq t)$ . Let $S_{0} \equiv 0 < S_{1} < S_{2} < S_{3} < \dots$ be the successive calendar times of event occurrences, and let $T_{1}, T_{2}, T_{3}, \dots$ be the times between successive event occurrences. Thus, for $i = 1, 2, 3, \dots, T_{i} = S_{i} - S_{i - 1}$ and $S_{i} = T_{1} + T_{2} + \dots + T_{i}$ . Over the observation period $[0, τ]$ , the number of event occurrences is $K = max {k \in {0, 1, 2, \dots} : S_{k} \leq τ}$ , which is a random variable whose distribution depends on the distributional properties of the inter-occurrence times $T_{i} s$ and the distribution $G$ of $τ$ . As such, $K$ is informative with regards to the distributional properties of event occurrences.

Assume for this unit a, possibly time-varying, $q$ -dimensional vector of covariates such as gender, age, race, disease status, white blood cell counts (WBC), prostate specific antigen (PSA) level, weight, blood pressure, treatment regimen, etc. We suppose that over the period $[0, τ]$ , the realization of this covariate process is observable. We denote this covariate process by ${X (s) = (X_{1} (s), X_{2} (s), \dots, X_{q} (s))^{t} : 0 \leq s \leq τ}$ , with “ $^{t}$ ” representing vector/matrix transpose. For this subject, the observable entities over the study period $[0, τ]$ are therefore

D (τ) \equiv {(X (s) : 0 \leq s \leq τ), K, τ, T_{1}, T_{2}, \dots, T_{K}, τ - S_{K}} .

(1)

Notice that since $S_{K} = \sum_{j = 1}^{K} T_{j}$ , specifying $τ - S_{K}$ renders $τ$ redundant; however, we still include $τ$ to indicate that $τ - S_{K}$ is the right-censoring variable for the inter-occurrence time $T_{K + 1}$ . Furthermore, since $K$ is random, then the distributional properties of both $τ - S_{K}$ and $T_{K + 1}$ may be of a complicated form. When considering the data structure in this recurrent event situation, there is a need to recognize that $K$ is informative and that the censoring mechanism for $T_{K + 1}$ is informative (cf., Wang and Chang, 1999; Lin et al., 1999; Peña et al., 2001). These aspects are borne out of the sum-quota data accrual scheme since the number of observed events is tied-in to the distributions governing the event occurrences themselves.

The observable entities may also be represented more succinctly and beneficially through the use of stochastic processes. Still considering one unit, with $I {\cdot}$ denoting indicator function, define for calendar time $s$ , $N^{†} (s) = \sum_{j = 1}^{\infty} I {S_{j} \leq s, S_{j} \leq τ}$ , which is the process counting the number of events observed on or before calendar time $s$ during the study period $[0, τ]$ . Furthermore, define for calendar time $Y^{†} (s) = I {τ \geq s}$ , the “at-risk” process which indicates whether the subject is still under observation at calendar time $s$ or not. The data $D (τ)$ in (1) could be represented by

D (s^{*}) = {(X (s), N^{†} (s), Y^{†} (s)) : 0 \leq s \leq s^{*} < \infty},

(2)

where $s^{*} (\geq τ)$ is an upper limit of observation time. Note that even though $X (s)$ is not observed for $s > τ$ this does not pose a problem since for $s > τ, Y^{†} (s) = 0$ , and so for such a subject, there will be no information obtainable beyond $τ$ . If in the study there are $n$ subjects, the observables will be $D (s *) = (D_{1} (s *), D_{2} (s *), \dots, D_{n} (s *))$ , where $s^{*} \geq {max}_{1 \leq i \leq n} τ_{i}$ and for $i = 1, 2, \dots, n$ ,

D_{i} (s^{*}) = {(X_{i} (s) : s \leq τ_{i}), K_{i}, τ_{i}, T_{i 1}, T_{i 2}, \dots, T_{i K_{i}}, τ_{i} - S_{i K_{i}}} .

(3)

Equivalently,

D_{i} (s^{*}) = {(X_{i} (s), N_{i}^{†} (s), Y_{i}^{†} (s)) : 0 \leq s \leq s^{*} < \infty}, where

N_{i}^{†} (s) = \sum_{j = 1}^{\infty} I {S_{i j} \leq s, S_{i j} \leq τ_{i}} and Y_{i}^{†} (s) = I {τ_{i} \geq s},

and $S_{i 1} < S_{i 2} < \dots$ are the calendar times of successive event occurrences for the $i$ th subject, $T_{i j} = S_{i j - 1}, j = 1, 2, \dots$ and $τ_{i}$ is the censoring time of the $i$ th subject.

We provide an outline of the contents of this paper. Section 2 will present a description of the class of models for recurrent events that is under investigation. Section 3 will examine the problem of estimating the parameters of the model when there are no frailty components. The results here are needed for the estimation procedure in the presence of frailties described in Section 4. Section 5 will summarize results of the simulation studies pertaining to the properties of the estimators. We demonstrate the estimation procedures discussed in Sections 3 and 4 on real data sets in Section 6. Section 7 will provide some concluding thoughts.

2 A General Class of Models

In this section we describe the general class of models for recurrent events in Peña and Hollander (2004). Let $Z = (Z_{1}, Z_{2}, \dots Z_{n})$ be a vector of independent and identically distributed (i.i.d.) positive-valued random variables from a parametric distribution $H (z, ξ) = P (Z \leq z | ξ)$ where $ξ$ is a finite-dimensional parameter taking values in $Ξ \subseteq ℜ^{r}$ . These variables are unobservable random factors affecting the event occurrences for the subjects. Also, let $F = {ℱ_{s} : 0 \leq s \leq s *}$ be a filtration or history on some probability space $(Ω, ℱ, P)$ such that the $X_{i}$ and $Y_{i}^{†}$ are predictable and such that the $N_{i}^{†}$ are counting processes with respect to $F$ . The general class of models requires the specification, possibly done dynamically, of predictable observable processes ${ℰ_{i} (s) : 0 \leq s \leq s *}, i = 1, 2, \dots, n$ , satisfying the following conditions: (I) $ℰ_{i} (0) = e_{i 0}$ , almost surely (a.s.), where $e_{i 0}, i = 1, 2, \dots, n$ , are nonnegative real numbers; (II) $ℰ_{i} (s) \geq 0, i = 1, 2, \dots, n$ ; and (III) On $(S_{i k - 1}, S_{i k}], ℰ_{i} (s)$ is monotone and almost surely differentiable with a positive derivative $ℰ_{i}^{'} (s)$ . The class of models is obtained by postulating that, conditionally on $Z$ , the $F$ -compensator of $N_{i}^{†}$ is ${A_{i}^{†} (s | Z, X_{i}) : 0 \leq s \leq s *}$ with

A_{i}^{†} (s | Z, X_{i}) = \int_{0}^{s} Y_{i}^{†} (v) λ_{i} (v | Z, X_{i}) d v;

(4)

λ_{i} (s | Z, X_{i} | = Z_{i} λ_{0} [ℰ_{i} (s)] ρ [N_{i}^{†} (s -); α] ψ [β^{t} X_{i} (s)] .

(5)

This means that the process $M_{i}^{†} (s | Z, X_{i}) = N_{i}^{†} (s) - A_{i}^{†} (s | Z, X_{i})$ is a square-integrable $F$ -martingale. In (5), $λ_{0} (\cdot)$ is an unknown baseline hazard rate function; $ρ (\cdot; α) : Z_{+} \equiv {0, 1, 2, \dots} \to ℜ_{+}$ is of known functional form with $ρ (0; α) = 1$ and with $α \in A \subseteq ℜ^{p}$ ; and $ψ (\cdot)$ is a nonnegative link function of known functional form with $β \in B \subseteq ℜ^{q}$ . The unknown model parameters are $(λ_{0} (\cdot), α, β, ℰ)$ , where $λ_{0} (\cdot)$ is non-prarametrically specified, and $α, β$ , and $ε$ are finite-dimensional parameters. The main impetus in introducing this general class of models for recurrent events is that it incorporates simultaneously the effects of covariates through the link function $ψ (\cdot)$ , the associations among the event inter-occurrence times through the unobservable frailty variables $Z_{i},$ the effects attributable to the accumulating event occurrences for a subject through the component $ρ (\cdot; α)$ , and the effects of performed interventions after each event occurrence through the effective age processes ${ℰ_{i} (\cdot)}$ which act on the baseline hazard rate function $λ_{0} (\cdot)$ , and which may even have nonlinear forms. There is a potential interplay between the effective age process $ℰ (\cdot)$ and the $ρ (\cdot; \cdot)$ function. This issue will be discussed in Section 5 dealing with the simulation studies and in the Section 6 dealing with the applications.

With no frailty, the generality of this class of models was discussed in Peña and Hollander (2004). With the added feature of frailty, this class of models subsumes many models in the literature, as described below. Indeed, one may view this class of models as a general synthesis of several recurrent event models in survival analysis and reliability, such as the modulated renewal process of Cox (1972b) and those by Self and Prentice (1982), Prentice and Self (1983), Kessing et al. (1999), and in the examples that follow where the model is described for only one unit, i.e., with $n = 1$ .

Example 2.1

Beginning with no frailty $(Z = 1)$ , taking $ρ (k; α) = 1, ψ (w) = 1$ , and $ℰ (s) = s - S_{N^{†} (s -)}$ , we obtain i.i.d. inter-occurrence times, one of the models examined in Gill (1981) and Peña et al. (2001). With frailty, one obtains associations among the inter-occurrence times, a model also considered in Peña et al. (2001) and Wang and Chang (1999). Still with no frailty but taking $ψ (w) = exp (w)$ gives the extended Cox proportional hazards model considered by Prentice et al. (1981), Lawless (1987), and Aalen and Husebye (1991). Further changing to $ℰ (s) = s$ gives a model examined by Prentice et al. (1981), Brown and Proschan (1983), and Lawless (1987) referred to in the reliability literature as an imperfect repair model, since it arises by `restoring a system to the state just before it failed (minimally repaired)' whenever the system fails.

Example 2.2

Let $I_{1}, I_{2}, I_{3}, \dots$ be a sequence of i.i.d. Bernoulli random variables with success probability $p$ . Define the process ${η (s) : s \in [0, τ]}$ via $η (s) = \sum_{i = 1}^{N † (s)} I_{i}$ . Also let $0 \equiv Γ_{0}, Γ_{1}, Γ_{2}, < \dots$ be defined according to $Γ_{k} = min {j > Γ_{k - 1} : I_{j} = 1},$ , $k = 1, 2, 3, \dots$ . By setting $Z = 1, ρ (k; α) = 1$ and $ℰ (s) = s - S_{Γ_{η (s -)}}$ , we obtain

λ (s | Z, X) = λ_{0} (s - S_{Γ_{η (s -)}}) ψ (β^{t} X (s)) .

(6)

This is the Brown and Proschan (1983) imperfect repair model, also studied by Whitaker and Samaniego (1989) who noted that the inter-failure times and the repair modes suffice for model identifiability. If the success probability $p$ depends on the time of event occurrence, the Block et al. (1985) model obtains (see Hollander et al., 1992; Presnell et al., 1994). Note in this example that the $Γ_{k}$ s represent event occurrences in which intervention causes the unit to acquire an effective age of zero. Furthermore, $S_{Γ_{η (s -)}}$ is the last time prior to $s$ that the subject had an effective age of zero. More generally, the class of models also subsumes the general repair model of Last and Szekli (1998), which includes as special cases models of Dorado et al. (1997), Kijima (1989), Baxter et al. (1996), and Stadje and Zuckerman (1991).

Example 2.3

Lindqvist et al. (2003) proposed the trend-renewal process (TRP) model and the heterogeneous TRP (HTRP) model for repairable systems, which are models built on the idea behind the inhomogeneous gamma process model of Berman (1981). The TRP has two parameters: a distribution function $F$ and a cumulative hazard function $Λ$ , and is such that if $S_{1} < S_{2} < \dots$ are the event times, then ${Λ (S_{1}), Λ (S_{2}), \dots}$ forms a renewal process from the distribution function $F$ . The TRP becomes a special case of the general class of models with effective age process $ℰ (s) = (Λ_{0}^{1} ○ Λ_{F}^{- 1} ○ Λ_{0}) (s - S_{N^{†} (s -)})$ where $Λ_{F}$ is the cumulative hazard function associated with $F$ and $○$ is function composition, $ρ (k; α) \equiv 1$ and $ψ (v) \equiv 1$ . However, for the inference setting we are considering in this paper, this is not covered because $ℰ (\cdot)$ would not be observable since $Λ_{F} (\cdot)$ would not be known. Meanwhile, the HTRP model is simply the version with a frailty component.

Example 2.4

A special case obtains via $ρ [N^{†} (s -); α] = max {α - g [N^{†} (s -)], 0}$ , where $α$ is some positive real number, and $g (\cdot)$ is some nondecreasing function. One could interpret the parameter $α$ as an initial measure of the unit’s susceptibility to events, and $g (\cdot)$ specifies the rate at which this unit is becoming stronger as the event occurrences accumulate. If we take $ℰ (s) = s - S_{N^{†} (s -)}$ , the resulting model possesses the interesting property that the unit's defects contribute to the event occurrence intensity multiplicatively through the baseline hazard rate function $λ_{0} (\cdot)$ . If $g [N^{†} (s -)] = N^{†} (s -)$ and $λ_{0} (s) = λ_{0}$ , where $λ_{0}$ is some positive constant, then the Gail et al. (1980) tumor occurrence model and the Jelinski and Moranda (1972)) software reliability model are obtained.

Example 2.5

A popular load-sharing model is the equal load-share model considered in Kvam and Peña (2005). One context is a $K$ -component parallel system consisting of identical components, for which the event of interest is the occurrence of a component failure. Failed components are not replaced, and when a component fails, the load of the system is redistributed equally over the remaining functioning components. To model this, we let $α = (α_{0} \equiv 1, α_{1}, \dots, α_{K - 1})$ be an unknown vector of constants, and take the hazard rate of event occurrence at calendar time $s$ as $λ (s) = λ_{0} (s) [K - N^{†} (s -)] α_{N^{†} (s -)}$ , where $λ_{0} (\cdot)$ is the hazard rate of each component at time zero and $N^{†} (s)$ denotes the number of components that have failed up to time $s$ . This model is then a special case of the general model with $ε (s) = s, ρ (j; α) = (K - j) α_{j}$ , and one has the added flexibility of also incorporating a link function involving covariates if such are observed, as well as frailty components which could model unobserved operating environmental factors.

3 Estimation of Parameters: Model without Frailties

By virtue of the generality of the class of models, it is thus of importance to develop appropriate statistical inference methods. We address in this section the problem of estimating the model parameters $Λ_{0} (\cdot) = \int_{0} λ_{0} (w) d w, α$ , and $β$ for the model where it is assumed that $Z_{i} \equiv 1$ , that is, the model without frailties. Thus, the model of interest has intensity process

λ_{i} (s | X_{i}) = λ_{0} [ℰ_{i} (s)] ρ [N_{i}^{†} (s -); α] ψ (β^{t} X_{i} (s)) .

(7)

The observables for the $n$ subjects, which now include the observable effective age processes, are ${(X_{i} (s), N_{i}^{†} (s), Y_{i}^{†} (s), ℰ_{i} (s)) : 0 \leq s \leq s *}, i = 1, 2, \dots, n$ , where $N_{i}^{†} (s) = \sum_{j = 1}^{\infty} I {S_{i j} \leq s, S_{i j} \leq τ_{i}}$ and $Y_{i}^{†} (s) = I {τ_{i} \geq s}$ . The statistical identifiability of this class of models without frailties has been established in Theorem 1 of Peña and Hollander (2004). The two basic conditions to achieve identifiability, aside from the non-triviality of $ψ (\cdot)$ and sufficient variability of $X$ , are that for each value of the parameter set $(λ_{0} (\cdot), α, β)$ , the support of $ℰ (S_{1})$ should contain $[0, τ]$ , and that $ρ (\cdot, \cdot)$ should satisfy the condition that $ρ (k; α^{(1)}) = ρ (k; α^{(2)})$ for each $k \in {0, 1, 2, \dots}$ implies $α^{(1)} = α^{(2)}$ . These two conditions are henceforth assumed to hold.

For this model, letting $A_{i}^{†} (s) = \int_{0}^{s} Y_{i}^{†} (v) λ_{0} [ℰ_{i} (v)] ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) d v$ , then with respect to the filtration $F$ , the vector of processes

M^{†} = (M_{1}^{†}, \dots, M_{n}^{†}) = N^{†} - A^{†} = (N_{1}^{†} - A_{1}^{†}, \dots, N_{n}^{†} - A_{n}^{†})

consists of orthogonal square-integrable martingales with predictable quadratic covariation processes $〈 M_{i_{1}}^{†}, M_{i_{2}}^{†} 〉 (s) = A_{i_{1}}^{†} (s) I {i_{1} = i_{2}}$ . The usual martingale theory utilized by Aalen (1978), Gill (1980), Andersen and Gill (1982), and others (cf., Fleming and Harrington, 1991; Andersen et al., 1993) does not apply directly for the purpose of estimating $Λ_{0} (\cdot)$ . The reason is that the $λ_{0} (\cdot)$ appearing in $A_{i}^{†} (\cdot)$ is time-transformed by the observable predictable process $ℰ_{i} (\cdot)$ , while of interest is to estimate $Λ_{0} (t)$ for a given $t$ . It is tempting and would seem natural to simply define new processes involving the gap times between the event occurrences. However, as pointed out in Peña et al. (2001), this approach does not work since the resulting processes no longer satisfy martingale properties owing to the effect of the sum-quota accrual scheme.

The technique utilized in Peña et al. (2001), extending an idea of Sellke (1988) and Gill (1981), is to define a doubly-indexed process $Z_{i} (s, t) = I {ℰ_{i} (s) \leq t}, i = 1, 2, \dots, n$ . (Note that there is a notational, but tolerable, conflict with the frailty variables.) The index s represents calendar time, which is the natural time of data accrual; while the index $t$ represents gap times. This process indicates whether at calendar time $s$ , the effective age of the $i$ th subject is no more than $t$ . For $i = 1, 2, \dots, n$ , define also the doubly-indexed processes

\begin{array}{l} N_{i} (s, t) = \int_{0}^{s} Z_{i} (v, t) N_{i}^{†} (d v); A_{i} (s, t) = \int_{0}^{s} Z_{i} (v, t) A_{i}^{†} (d v); \\ M_{i} (s, t) = N_{i} (s, t) - A_{i} (s, t) = \int_{0}^{s} Z_{i} (v, t) M_{i}^{†} (d v) . \end{array}

Note that $N_{i} (s, t)$ is the number of events for the $i$ th unit that occurred over $[0, s]$ with effective ages at most $t$ . For a given $t$ , by utilizing the martingale property of $M_{i}^{†}$ and the predictability of $Z_{i} (\cdot, t)$ , the process $M_{i} (\cdot, t)$ is a square-integrable zero-mean martingale; however, for fixed $s$ , the process $M_{i} (s, \cdot)$ is not a martingale, but nevertheless, it also has mean zero.

A critical result is an equivalent expression for $A_{i} (s, t)$ which involves $λ_{0} (t)$ directly, instead of its time-transformed version. To reveal this expression, define for $j = 1, \dots, K_{i} + 1$ the processes

ℰ_{i j - 1} (v) = ℰ_{i} (v) I {S_{i j - 1} < v \leq S_{i j}} on {Y_{i}^{†} (v) > 0} .

(8)

Thus, $ℰ_{i j - 1} (\cdot)$ is the restriction of $ℰ_{i} (\cdot)$ on the $j$ th interval bounded by successive event occurrence times for the $i$ th subject. Note that on $(S_{i j - 1}, S_{i j}]$ , the paths of $ℰ_{i j - 1} (\cdot)$ are one-to-one, so its inverse exists; and furthermore, it is also differentiable. We now provide the alternative expression for $A_{i} (s, t)$ in Proposition 1. The proof of the result is analogous to that in Peña et al. (2000). To achieve a more concise notation, with $ℰ_{i j}^{'} (s) = \frac{d}{d s} ℰ_{i j} (s),$ we define

ϕ_{i j} (s; α, β) \equiv \frac{ρ [N_{i}^{†} (s -); α] ψ [β^{t} X_{i} (s)]}{ℰ_{i j}^{'} (s)} .

(9)

Proposition 1 For each $i = 1, 2, \dots, n, A_{i} (s, t) = \int_{0}^{t} Y_{i} (s, w) λ_{0} (w) d w$ , where

\begin{array}{l} Y_{i} (s, w) \equiv Y_{i} (s, w | α, β) = \sum_{j = 1}^{N_{i}^{†} (s -)} I_{(ℰ_{i j - 1} (S_{i j - 1} +), ℰ_{i j - 1} (S_{i j})]} (w) ϕ_{i j - 1} (ℰ_{i j - 1}^{- 1} (w); α, β) + \\ I_{(ℰ_{i N_{i}^{†} (s -)} (S_{i N_{i}^{†} (s -)} +), ℰ_{i N_{i}^{†} (s -)} (s \land τ_{i})]} (w) ϕ_{i N_{i}^{†} (s -)} (ℰ_{i N_{i}^{†} (s -)}^{- 1} (w); α, β) . \end{array}

The process $Y_{i} (s, w)$ is a generalized at-risk process and is an adjusted count of the number of events for the $i$ th unit which occurred over $[0, s]$ whose effective ages during their occurrences are at least $w$ . Using Proposition 1, we have the identity

M_{i} (s, t) = N_{i} (s, t) - \int_{0}^{t} Y_{i} (s, w) Λ_{0} (d w), i = 1, 2, \dots, n,

So that $\sum_{i = 1}^{n} M_{i} (s, d w) = \sum_{i = 1}^{n} N_{i} (s, d w) - S_{0} (s, w) Λ_{0} (d w)$ , where

S_{0} (s, t) \equiv S_{0} (s, t | α, β) = \sum_{i = 1}^{n} Y_{i} (s, t | α, β) .

(10)

Because $\sum_{i = 1}^{n} M_{i} (s, d w)$ has mean zero, a method-of-moments ‘estimator’ of $Λ_{0} (t)$ , given $(α, β)$ is

{\hat{Λ}}_{0} (s, t | α, β) = \int_{0}^{t} {\frac{J (s, w | α, β)}{S_{0} (s, w | α, β)}} {\sum_{i = 1}^{n} N_{i} (s, d w)},

(11)

with $J (s, w | α, β) = I {S_{0} (s, w | α, β) > 0}$ and with the convention that $0 / 0 = 0$ . Notice that this ‘estimator’ is of the same flavor as the Nelson-Aalen estimator or the Aalen-Breslow estimator in single-event settings, although it should be pointed out that the derivation as well as the structure of the processes are quite different.

Next we develop the profile likelihood for $(α, β)$ from which the estimator of $(α, β)$ will be obtained. Following Jacod (1975) (see also Andersen et al., 1993), if the distribution $G$ of $τ$ does not involve the model parameters, then the full likelihood process associated with the observables for the general model without frailties is

\begin{array}{l} L_{F}^{†} (s | λ_{0} (\cdot), α, β, D (s *)) = {\prod_{i = 1}^{n} {\prod_{v = 0}^{s} [Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)]]}^{N_{i}^{†} (Δ v)}} \times \\ {exp [- \sum_{i = 1}^{n} \int_{0}^{s} Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)] d v]} . \end{array}

(12)

The argument of the exponential function could be re-expressed via

\sum_{i = 1}^{n} \int_{0}^{s} Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)] d v = \int_{0}^{\infty} S_{0} (s, w | α, β) Λ_{0} (d w) .

Since from (11), we have ${\hat{Λ}}_{0} (s, d w | α, β) = \sum_{i = 1}^{n} N_{i} (s, d w) / S_{0} (s, w | α, β),$ it therefore follows that $\int_{0}^{\infty} S_{0} (s, w | α, β) {\hat{Λ}}_{0} (s, d w | α, β) = \sum_{i = 1}^{n} N_{i} (s, \infty),$ which is independent of $(α, β)$ . Upon substituting the ‘estimator’ ${\hat{Λ}}_{0} (s, t | α, β)$ for $Λ_{0} (t)$ in the argument of the exponential function in (12), the resulting term will not contribute to the profile likelihood for $(α, β)$ .

On the other hand, substituting ${\hat{Λ}}_{0} (s, w | α, β)$ for $Λ_{0} (w)$ in the first term of (12), we obtain the relevant portion of the profile likelihood of $(α, β)$ to be

L_{P} (s | α, β, D (s *)) = \prod_{i = 1}^{n} \prod_{j = 1}^{N_{i}^{†} (s)} {[\frac{ρ (j - 1; α) ψ [β^{t} X_{i} (S_{i j})]}{S_{0} [s, ℰ_{i} (S_{i j}) | α, β]}]}^{Δ N_{i}^{†} (S_{i j})} .

(13)

This process could also be viewed as the partial likelihood process for $(α, β)$ , which is a generalization of the partial likelihood for the Cox model (cf., Cox, 1972a, 1975; Andersen and Gill, 1982). The logarithm of the profile likelihood could be conveniently expressed in integral form via

\begin{array}{l} l_{P} (s | α, β, D (s *)) = \sum_{i = 1}^{n} \int_{0}^{s} [log ρ [N_{i}^{†} (v -); α] + log ψ (β^{t} X_{i} (v)) - \\ log S_{0} (s, ℰ_{i} (v) | α, β)] N_{i}^{†} (d v) . \end{array}

(14)

From this profile likelihood, the estimators of $α$ and $β$ will be obtained. It is easy to see that the estimating equations for the profile maximum likelihood estimators are

\sum_{i = 1}^{n} \int_{0}^{s *} [\frac{\frac{\partial}{\partial α} ρ [N_{i}^{†} (v -); α]}{ρ [N_{i}^{†} (v -); α]} - \frac{\frac{\partial}{\partial α} S_{0} (s, ℰ_{i} (v) | α, β)}{S_{0} (s, ℰ_{i} (v) | α, β)}] N_{i}^{†} (d v) = 0;

(15)

\sum_{i = 1}^{n} \int_{0}^{s *} [\frac{\frac{\partial}{\partial β} ψ (β^{t} X_{i} (v))}{ψ (β^{t} X_{i} (v))} - \frac{\frac{\partial}{\partial β} S_{0} (s, ℰ_{i} (v) | α, β)}{S_{0} (s, ℰ_{i} (v) | α, β)}] N_{i}^{†} (d v) = 0 .

(16)

Because $N_{i}^{†} (\cdot)$ is a step process with a finite number of jumps, then both of these estimating equations are finite sums with respect to the calendar times $S_{i j} s$ . Also, just like estimating equations in simpler models, such as for the Cox proportional hazards model, it is clear that numerical techniques will be needed to obtain the estimates $\hat{α}$ and $\hat{β}$ .

Upon obtaining the estimators $\hat{α}$ and $\hat{β}$ from the estimating equations (15) and (16), the estimator of $Λ_{0} (t)$ based on the realizations of the observables over $[0, s *]$ is obtained by substituting ( $\hat{α}, \hat{β}$ ) for $(α, β)$ in the expression of ${\hat{Λ}}_{0} (s *, t | α, β)$ given in (11). Thus,

{\hat{Λ}}_{0} (s *, t) = \int_{0}^{t} {\frac{J (s *, w | \hat{α}, \hat{β})}{S_{0} (s *, w | \hat{α}, \hat{β})}} {\sum_{i = 1}^{n} N_{i} (s *, d w)} .

(17)

Finally, for an estimator of the baseline survivor function associated with $Λ_{0} (\cdot)$ defined via ${\bar{F}}_{0} (t) = exp {- Λ_{0} (t)},$ by the product-integral representation and the substitution principle, we obtain

\hat{\bar{F}} (s^{*}, t) = \prod_{w = 0}^{t} [1 - {\hat{Λ}}_{0} (s *, d w)] = \prod_{w = 0}^{t} [1 - \frac{\sum_{i = 1}^{n} N_{i} (s *, d w)}{S_{0} (s *, w | \hat{α}, \hat{β})}] .

(18)

This estimator is of a product-limit type analogous to those arising in the estimation of the baseline survivor function in the Cox proportional hazards model or the multiplicative intensity model (see Cox, 1972a; Andersen and Gill, 1982).

For the i.i.d. interoccurrence times model in Example $2.1$ , which obtains when $ψ (w) = 1$ (no covariate effects), $ρ (w) = 1$ (no effects of accumulating event occurrences), and $ℰ_{i} (s) = s - S_{N_{i}^{†} (s -)}$ (upon each event occurrence, effective age is reset to zero, so this is just the backward recurrence time), the estimator of ${\bar{F}}_{0} (t)$ in (18) simplifies to that considered in Peña et al. (2001). Note, in particular, that for this special model, $ℰ'_{i} (s) = 1$ , and since $ℰ_{i j - 1} (S_{i j}) = S_{i j} - S_{i j - 1 =} T_{i j}$ , then the process $Y_{i} (s, w)$ simplifies to

Y_{i} (s, w) = \sum_{j = 1}^{N_{i}^{†} (s -)} I {T_{i j} \geq w} + I {min (s, τ_{i}) - S_{i N_{i}^{†} (s -)} \geq w},

which is the natural at-risk process for the gap times over the observation period $[0, s]$ .

4 Estimation of Parameters: Model with Frailties

We now consider the estimation of the parameters when the class of models includes frailties. It will be assumed that the frailties $Z_{1}, Z_{2}, ..., Z_{n}$ are i.i.d. from a distribution $H (\cdot | ξ)$ where $ξ \in Ξ \subseteq ℜ^{r}$ . A common choice for this $H$ , which we adopt here, is the gamma distribution with unit mean and variance 1/ $ξ, H = Gamma (ξ, ξ)$ . Imposing the restriction that the gamma shape and scale parameters are identical, together with the identifiability conditions for the model without frailty stated in the beginning of Section 3, is needed to have model identifiability. We do not provide a rigorous proof of this identifiability result since it will lead us to excursions into product spaces and measures and ideas behind identifiability proofs for mixture models, but see, for example, Parner (1998) for such ideas. Recall at this stage that the $Z_{i}$ are not observed. For the model at hand, the conditional intensity function is as given in (5), which for convenience is again displayed below:

λ_{i} (s | Z_{i}, X_{i}) = Z_{i} λ_{0} [ℰ_{i} (s)] ρ [N_{i}^{†} (s -); α] ψ (β^{t} X_{i} (s)) .

To achieve brevity, we let $θ \equiv (Λ_{0} (\cdot), α, β, ξ)$ . If the $Z = z = (z_{1}, z_{2, . . .,} z_{n})$ are observed, the complete likelihood process for the model parameters $θ$ is given by

\begin{array}{l} L_{C}^{†} (s * | θ, z, D (s *)) = \prod_{i = 1}^{n} [\frac{ξ^{ξ}}{Γ (ξ)} z_{i}^{ξ - 1} exp {- ξ z_{i}} \times \\ {\prod_{v = 0}^{s *} {[z_{i} Y_{i}^{†} (v) λ_{0} [ℰ_{i} (v)] ρ [N_{i}^{†} (v -); α] ψ [β^{t} X_{i} (v)]]}^{N_{i}^{†} (Δ v)}} \times \\ exp {- \int_{0}^{s *} z_{i} Y_{i}^{†} (v) λ_{0} [ℰ_{i} (v)] ρ [N_{i}^{†} (v -); α] ψ [β^{t} X_{i} (v)] d v}] . \end{array}

(19)

Since the $Z_{i}$ are unobserved, integrating them out in (19) yields the full likelihood process, which is

\begin{array}{l} L_{F} (s * | θ, D (s *)) = \prod_{i = 1}^{n} {[\frac{Γ (ξ + N_{i}^{†} (s *))}{Γ (ξ)}] \times \\ {[\frac{ξ}{ξ + \int_{0}^{s *} Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)] d v}]}^{ξ + N_{i}^{†} (s *)} \times \\ (\prod_{v = 0}^{s *} {[\frac{Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)]}{ξ}]}^{N_{i}^{†} (Δ v)})} . \end{array}

(20)

The maximum likelihood estimators of the model parameters are the maximizers of this full likelihood process, with the proviso that the maximizing $Λ_{0} (\cdot)$ jumps only at observed values of $ℰ_{i} (S_{i j})$ . The expectation-maximization (EM) algorithm described in the sequel finds this set of maximizers.

In estimating the model parameters $ξ, Λ_{0} (\cdot), α$ , and $β$ , we generalize and extend the approach implemented in Peña et al. (2001) which dealt with the frailty model without covariates, and without the $ρ (α, α)$ term, and with $ℰ_{i} (s) = s - S_{i N_{i}^{†} (s -)}$ . The computations of the estimates will be facilitated through the EM algorithm introduced by Dempster et al. (1977), and implemented in counting process frailty models by Nielsen et al. (1992). The main ingredients of this algorithm for the general class of recurrent event models are as follows. For the expectation-step, given $θ$ and $D (s *)$ , the conditional expectations of $Z_{i}$ and log $(Z_{i})$ are, respectively,

E {Z_{i} | θ, D (s *)} = \frac{ξ + N_{i}^{†} (s *)}{ξ + \int_{0}^{s *} Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)] d v};

(21)

E {log Z_{i} | θ, D (s *)} = DG [ξ + N_{i}^{†} (s *)] - log [ξ + N_{i}^{†} (s *)] + log [E {Z_{i} | θ, D (s *)}],

(22)

where $DG (\cdot)$ is the di-gamma function, that is, $DG (α) = \frac{d}{d α} log Γ (α) .$ For the maximization-step, with $l_{C}^{†}$ denoting the logarithm of the complete likelihood function $L_{C}^{†}$ and with $E_{Z | θ (0)}$ denoting expectation with respect to $Z$ when the parameter vector equals $θ^{(0)}$ , define the function

\begin{array}{l} Q (θ | θ^{(0)}, D (s *)) = E_{Z | θ^{(0)}} {l_{C}^{†} (s * | θ, Z, D (s *))} \\ = n ξ log ξ - n log Γ (ξ) + \sum_{i = 1}^{n} (N_{i}^{†} (s *) + ξ - 1) {\hat{l o g Z}}_{i}^{(1)} - \\ \sum_{i = 1}^{n} (ξ + \int_{0}^{s *} Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)] d v) {\hat{Z}}_{i}^{(1)} + \\ \sum_{i = 1}^{n} \int_{0}^{s *} log (Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)]) N_{i}^{†} (d v), \end{array}

where

{\hat{l o g Z}}_{i}^{(1)} \equiv E {log Z_{i} | θ^{(0)}, D (s *)} and {\hat{Z}}_{i}^{(1)} \equiv E {Z_{i} | θ^{(0)}, D (s *)} .

In this maximization step, the function $Q (θ | θ^{(0)}, D (s^{*}))$ is maximized with respect to $θ$ . This is achieved by separate maximization of the mappings given by

\begin{array}{l} (Λ_{0} (\cdot), α, β) \mapsto \sum_{i = 1}^{n} {\int_{0}^{s *} log [{\hat{Z}}_{i}^{(1)} Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v)]] N_{i}^{†} (d v) - \\ \int_{0}^{s *} {\hat{Z}}_{i}^{(1)} Y_{i}^{†} (v) ρ [N_{i}^{†} (v -); α] ψ (β^{t} X_{i} (v)) λ_{0} [ℰ_{i} (v) d v}; \end{array}

(23)

ξ \mapsto n ξ log ξ - n log Γ (ξ) + ξ \sum_{i = 1}^{n} {\hat{l o g Z}}_{i}^{(1)} - ξ \sum_{i = 1}^{n} {\hat{Z}}_{i}^{^{(1)}} .

(24)

For the maximization of the mapping in (23), we basically adopt the procedures developed in the case without frailties. Examining the mapping, we note that the only difference with the case without frailties is that $Y_{i}^{†} (v)$ gets replaced by $Z_{i} Y_{i} (v)$ . Consequently, given $Z = (Z_{1, . . . .,} Z_{n}), α, β$ , and the data $D (s *)$ , the ‘estimator’ of $Λ_{0} (\cdot)$ is given by

{\hat{Λ}}_{0} (s *, t | Z, α, β) = \int_{0}^{t} {\frac{J (s *, w | Z, α, β)}{S_{0} (s *, w | Z, α, β)}} {\sum_{i = 1}^{n} N_{i} (s *, d w)},

(25)

where $J (s, w | Z, α, β) = I {S_{0} (s, w | Z; α, β) > 0}$ with $S_{0} (s, w | Z, α, β) = \sum_{i = 1}^{n} Z_{i} Y_{i} (s, w | α, β)$ . Analogously to the estimating equations for $α$ and $β$ in the model without frailties in (15) and (16), given $Z$ and ${\hat{Λ}}_{0} (s *, \cdot | Z, α, β)$ , we may estimate $α$ and $β$ by solving the estimating equations

\sum_{i = 1}^{n} \int_{0}^{s *} [\frac{\frac{\partial}{\partial α} ρ [N_{i}^{†} (v -); α]}{ρ [N_{i}^{†} (v -); α]} - \frac{\frac{\partial}{\partial α} S_{0} (s, ℰ_{i} (v) | Z, α, β)}{S_{0} (s, ℰ_{i} (v) | Z, α, β)}] N_{i}^{†} (d v) = 0;

(26)

\sum_{i = 1}^{n} \int_{0}^{s *} [\frac{\frac{\partial}{\partial β} ψ (β^{t} X_{i} (v))}{ψ (β^{t} X_{i} (v))} - \frac{\frac{\partial}{\partial β} S_{0} (s, ℰ_{i} (v) | Z, α, β)}{S_{0} (s, ℰ_{i} (v) | Z, α, β)}] N_{i}^{†} (d v) = 0,

(27)

which we implemented through a Newton-Raphson procedure. For the maximization of mapping (24), we also implemented the Newton-Raphson procedure, though clearly there are other options for maximizing this mapping.

With these ingredients at hand, the EM recipe for obtaining the estimates of the model parameters in this general model with frailties is described by the following steps:

Step 0 (Initialization)

Specify initial estimates ${\hat{ξ}}^{(0)}, {\hat{α}}^{(0)}$ and ${\hat{β}}^{(0)}$ of $ξ, α$ , and $β$ , respectively. By setting ${\hat{Z}}_{i}^{(0)} = 1, i = 1, 2, ..., n$ , obtain the initial estimate of $Λ_{0} (\cdot)$ via

{\hat{Λ}}_{0}^{(0)} (s *, t | {\hat{Z}}^{(0)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)}) = \int_{0}^{t} {\frac{J (s *, w | {\hat{Z}}^{(0)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)})}{S_{0} (s *, w | {\hat{Z}}^{(0)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)})}} {\sum_{i = 1}^{n} N_{i} (s *, d w)}

which is just the ‘estimator’ in (11) under the model without frailties.

Step 1 (E-step)

Given $({\hat{ξ}}^{(0)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)}),$ and ${\hat{Λ}}_{0}^{(0)} (s *, \cdot | {\hat{Z}}^{(0)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)}),$ obtain ${\hat{Z}}_{i}^{(1)}$ and ${\hat{l o g Z}}_{i}^{(1)}$ via formulas (21) and (22). Denote by ${\hat{Z}}^{(1)} = ({\hat{Z}}_{1}^{(1)}, \dots, {\hat{Z}}_{n}^{(1)}) .$ By exploiting the property that the estimator ${\hat{Λ}}_{0} (\cdot)$ is a step function, these quantities could be obtained according to the following expressions: For $i = 1, 2, ..., n$ ,

{\hat{Z}}_{i}^{(1)} = \frac{{\hat{ξ}}^{(0)} + N_{i}^{†} (s *)}{{\hat{ξ}}^{(0)} + {\hat{A}}_{i} (s *; {\hat{Λ}}_{0}^{(0)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)})};

(28)

{\hat{l o g Z}}_{i}^{(1)} = DG ({\hat{ξ}}^{(0)} + N_{i}^{†} (s *)) - log ({\hat{ξ}}^{(0)} + N_{i}^{†} (s *)) + log {\hat{Z}}_{i}^{(1)},

(29)

where with $t_{(1)} < t_{(2)} < \dots < t_{(D)}$ being the $D$ distinct jump times of ${\hat{Λ}}_{0}^{(0)} (s *, \cdot)$ and ${\hat{λ}}_{0}^{(0)} (s *, t_{(l)}) = {\hat{Λ}}_{0}^{(0)} (s *, t_{(l)}) - {\hat{Λ}}_{0}^{(0)} (s *, t_{(l)} -)$ is the jump of ${\hat{Λ}}_{0}^{(0)} (s *, \cdot)$ at $t = t_{(l)}$ , we have

{\hat{A}}_{i}^{(0)} \equiv \hat{A_{i}} (s *; {\hat{Λ}}_{0}^{(0)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)}) = \sum_{l = 1}^{D} Y_{i} (s *, t_{(l)} | {\hat{α}}^{(0)}, {\hat{β}}^{(0)}) {\hat{λ}}_{0}^{(0)} (s *, t_{(l)}) .

(30)

Step 2 (M-step #1)

Applying formula (25), obtain ${\hat{Λ}}_{0}^{(1)} (s *, t | {\hat{Z}}^{(1)}, {\hat{α}}^{(0)}, {\hat{β}}^{(0)}) .$

Step 3 (M-step #2)

After substituting ${\hat{Z}}^{(1)}$ for $Z$ in the estimating equations (26) and (27), obtain the solutions of these equations and denote them by ${\hat{α}}^{(1)}$ and ${\hat{β}}^{(1)}$ .

Step 4 (M-step #3)

Obtain ${\hat{ξ}}^{(1)}$ by maximizing the mapping in (24) in $ξ$ . Alternatively, for this step, we may obtain ${\hat{ξ}}^{(1)}$ by maximizing the full likelihood in (20) with respect to $ξ$ given the current values $({\hat{Λ}}_{0}^{(1)} (\cdot), α^{(1)}, β^{(1)}) .$ Through our numerical investigations, and via a mathematical proof (see appendix), using this alternative step also leads to the maximizing values of the full likelihood. In the simulation studies, the code using this alternative implementation was utilized.

Step 5 (Convergence)

Compare the values $(Z^{(1)}, θ^{(1)})$ with the values $(Z^{(0)}, θ^{(0)})$ , according to some distance function, e.g., Euclidean distance. If the distance between the old and the new values satisfies a tolerance criterion, the algorithm terminates and the estimates are the final values in the iteration. If the distance criterion is not satisfied, then replace $(ξ^{(0)}, θ^{(0)})$ by $(ξ^{(1)}, θ^{(1)})$ , and proceed to Step 1 of the algorithm. Because of the possibility of very large, possibly infinite, estimates of $ξ$ , corresponding to the situation of approximate ‘uncorrelatedness,’ when comparing old and new iterates for $ξ$ , we compare instead the associated values for $η = ξ = (1 + ξ)$ since this ratio takes values in $(0, 1]$ .

Having obtained an estimator of the baseline hazard function $Λ_{0} (\cdot)$ given by ${\hat{Λ}}_{0} (s *, \cdot)$ , through the product integral representation, the semiparametric estimator of the baseline survivor function ${\bar{F}}_{0} (\cdot)$ for this model with frailty is ${\hat{\bar{F}}}_{0} (s *, t) = \prod_{{w : w \leq t}} [1 - {\hat{Λ}}_{0} (s * -, d w)]$ . A computational implementation of the procedures and algorithms described in Sections 3 and 4 have been implemented in an R package (Ihaka and Gentleman, 1996) called gcmrec in González et al. (2003).

5 Properties of Estimators

5.1 Simulation Design

We performed computer simulation studies to examine numerically the properties of the parameter estimators developed in Sections 3 and 4. The specific goals of these studies are: (i) to examine the effect of sample size $(n)$ on the distributional properties of the estimators; (ii) to examine the bias, variance, and root-mean-square error (rmse) of the estimators; (iii) to examine the performance of the semiparametric estimator of the baseline survivor function ${\bar{F}}_{0}$ in terms of its bias function, variance function, and root-mean-squared error function at specified time points. The latter function is based on the loss function $L (\hat{\bar{F}} (t), \bar{F} (t)) = {(\hat{\bar{F}} (t) - \bar{F} (t))}^{2}$ ; (iv) to examine the consequences when data that have been generated with frailty components are analyzed using the model without frailties, an under-specified model; and (v) to examine the consequences, such as the loss in efficiency, when data that were generated using the model without frailties are analyzed with methods developed under the model with frailties, an over-specified model. For the first three items, simulation runs were performed for both the frailty-less model and for the model with frailty. We describe the settings for the different simulation parameters.

Sample Size

To examine the impact of sample size, we choose two values of $n : n \in {30, 50}$ . Though we do not report results here, we also performed simulation runs with $n = 10$ , which may not be realistic in biomedical and public health studies since they will usually have many subjects. However, small sample sizes may arise in the reliability and engineering settings, as in the hydraulic data set example. The simulation runs with $n = 10$ did provide us some insights of the limitations of the numerical procedures for obtaining the estimates, such as non-convergence or convergence to a minimizing, instead of a maximizing, value of the likelihood.

Censoring Mechanism

The censoring variables $τ_{i}$ , $i = 1, 2, ..., n$ , are generated according to a uniform distribution over $[0, B]$ where $B$ is chosen in order that under perfect repair (i.e., $ℰ (s) = s - S_{N^{†} (s -)}$ ) and with $α = 1$ , there are, on average, approximately 10 events per unit. Moreover, to place an upper limit to the number of events that could occur for a unit, when the number of events for a unit reaches 50 then we cease observing this unit and set $τ_{i} = S_{i, 50}$ . This has the potential consequence of introducing some bias because this amounts to doing a combination of Type II and random censoring. Nevertheless, because the value of 50 is large enough, we conjecture that the bias introduced is negligible.

ρ Function

The $ρ$ function which handles the impact of accumulating event occurrences is assumed to be of form $ρ (k, α) = α^{k}$ with $α \in {0.9, 1.0, 1.05}$ , which models the situations where an increasing number of event occurrences has a beneficial effect, has no effect, or has an adverse effect, respectively.

Effective Age Function

For the simulation studies we considered an effective age process corresponding to the general imperfect repair model (see Example 2.2) with perfect repair probability of $0.6$ . Recall that the upper bound for the uniform censoring was determined under the perfect repair model and with $α = 1$ to have an average of approximately 10 events per unit. Because this did not take into consideration the exact form of $ℰ (\cdot)$ and $α$ , the effective average number of events per unit in the simulations may either be smaller or larger than 10. This is a consequence of the interplay among the baseline hazard rate function (if it is increasing failure rate (IFR) or decreasing failure rate (DFR)), the minimal repairs performed, and the effect of increasing number of event occurrence quantified by $α$ .

Baseline Survivor Function

For the baseline hazard function $λ_{0} (\cdot)$ we choose the flexible and commonly-used Weibull hazard function, with a unit scale parameter and shape parameter $(°)$ taking values in ${.9, 2}$ , the former leading to a DFR distribution, and the latter giving rise to an IFR distribution. Note that the estimation procedure proposed is semiparametric, hence the scale and shape parameters of this Weibull baseline distribution are not estimated.

Covariates

We consider a two-dimensional covariate vector $(X_{1}, X_{2})$ with $X_{1}$ having a Bernoulli distribution with success probability of $0.5$ , $X_{2}$ having a standard normal distribution, and with $X_{1}$ and $X_{2}$ stochastically independent. The regression coefficient vector $(β_{1}, β_{2})$ is set to be $(1, - 1)$ . The fact that the grouping induced by the first covariate is done using a symmetric Bernoulli mechanism leads sometimes to highly asymmetric allocations for some simulation replicates, which was the cause of some convergence problems in the iterative procedure when $n = 10$ .

Frailty Component

The parameter $ξ$ of the gamma distribution governing the frailty variable was set to ${2, 6, \infty}$ , with $\infty$ corresponding to the absence of frailties. With respect to the parametrization $ξ \mapsto η = ξ / (1 + ξ)$ , these frailty values convert to having $η \in {\frac{2}{3}, \frac{6}{7}, 1}$ .

For each combination of these simulation parameters, $1000$ replications were performed. In the analysis, we set $s * = 10$ . Also, to create the bias, variance, and root-mean-squared-error curves for the estimator of the baseline survivor function, we choose time values corresponding to the ${0, 1, 2, ..., 99}$ th percentiles of the true baseline distribution function.

5.2 Discussions of Simulation Results

In the discussion of the simulation results that follows, we will focus on the effects of changing $n$ , changing $ξ$ or $η$ , changing $α$ , and changing $γ$ , on the distributional properties of the estimators of $α, β$ , and $η$ , as well as the estimator of the baseline survivor function ${\bar{F}}_{0}$ . In addition, we address the consequences of analyzing data that follows the general model with frailties using procedures developed for the general model without frailties, an under-specification; and also consider the impact of over-specification, which is the situation where procedures developed under the model with frailties are utilized to analyze data from a model without frailties. Such analyses will provide information on which type of mis-specification is of a more serious type.

Results of the simulation studies are presented in Tables 1–3. Table 1 summarizes the mean values and standard deviations (i.e., standard errors of the estimates) of the sampling distributions of the estimators of $α, β_{1}, β_{2}$ , and $η$ for $α$ values of $0.9, 1.0$ , and $1.05$ as $n$ varies in the set ${30, 50}$ . We do not show the cases with $n = 10$ to conserve space. Table 2 contains means and standard deviations summaries of the simulation runs pertaining to the under- and over-specified analysis. Table 3 contains plots of the bias and rmse curves for the estimator of ${\bar{F}}_{0}$ under the case where $α = .9$ for $ξ \in {2, 6, \infty}$ with the plots for different values of $n$ superimposed on each plot frame for a Weibull shape parameter of $γ = 2.0$ .

Table 1.

Summary of simulated means and standard deviations of the estimators of $α, β$ , and $η = ξ / (1 + ξ)$ . The true value of $β$ is $(1; - 1)$ , and 1000 replications were run for each parameter combination. The other columns of this table are: $γ$ denotes the Weibull shape parameter; $n$ is the sample size; NC is the number of replicates in which there was no convergence; ${\hat{μ}}_{E v}$ is the observed mean number of events per unit in all the simulation replications.

	$α$	$γ$	$ξ$	$η$	$n$	NC	${\hat{μ}}_{E v}$	$\hat{α}$	${\hat{σ}}_{\hat{α}}$	${\hat{β}}_{1}$	${\hat{σ}}_{{\hat{β}}_{1}}$	${\hat{β}}_{2}$	${\hat{σ}}_{{\hat{β}}_{2}}$	$\hat{η}$
A2	$0.9$	$0.9$	$2$	$0.67$	$30$	$0$	$4.1$	$0.898$	$0.031$	$1.012$	$0.379$	$- 1.008$	$0.240$	$0.734$
A3	$0.9$	$0.9$	$2$	$0.67$	$50$	$0$	$5.2$	$0.899$	$0.021$	$1.017$	$0.287$	$- 1.004$	$0.165$	$0.705$
A5	$0.9$	$0.9$	$6$	$0.86$	$30$	$0$	$4.3$	$0.900$	$0.030$	$0.988$	$0.300$	$- 1.015$	$0.175$	$0.904$
A6	$0.9$	$0.9$	$6$	$0.86$	$50$	$0$	$5.3$	$0.899$	$0.021$	$0.998$	$0.221$	$- 1.000$	$0.136$	$0.884$
A8	$0.9$	$0.9$	$\infty$	$1.00$	$30$	$0$	$4.8$	$0.893$	$0.025$	$1.031$	$0.222$	$- 1.030$	$0.135$
A9	$0.9$	$0.9$	$\infty$	$1.00$	$50$	$0$	$4.4$	$0.895$	$0.018$	$1.024$	$0.158$	$- 1.023$	$0.104$
A11	$0.9$	$2.0$	$2$	$0.67$	$30$	$0$	$7.8$	$0.902$	$0.016$	$1.010$	$0.348$	$- 1.018$	$0.202$	$0.721$
A12	$0.9$	$2.0$	$2$	$0.67$	$50$	$0$	$6.7$	$0.902$	$0.012$	$0.994$	$0.271$	$- 1.012$	$0.144$	$0.710$
A14	$0.9$	$2.0$	$6$	$0.86$	$30$	$0$	$8.9$	$0.900$	$0.016$	$1.009$	$0.236$	$- 1.008$	$0.135$	$0.895$
A15	$0.9$	$2.0$	$6$	$0.86$	$50$	$0$	$7.2$	$0.900$	$0.012$	$0.998$	$0.173$	$- 1.004$	$0.101$	$0.882$
A17	$0.9$	$2.0$	$\infty$	$1.00$	$30$	$0$	$8.4$	$0.898$	$0.015$	$1.017$	$0.155$	$- 1.014$	$0.095$
A18	$0.9$	$2.0$	$\infty$	$1.00$	$50$	$0$	$7.4$	$0.899$	$0.011$	$1.003$	$0.112$	$- 1.007$	$0.072$
B2	$1$	$0.9$	$2$	$0.67$	$30$	$2$	$9.5$	$1.000$	$0.011$	$1.010$	$0.374$	$- 1.000$	$0.227$	$0.735$
B3	$1$	$0.9$	$2$	$0.67$	$50$	$0$	$8.7$	$1.000$	$0.007$	$0.989$	$0.280$	$- 1.002$	$0.165$	$0.704$
B5	$1$	$0.9$	$6$	$0.86$	$30$	$0$	$7.7$	$1.000$	$0.012$	$1.014$	$0.286$	$- 0.993$	$0.164$	$0.901$
B6	$1$	$0.9$	$6$	$0.86$	$50$	$0$	$7.3$	$1.000$	$0.007$	$1.013$	$0.201$	$- 0.999$	$0.118$	$0.880$
B8	$1$	$0.9$	$\infty$	$1.00$	$30$	$0$	$8.1$	$0.998$	$0.008$	$1.029$	$0.185$	$- 1.024$	$0.114$
B9	$1$	$0.9$	$\infty$	$1.00$	$50$	$0$	$9.1$	$0.999$	$0.006$	$1.010$	$0.130$	$- 1.012$	$0.084$
B11	$1$	$2.0$	$2$	$0.67$	$30$	$0$	$9.5$	$1.000$	$0.008$	$1.016$	$0.336$	$- 1.028$	$0.194$	$0.725$
B12	$1$	$2.0$	$2$	$0.67$	$50$	$0$	$13.0$	$1.000$	$0.006$	$1.004$	$0.258$	$- 1.012$	$0.146$	$0.705$
B14	$1$	$2.0$	$6$	$0.86$	$30$	$0$	$13.8$	$1.000$	$0.008$	$1.006$	$0.228$	$- 1.002$	$0.132$	$0.889$
B15	$1$	$2.0$	$6$	$0.86$	$50$	$0$	$10.8$	$1.000$	$0.006$	$1.003$	$0.168$	$- 1.001$	$0.097$	$0.876$
B17	$1$	$2.0$	$\infty$	$1.00$	$30$	$0$	$14.0$	$0.999$	$0.007$	$1.017$	$0.133$	$- 1.010$	$0.083$
B18	$1$	$2.0$	$\infty$	$1.00$	$50$	$0$	$11.2$	$1.000$	$0.005$	$1.010$	$0.099$	$- 1.006$	$0.065$
C2	$1.05$	$0.9$	$2$	$0.67$	$30$	$3$	$11.8$	$1.051$	$0.007$	$0.994$	$0.366$	$- 0.994$	$0.222$	$0.730$
C3	$1.05$	$0.9$	$2$	$0.67$	$50$	$0$	$9.7$	$1.050$	$0.004$	$1.009$	$0.284$	$- 0.993$	$0.153$	$0.703$
C5	$1.05$	$0.9$	$6$	$0.86$	$30$	$1$	$12.9$	$1.051$	$0.007$	$1.002$	$0.271$	$- 0.993$	$0.160$	$0.899$
C6	$1.05$	$0.9$	$6$	$0.86$	$50$	$0$	$13.9$	$1.050$	$0.005$	$1.006$	$0.196$	$- 0.992$	$0.119$	$0.880$
C8	$1.05$	$0.9$	$\infty$	$1.00$	$30$	$0$	$10.9$	$1.049$	$0.007$	$1.020$	$0.154$	$- 1.012$	$0.101$
C9	$1.05$	$0.9$	$\infty$	$1.00$	$50$	$0$	$13.8$	$1.050$	$0.004$	$1.009$	$0.121$	$- 1.006$	$0.072$
C11	$1.05$	$2.0$	$2$	$0.67$	$30$	$0$	$12.3$	$1.050$	$0.006$	$1.026$	$0.336$	$- 1.018$	$0.184$	$0.726$
C12	$1.05$	$2.0$	$2$	$0.67$	$50$	$0$	$13.4$	$1.050$	$0.005$	$1.008$	$0.248$	$- 1.012$	$0.136$	$0.705$
C14	$1.05$	$2.0$	$6$	$0.86$	$30$	$0$	$10.9$	$1.050$	$0.006$	$1.019$	$0.225$	$- 1.000$	$0.124$	$0.890$
C15	$1.05$	$2.0$	$6$	$0.86$	$50$	$0$	$14.3$	$1.050$	$0.004$	$0.997$	$0.166$	$- 1.000$	$0.096$	$0.876$
C17	$1.05$	$2.0$	$\infty$	$1.00$	$30$	$0$	$18.5$	$1.050$	$0.005$	$1.004$	$0.123$	$- 1.010$	$0.076$
C18	$1.05$	$2.0$	$\infty$	$1.00$	$50$	$0$	$13.5$	$1.050$	$0.004$	$1.004$	$0.090$	$- 1.003$	$0.054$

Open in a new tab

Table 3.

Bias and root mean squared error curves for the estimator of the baseline survivor function as the sample size $(n)$ varies [ $n = 10$ is red; $n = 30$ is blue; $n = 50$ is green]. This is for the case where $α$ = $.90$ and a Weibull shape parameter of $γ$ = $2.0$ .

graphic file with name nihms10128f2.jpg

Open in a new tab

Table 2.

Summary of simulated means and standard deviations for the estimators of $α, β_{1}$ , and $β_{2}$ for the situation of under-specification (label UVW) and over-specification (label XYZ). The true regression coefficients are $β = (1; - 1)$ and 1000 replications were run for each parameter combination.

	$α$	$γ$	$ξ$	$n$	NC	${\hat{μ}}_{\hat{α}}$	${\hat{σ}}_{\hat{α}}$	${\hat{μ}}_{{\hat{β}}_{1}}$	${\hat{σ}}_{{\hat{β}}_{1}}$	${\hat{μ}}_{{\hat{β}}_{2}}$	${\hat{σ}}_{{\hat{β}}_{2}}$
U2.	$0.90$	$0.9$	$2$	$30$	$0$	$0.954$	$0.031$	$0.779$	$0.322$	$- 0.770$	$0.210$
U3	$0.90$	$0.9$	$2$	$50$	$0$	$0.959$	$0.023$	$0.747$	$0.239$	$- 0.740$	$0.154$
U5	$0.90$	$0.9$	$6$	$30$	$0$	$0.921$	$0.028$	$0.898$	$0.285$	$- 0.919$	$0.168$
U6	$0.90$	$0.9$	$6$	$50$	$0$	$0.923$	$0.020$	$0.883$	$0.212$	$- 0.888$	$0.131$
U8	$0.90$	$2.0$	$2$	$30$	$0$	$0.952$	$0.022$	$0.719$	$0.297$	$- 0.728$	$0.187$
U9	$0.90$	$2.0$	$2$	$50$	$0$	$0.956$	$0.017$	$0.700$	$0.223$	$- 0.707$	$0.139$
U11	$0.90$	$2.0$	$6$	$30$	$0$	$0.920$	$0.018$	$0.909$	$0.220$	$- 0.901$	$0.138$
U12	$0.90$	$2.0$	$6$	$50$	$0$	$0.922$	$0.013$	$0.879$	$0.167$	$- 0.879$	$0.101$
V2	$1.00$	$0.9$	$2$	$30$	$0$	$1.019$	$0.014$	$0.771$	$0.324$	$- 0.751$	$0.215$
V3	$1.00$	$0.9$	$2$	$50$	$0$	$1.020$	$0.009$	$0.726$	$0.251$	$- 0.715$	$0.157$
V5	$1.00$	$0.9$	$6$	$30$	$0$	$1.008$	$0.011$	$0.913$	$0.271$	$- 0.888$	$0.172$
V6	$1.00$	$0.9$	$6$	$50$	$0$	$1.009$	$0.008$	$0.886$	$0.198$	$- 0.868$	$0.129$
V8	$1.00$	$2.0$	$2$	$30$	$0$	$1.024$	$0.012$	$0.711$	$0.291$	$- 0.723$	$0.191$
V9	$1.00$	$2.0$	$2$	$50$	$0$	$1.024$	$0.009$	$0.685$	$0.221$	$- 0.695$	$0.136$
V11	$1.00$	$2.0$	$6$	$30$	$0$	$1.009$	$0.009$	$0.885$	$0.224$	$- 0.879$	$0.137$
V12	$1.00$	$2.0$	$6$	$50$	$0$	$1.009$	$0.008$	$0.871$	$0.173$	$- 0.867$	$0.104$
W2	$1.05$	$0.9$	$2$	$30$	$0$	$1.059$	$0.010$	$0.725$	$0.342$	$- 0.720$	$0.226$
W3	$1.05$	$0.9$	$2$	$50$	$0$	$1.058$	$0.006$	$0.696$	$0.261$	$- 0.691$	$0.158$
W5	$1.05$	$0.9$	$6$	$30$	$0$	$1.054$	$0.007$	$0.873$	$0.273$	$- 0.869$	$0.179$
W6	$1.05$	$0.9$	$6$	$50$	$0$	$1.053$	$0.005$	$0.851$	$0.198$	$- 0.842$	$0.128$
W8	$1.05$	$2.0$	$2$	$30$	$0$	$1.061$	$0.009$	$0.704$	$0.296$	$- 0.704$	$0.179$
W9	$1.05$	$2.0$	$2$	$50$	$0$	$1.062$	$0.007$	$0.686$	$0.224$	$- 0.684$	$0.138$
W11	$1.05$	$2.0$	$6$	$30$	$0$	$1.054$	$0.007$	$0.877$	$0.227$	$- 0.880$	$0.132$
W12	$1.05$	$2.0$	$6$	$50$	$0$	$1.054$	$0.005$	$0.870$	$0.162$	$- 0.870$	$0.104$
X2	$0.90$	$0.9$	$\infty$	$30$	$0$	$0.893$	$0.026$	$1.030$	$0.224$	$- 1.031$	$0.144$
X3	$0.90$	$0.9$	$\infty$	$50$	$2$	$0.895$	$0.018$	$1.030$	$0.173$	$- 1.022$	$0.105$
X5	$0.90$	$2.0$	$\infty$	$30$	$0$	$0.897$	$0.015$	$1.016$	$0.163$	$- 1.015$	$0.099$
X6	$0.90$	$2.0$	$\infty$	$50$	$2$	$0.898$	$0.011$	$1.014$	$0.115$	$- 1.015$	$0.076$
Y2	$1.00$	$0.9$	$\infty$	$30$	$6$	$0.998$	$0.010$	$1.023$	$0.186$	$- 1.022$	$0.116$
Y3	$1.00$	$0.9$	$\infty$	$50$	$1$	$0.999$	$0.006$	$1.019$	$0.136$	$- 1.019$	$0.086$
Y5	$1.00$	$2.0$	$\infty$	$30$	$2$	$0.999$	$0.007$	$1.013$	$0.138$	$- 1.011$	$0.084$
Y6	$1.00$	$2.0$	$\infty$	$50$	$0$	$0.999$	$0.006$	$1.010$	$0.100$	$- 1.007$	$0.066$
Z2	$1.05$	$0.9$	$\infty$	$30$	$6$	$1.050$	$0.005$	$1.015$	$0.162$	$- 1.011$	$0.099$
Z3	$1.05$	$0.9$	$\infty$	$50$	$6$	$1.050$	$0.004$	$1.017$	$0.112$	$- 1.013$	$0.073$
Z5	$1.05$	$2.0$	$\infty$	$30$	$2$	$1.050$	$0.005$	$1.014$	$0.126$	$- 1.008$	$0.078$
Z6	$1.05$	$2.0$	$\infty$	$50$	$2$	$1.050$	$0.004$	$1.005$	$0.094$	$- 1.005$	$0.055$

Open in a new tab

As is to be expected, for the simulation runs where there was no mis-specification, when the sample size increases, the performance of the estimators of the finite-dimensional parameters, as well as for the baseline survivor function, improved, with the biases decreasing and the standard errors also decreasing. This is also true for the over-specification runs. When the sample size is small, there is considerable over-estimation of $η = ξ = (1 + ξ)$ , though this bias decreases with increasing sample size. When there is under-specification however, all the estimators are extremely biased (see UVW-runs in Table 2), demonstrating the undesirable consequences of committing this under-specification. Regarding the effect of the frailty parameter $ξ$ , for estimating the finite-dimensional parameters, the amount of bias for $n = 30$ and $n = 50$ are negligible. The impact of the $ξ$ is on the standard errors of the estimators, with larger values of $ξ$ translating into less correlation, leading to smaller standard errors for the same sample size. When considering on the other hand the estimator ${\hat{\bar{F}}}_{0}$ of the baseline survivor function, by examining the curves in Table 3, as well as other curves from the simulations that are not shown here, we observe that the bias and rmse curves of this estimator decrease as $n$ increases, and the same could also be said as $ξ$ increases. Generally, the bias function is positive, and as is to be expected there is more bias and rmse in the middle portion of the survivor function.

Some care, however, must be observed when considering the effects of changing $α$ and changing Weibull shape parameter $γ$ in the context of the precision of the estimators because the interplay between these two parameters leads to differing observed number of events. To see this, examine the column ${\hat{μ}}_{E v}$ in Table 1, which represents the mean number of events observed per unit. In this table, we notice that when $α < 1$ and $γ < 1$ , the latter leading to a DFR Weibull baseline distribution, there tends to be a smaller number of observed events; whereas when $α > 1$ and $γ > 1$ , the latter making the Weibull baseline IFR, then there tends to be more events observed. These differences in the observed number of events can be explained by taking into account the minimal repair model considered in the simulation. In the first situation for instance, an $α$ value less than unity makes the unit less likely to have events as calendar time increases since more event occurrences become beneficial to the unit and, in addition, when a minimal repair is performed, then the DFR nature (because $γ < 1$ ) of the baseline distribution diminishes the rate of event occurrences thereby lengthening the inter-event times. Because the upper bound $B$ for the uniformly distributed follow-up time $τ$ was determined under $α = 1$ and with a backward recurrence time effective age corresponding to a perfect repair mechanism, the impact of $α < 1$ and $γ < 1$ is a smaller number of events compared to the target of approximately 10 events used in deriving $B$ . An analogous argument, but in the opposite direction, holds true when dealing with $α > 1$ and $γ > 1$ . The impact of the minimal repair effective age and its interplay with a DFR or IFR baseline distribution can be further seen from Table 1 with $α = 1$ , where we see that when the baseline distribution is DFR (IFR), the observed number of events per unit is less (more) than the target of approximately 10 events per unit used in deriving $B$ . A fascinating situation is when $α < 1$ and $γ > 1$ , or when $α > 1$ and $γ < 1$ , for the effects of $α$ and $γ$ are in opposite directions in the context of event occurrences. Examining the bottom portion of the A-runs in Table 1 and the upper half of the C-runs in Table 1, and with reference to the B-runs in this same table, we observe that for the chosen $α$ and $γ$ values in the simulation, there was a more pronounced effect of the $α$ values compared to the $γ$ values since when $α = .9$ and $γ = 2$ , the observed number of events is slightly below 10, whereas when $α = 1.05$ and $γ = .9$ , the observed number of events is more than 10. The greater effect of $α$ than $γ$ on the mean number of events is not surprising, because $γ$ was partially accommodated in the determination of the upper bound $B$ for the censoring distribution. Apart from the impact on the precision of the estimates arising from the varying number of events due to the combination of values of $α$ and $γ$ discussed above, the associate editor also perceptively pointed out that this interplay among whether $λ_{0} (\cdot)$ is IFR or DFR, whether $ρ (\cdot)$ is increasing or decreasing, and the form of the effective age $ℰ (\cdot)$ , will intrinsically impact the precision of the estimators. For instance, if $α < 1$ and $γ < 1$ with $ℰ (s) = s$ in the simulation model, when both the $α$ and $γ$ parameters are inducing a decrease in the number of events, the precision of their estimators will diminish since there is added uncertainty about their contributions to event occurrences. This decrease in the precision of estimators is a natural consequence of using a richer class of models which has the potential of better delineating the varied factors affecting event occurrences.

In the presence of model mis-specification, we find that under-specification leads to a non-negligible systematic bias that increases with $n$ and also with $α$ . In fact, for this type of mis-specification, we have observed that the mean of the process $W_{n} (s *, t) = \sqrt{n} [{\hat{\bar{F}}}_{0} (s *, t) - {\bar{F}}_{0} (t)]$ in $t$ does not converge to the zero function as $n$ increases, implying that with this mis-specification, the estimator ${\hat{\bar{F}}}_{0}$ may be inconsistent. In contrast, with over-specification, we find that there is no recognizable loss in efficiency compared to the correct analysis, though we observe some very slight increase in the standard errors of the finite-dimensional parameter estimators (see XYZ-runs in Table 2 and compare the standard deviations in the A9 row of Table 1 and the X3 row, B9 row of Table 1 with the Y3 row, and the C9 row of Table 1 and the Z3 row). This indicates that there is much to be gained in the context of robustness by simply fitting the frailty-based model since, if the data did come from the frailty model, then the analysis is correct, while if the data came from the frailty-less model, there is no significant efficiency loss incurred; whereas, if there is under-specification of the model, then the consequences are unacceptable if the data actually came from the model with frailty. This lends strong support that this new class of models provides a general and flexible class for fitting recurrent event data and provides an avenue for a robust method of analysis for real data sets.

In addition, we also examined the impact of mis-specifying the effective age process, which also has bearing in regards to the interplay between the $ρ (\cdot, \cdot)$ and $ℰ (\cdot)$ components of the class of models. We considered the same simulation model with a perfect repair probability of $0.6$ , and examined the impact of two types of effective age process mis-specification: that the interventions following event occurrences are all minimal repair, or that they are all perfect repair. The results (not shown) indicate an interesting interplay between the nature of the baseline survivor function (DFR/IFR) and the behavior of ${\hat{\bar{F}}}_{0}$ and $\hat{α}$ . We observed that under the minimal repair mis-specification, when ${\bar{F}}_{0}$ is DFR, ${\hat{\bar{F}}}_{0}$ exhibits negative bias and $\hat{α}$ is positively biased. Additionally for this mis-specification, when ${\bar{F}}_{0}$ is IFR, ${\hat{\bar{F}}}_{0}$ exhibits positive bias and $\hat{α}$ is positively biased. Alternately, when the mis-specification is perfect repair, an underlying baseline DFR (IFR) is associated with positive (negative) bias in ${\hat{\bar{F}}}_{0}$ and negative (positive) bias in $\hat{α}$ . We explain these findings as follows: When the model mistakenly assumes minimal repair at each event occurrence, it tends to overestimate the effective age of units. Hence, in the case of DFR, the model anticipates longer interevent times than are realized in the data, creating the negative bias, especially for larger interevent times, in the estimates of the baseline survivor function in this situation. In the case of IFR, the minimal repair mis-specification leads to longer interevent times in the data than are anticipated by the model, creating a positive bias in the estimated baseline survivor function. When a perfect repair is incorrectly assumed at each event occurrence, the model tends to underestimate the effective age of units. Hence, using reasoning analogous to that for the minimal repair mis-specification, there is positive (negative) bias in the estimated baseline survivor function in the case of DFR (IFR). Especially interesting is that this behavior induces biases also in the finite-dimensional parameter estimates, with $\hat{α}$ , in particular, evidently compensating such that $\hat{α}$ is positively biased when the baseline distribution is DFR, and negatively biased when this distribution is IFR. These results indicate the importance of monitoring the effective age process.

6 Applications to Real Data

The first application is to the bladder cancer data used in Wei et al. (1989), which can be obtained from the survival package (Lumley and Therneau, 2003) in the R Library. These data provide the times to recurrence of bladder cancer for $n = 85$ subjects. The covariates are $X_{1}$ , the treatment indicator ( $1 =$ placebo, $2 =$ thiotepa); $X_{2}$ , the size (in cm) of the largest initial tumor; and $X_{3}$ , the number of initial tumors. We first fitted the general model using the backward recurrence time $ℰ (s) = s - S_{N^{†} (s -)}$ as effective age. With $s * = 64$ , the maximum observation period, we fitted the general model without frailties, and obtained $\hat{α} = 0.9826$ and $({\hat{β}}_{1}, {\hat{β}}_{2}, {\hat{β}}_{3}) = (- 0.3188, - 0.0154, 0.1353)$ . These are also the estimates obtained when the general model with frailty is fitted since in that case $\hat{ξ} = 5432999 (\hat{η} \approx 1)$ , a very large value indicating that there is no need for the frailty component when the effective age is the backward recurrence time. Thus, using the approximate inverse of the partial likelihood information matrix from fitting the model without frailties, the associated estimated standard errors are $.0736$ for $\hat{α}$ and $(0.2051, 0.0695, 0.0511)$ for $\hat{β}$ . It remains to establish formally that these are indeed valid standard error estimates, an issue to be addressed in a future paper addressing asymptotic properties of model estimators. Since a formal theory for estimating standard errors under the general model with frailties is under development, we utilize jacknife estimates of the standard errors, which are the standard deviations of the $n$ estimates computed after deleting a unit. For the model without frailties for instance, the jacknife estimates of the standard errors $(j s e)$ for the bladder cancer data are $.056$ for $\hat{α}$ and $(.226, .068, .052)$ for $({\hat{β}}_{1}, {\hat{β}}_{2}, {\hat{β}}_{3})$ , which are close to the estimates obtained from the observed partial likelihood information matrix.

For lack of information about the effective age, we also fitted the general model with frailties assuming a ‘minimal repair’ after each event, $ℰ (s) = s$ . In this situation, the estimates are $\hat{α} = .789 (j s e = .13)$ , $({\hat{β}}_{1}, {\hat{β}}_{2}, {\hat{β}}_{3}) = (- .5743, - .0315, - .2220) [j s e = (.36, .10, .10)]$ , and $\hat{ξ} = .974$ , indicating the importance of the frailty component in this case. When the general model without frailties is fitted to this ‘always minimal repair’ data set, the resulting estimates are $\hat{α} = 1.24 (j s e = .08)$ and $({\hat{β}}_{1}, {\hat{β}}_{2}, {\hat{β}}_{3}) = (- .32, - .03, - .14) [j s e = (.25, .07, .06)]$ . The estimates of the survivor functions for the two effective age specifications are presented in Figure 1. The lower curves (red), corresponding to the placebo group, are obtained by setting $X_{1} = 1$ in the expression given by ${{\hat{\bar{F}}}_{0} (t)}^{exp {{\hat{β}}_{1} X_{1} + {\hat{β}}_{2} {\bar{X}}_{2} + {\hat{β}}_{3} {\bar{X}}_{3}}}$ , while the upper curves (blue) are for the thiotepa group obtained by setting $X_{1} = 2$ . The observed means were ${\bar{X}}_{2} = 2.01$ and ${\bar{X}}_{3} = 2.11$ . The solid curves are for the backward recurrence time effective age, while the dashed curves are for $ℰ (s) = s$ . These plots seem to indicate that the thiotepa group has a higher survival rate than the placebo group, although the statistical significance of this difference depends on which effective age process was used. A question that we will address in future work is the assessment of which effective age process leads to a better fit. This issue is related to model validation and goodness-of-fit aspects of the model.

Fig. 1 — This plot contains estimates of the survivor function for the baldder cancer data set when the model with frailties is fitted. The red curve (lower curve in each line type) is for the placebo group $(X_{1} = 1)$ , while the blue curve is for the thiotepa group $(X_{1} = 2)$ , both evaluated at the mean values of $X_{2}$ and $X_{3}$ . The solid curves are for effective age E $(s) = s - S_{N}$ † $(s -)$ (perfect repair), while the dashed curves are when E $(s) = s$ (minimal repair).

It is of interest to compare the estimates of the regression coefficients from the general model with those obtained using the three existing methods of analysis described in Therneau and Hamilton (1997) and Therneau and Grambsch (2000). Table 4 summarizes the estimates from Andersen-Gill’s (AG) method, Wei, Lin and Weissfeld’s (WLW) marginal method, and Prentice, Williams and Peterson’s (PWP) conditional method as reported in Therneau and Grambsch (2000), together with the estimates obtained from the general model with frailty under these two specifications of the effective age process, $ℰ (s) = s - S_{N^{†} (s -)}$ and $ℰ (s) = s$ . From this table we note the crucial role that the effective age process and the $ρ (k, α) = α^{k}$ component play in this analysis and how they provide some reconciliation of the varied estimates from these different methods. When the effective age process corresponds to perfect repair (in which case $\hat{ξ} \approx \infty$ so that estimates arising from the frailty and no-frailty models coincide) or when the effective age corresponds to minimal repair and the model without frailties is fitted, then the $β$ -estimates from the general model are quite close to those obtained from PWP’s conditional method. On the other hand, when the effective age process corresponds to minimal repair and the model with frailties is fitted, the resulting estimates are close to those obtained from the WLW marginal method. The values from the AG method lie between these two cases. In the situation therefore where the model without frailties is fitted for both types of effective age specifications, it appears that the term $λ [ℰ (s)] α^{N^{†} (s -)}$ in the general model induces a robustness property in the context of estimating the $β$ coefficients. Note that when we assume ‘always perfect repair' the $α$ estimate is less than unity; whereas when we assume ‘always minimal repair' the $α$ estimate is greater than unity (see the discussions in the preceding section pertaining to misspecified effective age process and the impact on the estimation of the baseline hazard and the $α$ parameter). Interestingly, when the general model with frailties is fitted to the ‘always minimal repair' data, the $α$ estimate now becomes less than unity, and the estimate of the frailty parameter $ξ$ is quite close to unity, indicating a strong association among the inter-event times for each subject. The ability of the general model to seemingly explain these varied estimates from these different methods indicates its flexibility and the crucial role of the effective age. Thus, there is a need to monitor this information since in its absence, different methods of analysis may produce varied estimates, which could lead to contradictory conclusions.

Table 4.

Summary of estimates for the bladder data set from the Andersen-Gill (AG), Wei, Lin and Weissfeld (WLW), and Prentice, Williams and Peterson (PWP) methods as reported in Therneau and Grambsch (2000), together with the estimates obtained from the general model using two effective ages corresponding to ‘perfect repairs’ and ‘minimal repairs.’

					General Model
					Perfect ^a	Minimal ^b	Minimal
Term	Param	AG	WLW Marginal	PWP Cond*nal	Both ^c	Frailty ^d	No Frailty ^e
log $N (s -)$	$α$	-	-	-	$.98 (.07)$	$.79 (.13)$	$1.24 (.08)$
Frailty	$ξ$	-	-	-	$\infty$	$.97$	-
rx	$β_{1}$	$- .47 (.20)$	$- .58 (.20)$	$- .33 (.21)$	$- .32 (.21)$	$- .57 (.36)$	$- .32 (.25)$
Size	$β_{2}$	$- .04 (.07)$	$- .05 (.07)$	$- .01 (.07)$	$- .02 (.07)$	$- .03 (.10)$	$- .03 (.07)$
Number	$β_{3}$	$.18 (.05)$	$.21 (.05)$	$.12 (.05)$	$.14 (.05)$	$.22 (.10)$	$.14 (.06)$

Open in a new tab

Effective Age is backward recurrence time $(ℰ (s) = s - S_{N † (s -)})$ .

Effective Age is calendar time $(ℰ (s) = s)$ .

Same results are obtained for either the model with or without frailties.

Reported standard errors are jacknifed estimates.

Another biomedical example pertains to the rehospitalization of patients diagnosed with colorectal cancer. The data, which can be obtained from the gcmrec package in the R Library, provide the calendar times (in days) of the successive hospitalizations after the date of surgery. The first readmission time was considered as the time between the date of the surgical procedure and the first rehospitalization after discharge related to colorectal cancer. Each subsequent readmission time was defined as the difference between the current hospitalization date and the previous discharge date. There were a total of 861 rehospitalization events recorded for the 403 patients included in the analysis. This data set was analyzed in Gonzalez et al. (2005) using a gamma frailty model, which corresponds to the general model with $α = 1$ . Their goal being to determine whether there were differences regarding the time of the recurrent hospitalization due to social-demographic or clinical outcomes. We reanalyze this data set using the full general model where we consider the following variables: tumor stage (Dukes classification: A-B, C or D); whether the patient received chemotherapy; and the distance between the hospital and the patient's residence. We have coded these covariates using dummy variables such that the regression coefficients can be interpreted as follows: $β_{1}$ pertains to patients diagnosed with Dukes C stage, and $β_{2}$ for patients with Dukes D stage; $β_{3}$ for patients who did not receive chemotherapy, and $β_{4}$ for patients whose residence is more than 30 kilometers from the hospital. Since in this case we have no information about the effective age, we assumed the backward recurrence time, $ℰ (s) = s - S_{N † (s -)}$ We fitted the general model without frailties, taking $s * = 2060$ , the maximum follow-up time. The resulting estimates of the parameters, together with the information-based (se) and jacknife (jse) estimates of their standard errors, are $\hat{α} = 1.12 (s e = 0.01; j s e = .13)$ , ${\hat{β}}_{1} = 0.31 (s e = 0.12; j s e = .16)$ , ${\hat{β}}_{2} = 0.93 (s e = 0.14; j s e = .19)$ , ${\hat{β}}_{3} = - 0.12 (s e = 0.11; j s e = .13)$ , and ${\hat{β}}_{4} = - 0.01 (s e = 0.15; j s e = .18)$ . Observe that the information-based and jacknife estimates of the standard errors are somewhat discrepant for this data set. We also fitted the general model with frailties. After 35 iterations the EM algorithm converged. The estimate of the frailty parameter $ξ$ was quite small $[\hat{ξ} = 2.39 (j s e = 3.19)]$ so we conclude that the frailty component of the model is important for these data. The fitted frailty-based model provided the estimates: $\hat{α} = 1.08 (j s e = .14)$ , ${\hat{β}}_{1} = 1.31 (j s e = .17)$ , ${\hat{β}}_{2} = 1.05 (j s e = .20)$ , ${\hat{β}}_{3} = - .14 (j s e = .14)$ , and ${\hat{β}}_{4} = 0.03 (j s e = .23)$ , Based on these results, we conclude that among these covariates, only the advanced tumor stages (C or D) are associated with an elevated risk of rehospitalization. Furthermore, since the estimate of $α$ is larger than unity, there is an indication that each hospitalization increases the risk of further hospitalization.

The next data set, given in Blischke and Murthy (2000) and which was analyzed in Kumar and Klefsjo (1992), concerns hydraulic load-haul-dump (LHD) subsystems used in moving ore and rock in underground mines in Sweden. The data set provides the calendar times (in hours), excluding repair or down times, of the successive failures of $n = 6$ such systems during the two-year development phase. Because the censoring times were not provided, we set $τ_{i} = S_{i K_{i}}$ . The first two machines are the oldest, the second two machines are of medium age, and the last two are relatively new machines. The covariate is the categorized age of the machines, coded as $X = (0, 0)$ denoting old age, $X = (1, 0)$ denoting medium age, and $X = (0, 1)$ denoting young age. For our analysis, we assume that the effective age is the backward recurrence time $ℰ (s) = s - S_{N^{†} (s -)}$ The number of failure events for the six machines are $K = (24, 26, 28, 29, 27, 24)$ . When the general model without frailty is fitted, the resulting parameter estimates are $\hat{α} = 1.0265$ and $({\hat{β}}_{1}, {\hat{β}}_{2}) = (- 0.0764, - 0.0537)$ . The corresponding standard errors, obtained from the estimate of the inverse of the partial likelihood information matrix, are ${\hat{σ}}_{\hat{α}} = 0.0106$ and ${\hat{σ}}_{\hat{β}} = (0.2014, 0.2056)$ These estimates were obtained by setting $s *$ to any value larger than ${max}_{1 \leq i \leq 6} τ_{i} = 4743$ hours. For the general model with gamma frailties, we find $\hat{ξ} = 2.53 \times 10^{28}$ or $\hat{η} \approx 1$ , which indicates the absence of unobserved frailties which would have induced additional heterogeneity among the machines. As a consequence, the estimates of $α$ and ( $β_{1}, β_{2}$ ) were identical to those obtained when the model without frailties was fitted. A very large estimate of $ℰ$ is also obtained if we analyze the data under the assumption of ‘always minimal repair,’ that is, $ℰ (s) = s$ . In this situation, $\hat{α} = 1.014$ ( ${\hat{σ}}_{\hat{α}} = .0244$ and $\hat{β} = (- .1468, - .0520) [{\hat{σ}}_{\hat{β}} = (.2097, .2053)]$ , so the main difference with the previous analysis is in the estimates of the parameter $β_{1}$ .

7 Concluding Remarks

In this paper procedures for estimating the parameters of a general and flexible class of models for recurrent events were developed and their properties examined through computer simulation studies. The class of models, which includes as special cases many well-known models in survival analysis and reliability, possesses the appealing properties that it takes into account the effect of interventions which are administered after each event occurrence through the notion of an effective age, the possible weakening (or strengthening) effect of accumulating event occurrences, the possible presence of unobserved frailties that could be inducing correlations among the inter-event times per unit, and the effect of observable covariates. Some data sets in the biomedical and reliability/engineering settings were reanalyzed using this new class of models. It was found in the simulation studies that an under-specification of the model, in the sense of analyzing a data set generated from the model with frailties using procedures developed from the model without frailties, could have unacceptable consequences in that the resulting estimators will have non-negligible systematic biases. On the other hand, it was found that over-specification of the model may provide a robust method of analysis with an acceptable loss in efficiency. The application of the procedures to the bladder cancer data set also provided a reconciliation of seemingly varied estimates obtained from currently available methods of analyzing recurrent event data, and highlights the importance of monitoring the effective age process.

There are still many interesting and important questions that need to be examined with regards to this general model. The first is the ascertainment of asymptotic properties of the estimators, such as their asymptotic normality or the weak convergence to a Gaussian process of a properly normed estimator of the baseline survivor function. This will be the topic of another paper, and the resolution of this asymptotic problem may require empirical process methods utilized in Murphy (1994, 1995) and Parner (1998); see also the recent paper of Kosorok et al. (2004). Some asymptotic results for specific models subsumed by the general class of models could be found in Peña et al. (2001) and Kvam and Peña (2005). Through such asymptotic analysis we will be able to obtain expressions for approximating analytically the standard errors of the estimators which will reflect the effects of an informative right-censoring mechanism as well as the impact of the sum-quota accrual scheme (see Peña et al. (2001) for the special case of a renewal model).

The problem of validating the class of models after it has been fitted to a specific data set is open, and calls for suitable goodness-of-fit and model validation procedures. For example, in the illustration using the LHD data set, the survivor curve estimate for the medium age group is a little higher than for the new age group, and when one examines the data, there is a long gap in the third machine which might have led to this ordering. A question of interest is whether this particular inter-event time is an outlier. We anticipate that model validation and diagnostics procedures to be developed for this class of models will answer this question. Another issue of interest is in the absence of effective age data, might it have been better to fit a minimal repair effective age function, instead of the perfect repair effective age for this LHD data? This question leads to the recognition that an existing limitation of this class of models is that currently available data sets do not possess information regarding the effective age process. Thus, in applying this model to currently available data sets, we are forced to assume simple forms of the effective age process, such as the imperfect repair or perfect repair models discussed here. This problem of not knowing the effective age was first highlighted in Whitaker and Samaniego (1989), where they pointed out that if the repair modes, hence the effective ages, are not known in the minimal repair model, then the model is nonidentifiable. For the purpose of demonstrating their inference methods using Proschan (1963)'s air-conditioning data, which did not include the mode-of-repairs, they therefore augmented the inter-failure times data with assumed mode-of-repair data to illustrate the estimation of the reliability function. As demonstrated by our simulation studies to assess the impact of mis-specifying the effective age process in relation to the bladder cancer data application, a mis-specification on this effective age could lead to systematic biases on the estimators. It is therefore our hope that researchers will make an effort in assessing the effective age during the data gathering stage of studies. Though it may be potentially difficult to achieve in biomedical settings, such information, if acquired, will prove useful and informative in the modeling and analysis. This somehow calls for a paradigm shift in the data gathering of recurrent event data.

Acknowledgments

E. Peña acknowledges the research support provided by NSF Grant DMS 0102870, NIH Grant GM056182, NIH COBRE Grant RR17698, and the USC/MUSC Collaborative Research Program. E. Slate acknowledges the research support provided by NIH Grant CA077789, NIH COBRE Grant RR17696, DAMD Grant 17-02-1-0138 and the USC/MUSC Collaborative Research Program. J. González acknowledges the research support provided by National Center of Genotyping-CEGEN funded by Genoma España. We also thank the two reviewers, the Associate Editor, and the Editor, for carefully reading the manuscript and for providing us with invaluable comments, criticisms, and suggestions which lead to improvements.

8 Appendix: Partial EM Algorithm

Consider the problem of finding the maximizing values $(ξ θ)$ of a full likelihood function $L_{F} (ξ, θ | D)$ where $D$ is the observed data. We assume that when given $θ$ , maximizing $L_{F} (ξ, θ | D)$ in $ξ$ is practically feasible, but a joint maximization in $(ξ, θ)$ is difficult. We suppose that, when given a value $(ξ,^{(0)} θ^{(0)})$ , we could implement the EM-step to get the next iterate $(ξ_{E M}^{(1)} θ_{E M}^{(1)})$ and in such a way that in the M-step of the EM algorithm, $ξ_{E M}^{(1)}$ and $θ_{E M}^{(1)}$ are obtained via maximization of two separate mappings, one depending only on $ξ$ and the other depending only on $θ$ , such as in our case. In the partial EM algorithm we implemented, $ξ_{E M}^{(1)}$ is replaced by $ξ_{P E M,}^{(1)}$ which is $ξ_{P E M}^{(1)} = {arg max}_{ξ} L_{F} (ξ, θ_{E M}^{(1)} | D)$ , so the next iterate is obtained by setting $(ξ,^{(0)} θ^{(0)}) = (ξ,_{P E M}^{(0)} θ_{E M}^{(0)})$ , and proceeding as described above. That this algorithm will also lead to a maximizing value if the iteration converges follows by observing that the inequalities $L_{F} (ξ^{(0)}, θ^{(0)} | D) \leq L_{F} (ξ_{E M}^{(1)}, θ_{E M}^{(1)} | D) \leq L_{F} (ξ_{P E M}^{(1)}, θ_{E M}^{(1)} | D)$ hold. The first inequality is true because the EM-step guarantees an improved value in the likelihood, whereas the second inequality is immediate from the definition of $ξ_{P E M}^{(1)}$ In our specific implementation, in the notation above, $θ$ will be associated with $(Λ_{0} (\cdot), α, β)$ , while $ξ$ above will be associated with the frailty parameter in our model.

References

Aalen O. Nonparametric inference for a family of counting processes. Annals of Statistics. 1978;6:701–726. [Google Scholar]
Aalen O, Husebye E. Statistical analysis of repeated events forming renewal processes. Statistics in Medicine. 1991;10:1227–1240. doi: 10.1002/sim.4780100806. [DOI] [PubMed] [Google Scholar]
Andersen, P., O. Borgan, R. Gill, and N. Keiding (1993). Statistical Models Based on Counting Processes. New York: Springer-Verlag.
Andersen P, Gill R. Cox’s regression model for counting processes: a large sample study. Annals of Statistics. 1982;10:1100–1120. [Google Scholar]
Baxter L, Kijima M, Tortorella M. A point process model for the reliability of a maintained system subject to general repair. Stochastic Models. 1996;12:37–65. [Google Scholar]
Berman M. Inhomogeneous and modulated gamma processes. Biometrika. 1981;68(1):143–152. [Google Scholar]
Blischke, W. and P. Murthy (2000). Reliability: Modeling, Prediction, and Optimization. New York: Wiley-Interscience.
Block H, Borges W, Savits T. Age-dependent minimal repair. J Appl Prob. 1985;22:51–57. [Google Scholar]
Brown M, Proschan F. Imperfect repair. J Appl Prob. 1983;20:851–859. [Google Scholar]
Cook R, Lawless J. Analysis of repeated events. Statistical Methods in Medical Research. 2002;11:141–166. doi: 10.1191/0962280202sm278ra. [DOI] [PubMed] [Google Scholar]
Cox D. Regression models and life tables (with discussion) Journal of the Royal Statistical Society. 1972a;34:187–220. [Google Scholar]
Cox D. Partial likelihood. Biometrika. 1975;62:269–276. [Google Scholar]
Cox, D. R. (1972b). The statistical analysis of dependencies in point processes. In Stochastic point processes: statistical analysis, theory, and applications (Conf., IBM Res. Center, Yorktown Heights, N.Y., 1971), pp. 55–66. Wiley-Interscience, New York.
Dempster A, Laird N, Rubin D. Maximum likelihood estimation from incomplete data via the em algorithm (with discussion) J Roy Statist Soc B. 1977;39:1–38. [Google Scholar]
Dorado C, Hollander M, Sethuraman J. Nonparametric estimation for a general repair model. Ann Statist. 1997;25:1140–1160. [Google Scholar]
Fleming, T. and D. Harrington (1991). Counting Processes and Survival Analysis. New York: Wiley.
Gail M, Santner T, Brown C. An analysis of comparative carcinogenesis experiments based on multiple times to tumor. Biometrics. 1980;36:255–266. [PubMed] [Google Scholar]
Gill, R. (1980). Censoring and Stochastic Integrals. Amsterdam: Mathematisch Centrum.
Gill RD. Testing with replacement and the product-limit estimator. The Annals of Statistics. 1981;9:853–860. [Google Scholar]
González, J., E. Slate, and E. Peña (2003). The gcmrec Package. http://cran.r-project.org: The Comprehensive R Archive Network.
Gonzalez JR, Fernandez E, Moreno V, Ribes J, Peris M, Navarro M, Cambray M, Borras JM. Sex differences in hospital readmission among colorectal cancer patients. J Epidemiol Community Health. 2005;59(6):506–11. doi: 10.1136/jech.2004.028902. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hollander M, Presnell B, Sethuraman J. Nonparametric methods for imperfect repair models. Ann Statist. 1992;20:879–896. [Google Scholar]
Hougaard, P. (2000). Analysis of Multivariate Survival Data. New York: Springer.
Ihaka R, Gentleman R. R: a language for data analysis and graphics. Journal of Computational and Graphical Statistics. 1996;5:299–314. [Google Scholar]
Jacod J. Multivariate point processes: Predictable projection, Radon-Nikodym derivatives, representation of martingales. Z Wahrsch verw Geb. 1975;31:235–253. [Google Scholar]
Jelinski, J. and P. Moranda (1972). Software reliability research, pp. 465–484. New York: Academic Press.
Kessing L, Olsen E, Andersen P. Recurrence of affective disorders: analyses with frailty models. American Journal of Epidemiology. 1999;149:404–411. doi: 10.1093/oxfordjournals.aje.a009827. [DOI] [PubMed] [Google Scholar]
Kijima M. Some results for repairable systems with general repair. J Appl Prob. 1989;26:89–102. [Google Scholar]
Kosorok M, Lee B, Fine J. Robust inference for univariate proportional hazards frailty regression models. Annals of Statistics. 2004, August;32(4):1448–1491. [Google Scholar]
Kumar D, Klefsjo B. Reliability analysis of hydraulic systems of lhd machines using the power law process model. Reliability Engineering and System Safety. 1992;35:217–224. [Google Scholar]
Kvam P, Peña E. Journal of the American Statistical Association. 2005, March. Estimating load-sharing properties in a dynamic reliability system;100(469):262–272. doi: 10.1198/016214504000000863. [DOI] [PMC free article] [PubMed] [Google Scholar]
Last G, Szekli R. Asymptotic and monotonicity properties of some repairable systems. Adv in Appl Probab. 1998;30:1089–1110. [Google Scholar]
Lawless J. Regression methods for poisson process data. J Amer Statist Assoc. 1987;82:808–815. [Google Scholar]
Lin D, Sun W, Ying Z. Nonparametric estimation of the gap time distribution for serial events with censored data. Biometrika. 1999;86:59–70. [Google Scholar]
Lindqvist BH, Elvebakk G, Heggland K. The trend-renewal process for statistical analysis of repairable systems. Technometrics. 2003;45(1):31–44. [Google Scholar]
Lumley, T. and T. Therneau (2003). The survival Package. http://cran.r-project.org: The Comprehensive R Archive Network.
Murphy S. Consistency in a proportional hazards model incorporating a random effect. The Annals of Statistics. 1994;22:712–731. [Google Scholar]
Murphy S. Asymptotic theory for the frailty model. The Annals of Statistics. 1995;23(1):182–198. [Google Scholar]
Nielsen G, Gill R, Andersen P, Sorensen T. A counting process approach to maximum likelihood estimation in frailty models. Scand J Statist. 1992;19:25–43. [Google Scholar]
Parner E. Asymptotic theory for the correlated gamma frailty model. Annals of Statistics. 1998;26:183–214. [Google Scholar]
Peña, E. and M. Hollander (2004). Mathematical Reliability: An Expository Perspective (eds., R. Soyer, T. Mazzuchi and N. Singpurwalla), Chapter 6. Models for Recurrent Events in Reliability and Survival Analysis, pp. 105–123. Kluwer Academic Publishers.
Peña E, Strawderman R, Hollander M. Nonparametric estimation with recurrent event data. J Amer Statist Assoc. 2001, December;96(456):1299–1315. [Google Scholar]
Peña, E. A., R. L. Strawderman, and M. Hollander (2000). A weak convergence result relevant in recurrent and renewal models. In Recent advances in reliability theory (Bordeaux, 2000), Stat. Ind. Technol., pp. 493–514. Boston, MA: BirkhÄauser Boston.
Prentice R, Williams B, Peterson A. On the regression analysis of multivariate failure time data. Biometrika. 1981;68:373–379. [Google Scholar]
Prentice RL, Self SG. Asymptotic distribution theory for Cox-type regression models with general relative risk form. Ann Statist. 1983;11(3):804–813. [Google Scholar]
Presnell B, Hollander M, Sethuraman J. Testing the minimal repair assumption in an imperfect repair model. J Amer Statist Assoc. 1994;89:289–297. [Google Scholar]
Proschan F. Theoretical explanation of observing decreasing failure rate. Techno-metrics. 1963;5:375–383. [Google Scholar]
Self SG, Prentice RL. Commentary on: "Cox's regression model for counting processes: a large sample study" [Ann. Statist. 10 (1982), no. 4, 1100–1120; MR 84c:62054a] by P. K. Andersen and R. D. Gill. Ann Statist. 1982;10(4):1121–1124. [Google Scholar]
Sellke, T. (1988). Weak convergence of the Aalen estimator for a censored renewal process. In Statistical Decision Theory and Related Topics IV (eds., S. Gupta and J. Berger) 2, 183–194.
Stadje W, Zuckerman D. Optimal maintenance strategies for repairable systems with general degree of repair. J Appl Prob. 1991;28:384–396. [Google Scholar]
Therneau, T. and P. Grambsch (2000). Modeling Survival Data: Extending the Cox Model.New York: Springer.
Therneau T, Hamilton S. rhdnase as an example of recurrent event analysis. Statistics in Medicine. 1997;16:2029–2047. doi: 10.1002/(sici)1097-0258(19970930)16:18<2029::aid-sim637>3.0.co;2-h. [DOI] [PubMed] [Google Scholar]
Wang MC, Chang SH. Nonparametric estimation of a recurrent survival function. Journal of the American Statistical Association. 1999;94:146–153. doi: 10.1080/01621459.1999.10473831. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wei L, Lin D, Weissfeld L. Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. J Amer Statist Assoc. 1989;84:1065– 1073. [Google Scholar]
Whitaker L, Samaniego F. Estimating the reliability of systems subject to imperfect repair. J Amer Statist Assoc. 1989;84:301–309. [Google Scholar]

[R1] Aalen O. Nonparametric inference for a family of counting processes. Annals of Statistics. 1978;6:701–726. [Google Scholar]

[R2] Aalen O, Husebye E. Statistical analysis of repeated events forming renewal processes. Statistics in Medicine. 1991;10:1227–1240. doi: 10.1002/sim.4780100806. [DOI] [PubMed] [Google Scholar]

[R3] Andersen, P., O. Borgan, R. Gill, and N. Keiding (1993). Statistical Models Based on Counting Processes. New York: Springer-Verlag.

[R4] Andersen P, Gill R. Cox’s regression model for counting processes: a large sample study. Annals of Statistics. 1982;10:1100–1120. [Google Scholar]

[R5] Baxter L, Kijima M, Tortorella M. A point process model for the reliability of a maintained system subject to general repair. Stochastic Models. 1996;12:37–65. [Google Scholar]

[R6] Berman M. Inhomogeneous and modulated gamma processes. Biometrika. 1981;68(1):143–152. [Google Scholar]

[R7] Blischke, W. and P. Murthy (2000). Reliability: Modeling, Prediction, and Optimization. New York: Wiley-Interscience.

[R8] Block H, Borges W, Savits T. Age-dependent minimal repair. J Appl Prob. 1985;22:51–57. [Google Scholar]

[R9] Brown M, Proschan F. Imperfect repair. J Appl Prob. 1983;20:851–859. [Google Scholar]

[R10] Cook R, Lawless J. Analysis of repeated events. Statistical Methods in Medical Research. 2002;11:141–166. doi: 10.1191/0962280202sm278ra. [DOI] [PubMed] [Google Scholar]

[R11] Cox D. Regression models and life tables (with discussion) Journal of the Royal Statistical Society. 1972a;34:187–220. [Google Scholar]

[R12] Cox D. Partial likelihood. Biometrika. 1975;62:269–276. [Google Scholar]

[R13] Cox, D. R. (1972b). The statistical analysis of dependencies in point processes. In Stochastic point processes: statistical analysis, theory, and applications (Conf., IBM Res. Center, Yorktown Heights, N.Y., 1971), pp. 55–66. Wiley-Interscience, New York.

[R14] Dempster A, Laird N, Rubin D. Maximum likelihood estimation from incomplete data via the em algorithm (with discussion) J Roy Statist Soc B. 1977;39:1–38. [Google Scholar]

[R15] Dorado C, Hollander M, Sethuraman J. Nonparametric estimation for a general repair model. Ann Statist. 1997;25:1140–1160. [Google Scholar]

[R16] Fleming, T. and D. Harrington (1991). Counting Processes and Survival Analysis. New York: Wiley.

[R17] Gail M, Santner T, Brown C. An analysis of comparative carcinogenesis experiments based on multiple times to tumor. Biometrics. 1980;36:255–266. [PubMed] [Google Scholar]

[R18] Gill, R. (1980). Censoring and Stochastic Integrals. Amsterdam: Mathematisch Centrum.

[R19] Gill RD. Testing with replacement and the product-limit estimator. The Annals of Statistics. 1981;9:853–860. [Google Scholar]

[R20] González, J., E. Slate, and E. Peña (2003). The gcmrec Package. http://cran.r-project.org: The Comprehensive R Archive Network.

[R21] Gonzalez JR, Fernandez E, Moreno V, Ribes J, Peris M, Navarro M, Cambray M, Borras JM. Sex differences in hospital readmission among colorectal cancer patients. J Epidemiol Community Health. 2005;59(6):506–11. doi: 10.1136/jech.2004.028902. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] Hollander M, Presnell B, Sethuraman J. Nonparametric methods for imperfect repair models. Ann Statist. 1992;20:879–896. [Google Scholar]

[R23] Hougaard, P. (2000). Analysis of Multivariate Survival Data. New York: Springer.

[R24] Ihaka R, Gentleman R. R: a language for data analysis and graphics. Journal of Computational and Graphical Statistics. 1996;5:299–314. [Google Scholar]

[R25] Jacod J. Multivariate point processes: Predictable projection, Radon-Nikodym derivatives, representation of martingales. Z Wahrsch verw Geb. 1975;31:235–253. [Google Scholar]

[R26] Jelinski, J. and P. Moranda (1972). Software reliability research, pp. 465–484. New York: Academic Press.

[R27] Kessing L, Olsen E, Andersen P. Recurrence of affective disorders: analyses with frailty models. American Journal of Epidemiology. 1999;149:404–411. doi: 10.1093/oxfordjournals.aje.a009827. [DOI] [PubMed] [Google Scholar]

[R28] Kijima M. Some results for repairable systems with general repair. J Appl Prob. 1989;26:89–102. [Google Scholar]

[R29] Kosorok M, Lee B, Fine J. Robust inference for univariate proportional hazards frailty regression models. Annals of Statistics. 2004, August;32(4):1448–1491. [Google Scholar]

[R30] Kumar D, Klefsjo B. Reliability analysis of hydraulic systems of lhd machines using the power law process model. Reliability Engineering and System Safety. 1992;35:217–224. [Google Scholar]

[R31] Kvam P, Peña E. Journal of the American Statistical Association. 2005, March. Estimating load-sharing properties in a dynamic reliability system;100(469):262–272. doi: 10.1198/016214504000000863. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] Last G, Szekli R. Asymptotic and monotonicity properties of some repairable systems. Adv in Appl Probab. 1998;30:1089–1110. [Google Scholar]

[R33] Lawless J. Regression methods for poisson process data. J Amer Statist Assoc. 1987;82:808–815. [Google Scholar]

[R34] Lin D, Sun W, Ying Z. Nonparametric estimation of the gap time distribution for serial events with censored data. Biometrika. 1999;86:59–70. [Google Scholar]

[R35] Lindqvist BH, Elvebakk G, Heggland K. The trend-renewal process for statistical analysis of repairable systems. Technometrics. 2003;45(1):31–44. [Google Scholar]

[R36] Lumley, T. and T. Therneau (2003). The survival Package. http://cran.r-project.org: The Comprehensive R Archive Network.

[R37] Murphy S. Consistency in a proportional hazards model incorporating a random effect. The Annals of Statistics. 1994;22:712–731. [Google Scholar]

[R38] Murphy S. Asymptotic theory for the frailty model. The Annals of Statistics. 1995;23(1):182–198. [Google Scholar]

[R39] Nielsen G, Gill R, Andersen P, Sorensen T. A counting process approach to maximum likelihood estimation in frailty models. Scand J Statist. 1992;19:25–43. [Google Scholar]

[R40] Parner E. Asymptotic theory for the correlated gamma frailty model. Annals of Statistics. 1998;26:183–214. [Google Scholar]

[R41] Peña, E. and M. Hollander (2004). Mathematical Reliability: An Expository Perspective (eds., R. Soyer, T. Mazzuchi and N. Singpurwalla), Chapter 6. Models for Recurrent Events in Reliability and Survival Analysis, pp. 105–123. Kluwer Academic Publishers.

[R42] Peña E, Strawderman R, Hollander M. Nonparametric estimation with recurrent event data. J Amer Statist Assoc. 2001, December;96(456):1299–1315. [Google Scholar]

[R43] Peña, E. A., R. L. Strawderman, and M. Hollander (2000). A weak convergence result relevant in recurrent and renewal models. In Recent advances in reliability theory (Bordeaux, 2000), Stat. Ind. Technol., pp. 493–514. Boston, MA: BirkhÄauser Boston.

[R44] Prentice R, Williams B, Peterson A. On the regression analysis of multivariate failure time data. Biometrika. 1981;68:373–379. [Google Scholar]

[R45] Prentice RL, Self SG. Asymptotic distribution theory for Cox-type regression models with general relative risk form. Ann Statist. 1983;11(3):804–813. [Google Scholar]

[R46] Presnell B, Hollander M, Sethuraman J. Testing the minimal repair assumption in an imperfect repair model. J Amer Statist Assoc. 1994;89:289–297. [Google Scholar]

[R47] Proschan F. Theoretical explanation of observing decreasing failure rate. Techno-metrics. 1963;5:375–383. [Google Scholar]

[R48] Self SG, Prentice RL. Commentary on: "Cox's regression model for counting processes: a large sample study" [Ann. Statist. 10 (1982), no. 4, 1100–1120; MR 84c:62054a] by P. K. Andersen and R. D. Gill. Ann Statist. 1982;10(4):1121–1124. [Google Scholar]

[R49] Sellke, T. (1988). Weak convergence of the Aalen estimator for a censored renewal process. In Statistical Decision Theory and Related Topics IV (eds., S. Gupta and J. Berger) 2, 183–194.

[R50] Stadje W, Zuckerman D. Optimal maintenance strategies for repairable systems with general degree of repair. J Appl Prob. 1991;28:384–396. [Google Scholar]

[R51] Therneau, T. and P. Grambsch (2000). Modeling Survival Data: Extending the Cox Model.New York: Springer.

[R52] Therneau T, Hamilton S. rhdnase as an example of recurrent event analysis. Statistics in Medicine. 1997;16:2029–2047. doi: 10.1002/(sici)1097-0258(19970930)16:18<2029::aid-sim637>3.0.co;2-h. [DOI] [PubMed] [Google Scholar]

[R53] Wang MC, Chang SH. Nonparametric estimation of a recurrent survival function. Journal of the American Statistical Association. 1999;94:146–153. doi: 10.1080/01621459.1999.10473831. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R54] Wei L, Lin D, Weissfeld L. Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. J Amer Statist Assoc. 1989;84:1065– 1073. [Google Scholar]

[R55] Whitaker L, Samaniego F. Estimating the reliability of systems subject to imperfect repair. J Amer Statist Assoc. 1989;84:301–309. [Google Scholar]

PERMALINK

Semiparametric Inference for a General Class of Models for Recurrent Events

Edsel A Peña

Elizabeth H Slate

Juan R González

Abstract

1 Introduction

2 A General Class of Models

Example 2.1

Example 2.2

Example 2.3

Example 2.4

Example 2.5

3 Estimation of Parameters: Model without Frailties

4 Estimation of Parameters: Model with Frailties

Step 0 (Initialization)

Step 1 (E-step)

Step 2 (M-step #1)

Step 3 (M-step #2)

Step 4 (M-step #3)

Step 5 (Convergence)

5 Properties of Estimators

5.1 Simulation Design

Sample Size

Censoring Mechanism

ρ Function

Effective Age Function

Baseline Survivor Function

Covariates

Frailty Component

5.2 Discussions of Simulation Results

Table 1.

Table 3.

Table 2.

6 Applications to Real Data

Fig. 1.

Table 4.

7 Concluding Remarks

Acknowledgments

8 Appendix: Partial EM Algorithm

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases