Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Jun 13.
Published in final edited form as: Biometrics. 2008 Nov 13;65(3):753–761. doi: 10.1111/j.1541-0420.2008.01157.x

Flexible Estimation of Differences in Treatment-Specific Recurrent Event Means in the Presence of a Terminating Event

Qing Pan 1, Douglas E Schaubel 2
PMCID: PMC4465273  NIHMSID: NIHMS694117  PMID: 19053997

Summary

In this report, we consider the setting where the event of interest can occur repeatedly for the same subject (i.e., a recurrent event; e.g., hospitalization) and may be stopped permanently by a terminating event (e.g., death). Among the different ways to model recurrent/terminal event data, the marginal mean (i.e., averaging over the survival distribution) is of primary interest from a public health or health economics perspective. Often, the difference between treatment-specific recurrent event means will not be constant over time, particularly when treatment-specific differences in survival exist. In such cases, it makes more sense to quantify treatment effect based on the cumulative difference in the recurrent event mean, as opposed to the instantaneous difference in the rate. We propose a method that compares treatments by separately estimating the survival probabilities and recurrent event rate given survival, then integrating to get the mean number of events. The proposed method combines an additive model for the conditional recurrent event rate and a proportional hazards model for the terminating event hazard. The treatment effects on survival and on recurrent event rate among survivors are estimated in constructing our measure and explain the mechanism generating the difference under study. The example which motivates this research is the repeated occurrence of hospitalization among kidney transplant recipients, where the effect of Expanded Criteria Donor (ECD) compared to non-ECD kidney transplantation on the mean number of hospitalizations is of interest.

Keywords: Additive rates model, Competing risks, Marginal mean, Proportional hazards model, Rate regression, Semiparametric model

1. Introduction

Recurrent events are frequently of interest in clinical and epidemiologic studies. Examples include repeated infections among HIV patients and multiple hospitalizations in a health services utilization study. A large variety of semiparametric recurrent event models exist in the literature. These methods can generally be classified as intensity models or mean/rate models. Intensity models consider the instantaneous probability of event occurrence conditional on the event history (e.g., Prentice, Williams and Peterson, 1981; Andersen and Gill, 1982), while mean/rate (the rate being the derivative of mean) models consider the marginal mean number of events (e.g., Pepe and Cai, 1993; Lawless and Nadeau, 1995; Lin, Wei, Yang and Ying, 2000). Depending on the assumed form of the covariate effects, recurrent event models can be classified as multiplicative or additive. Proportional means/rates models assume covariate effects on a multiplicative scale, while additive models assume them on an additive scale.

Often in biomedical applications, the recurrent event sequence can be stopped permanently by a terminating event (e.g., death). Various approaches have been proposed for modelling recurrent events in the presence of a terminating event (e.g., Li and Lagakos, 1997; Cook and Lawless, 1997; Ghosh and Lin, 2000; Ghosh and Lin, 2002), and this area has attracted much attention recently (e.g., Liu, Wolfe and Huang, 2004; Huang and Wang, 2004; Ye, Kalbfleisch and Schaubel, 2007). The existing approaches generally fall into one of three categories. First, there are methods for modelling the marginal mean number of events (e.g., Ghosh and Lin, 2000; Ghosh and Lin, 2002). In this case, the mean averages over surviving and deceased subjects. In the second category of methods, one models the conditional recurrent event rate given survival. This is among the approaches suggested by Cook and Lawless (1997), and was mentioned by Lin et al. (2000). This method has been applied quite frequently (e.g., Schaubel and Cai, 2005). A variation of this approach employs a latent (frailty) variable (e.g., Liu et al., 2004; Ye et al., 2007), conditional on which the recurrent and terminal events are assumed independent. The marginal and conditional methods explicitly acknowledge the fact that subjects no longer experience recurrent events after death and that time to death may differ among the groups being compared. The third approach considers death to be a censoring event, with the recurrent events essentially being a latent process unobservable after death (e.g., Ghosh and Lin, 2003).

The example motivating our method is to compare the mean number of hospitalizations for transplant recipients with an Expanded Criteria Donor (ECD) kidney and patients transplanted with a non-ECD kidney. An ECD is defined (Port et al., 2002) as a deceased donor either (i) age ≥ 60, or (ii) age 50 – 59 and with at least two of the following three characteristics: hypertensive, serum creatinine concentration > 1.5mg/dl, or death due to stroke. The ECD (0, 1) classification is a well-accepted quality index for donated kidneys in the nephrology community. In fact, wait-listed patients willing to accept an ECD kidney are essentially listed separately and generally have reduced waiting time until transplant. The relative frequency of ECD transplantation has increased over time, and several authors have examined the impact of an ECD on the post-transplant death hazard. Our analysis targets the impact of ECD transplantation on the mean number of post-transplant hospitalizations. Hospitalization frequency is a composite index of a patient’s health status and resource utilization, and therefore serves as a concrete and objective measure of post-transplant performance. Among transplant recipients, hospitalizations can occur repeatedly for the same patient and are often terminated by death.

Ghosh and Lin (2000) proposed nonparametric tests for treatment-specific differences with respect to the marginal mean number of recurrent events. The Kaplan-Meier estimator for survival probabilities and a Nelson-Aalen estimator of the conditional recurrent event rate function given survival are integrated to calculate the recurrent event means. Ghosh and Lin (2002) later proposed a semiparametric proportional means model. The recurrent event rate after death was still taken to be zero, and the marginal rates in the outcome are implicitly averaged over the recurrent event rates of living and deceased subjects.

Often investigators are more interested in the absolute difference between recurrent event means (as opposed to their ratio) which suggests using an additive (as opposed to multiplicative) model. Schaubel, Zeng and Cai (2006) extended the method of Lin and Ying (1994) to develop an additive rates model, but it was not designed to handle terminating events. One could develop an extension of Schaubel et al. (2006) to accommodate terminating events. However, treatment-specific differences in mean number of events are often not constant over time, particularly when the treatment-specific survival functions differ. That is, as follow-up time, t, increases, due to subjects dying, the composition of the study population will shift and the pattern of the shift will be treatment-specific if treatment affects survival. Therefore, treatment effects on the recurrent event mean would not be expected to be constant over time in the presence of treatment effects on survival. In cases where the treatment effect depends on follow-up time, the cumulative effect is of much greater interest for patient decision-making (at time t = 0) than the instantaneous effect.

In this article, we propose a novel semiparametric method for comparing treatment-specific recurrent event mean functions in the presence of a terminating event. The proposed method estimates the treatment effect as a process over time, without forcing the effect to be a constant difference or ratio. The method combines a proportional hazards model for the terminating event and an additive model for the conditional recurrent event rate given survival, with the treatment effects measured by differences in treatment-specific recurrent event means.

In comparing treatments, investigators are often interested in differences between treatment-specific means, which motivates our measure. However, under this method, an increase in the event mean could be due to patients surviving longer, or could be due to patients experiencing events at an increased rate while they survive, and the proposed estimator alone cannot distinguish between these two phenomena. Researchers wanting to further examine the nature of the treatment effect on the event mean would examine separately the treatment effects on the terminating event hazard and the conditional event rates given survival. This additional step is always necessary even when the proposed measure turns out to be non-significant since a lower survival probability can serve to offset higher event rate among survivors, and hence cancel out the treatment effect on the mean.

The remainder of this article is organized as follows. Section 2 describes the proposed treatment effect measure and methods of estimation. Asymptotic properties of the estimator are listed in Section 3, with proofs provided in Web Appendix A and B. Section 4 evaluates the performance of the asymptotic results in moderate size samples. We compare mean numbers of hospitalizations among kidney transplant recipients using the proposed methods in Section 5. A discussion and some concluding remarks are contained in Section 6.

2. Proposed Methods

In this section, we describe the proposed methods, after establishing notation and explaining the data structure of interest.

2.1 Notation and Set-up

We begin by setting up the necessary notation. We use Di for the time of the terminating event (e.g., death), and Ci for the censoring time. The observed counting process for the terminating event is represented by NiD(t)=I(Dit,Di<Ci). We let Yi(t) = I(CiDit) be the at risk indicator for subject i at time t, and set π^(t)=n1i=1nYi(t). The number of recurrent events as of time t is represented by NiR*(t)=0tdNiR*(s), where dNiR*(s)=NiR*(s)NiR*(s). In the data structure under consideration, NiR*(t)=NiR*(tDi); that is, recurrent events can not occur after death. What we observe is a quantity subject to right censoring NiR(t)=NiR*(tDiCi). For the ith subject (i = 1, ⋯, n), we let Zi and Xi denote the vector of covariates for the terminating event model and recurrent event model respectively. We set Zi=(Zi1,Zi2),Xi=(Xi1,Xi2) where Zi1, Xi1 are (1/0) indicators for the treatment/placebo and Zi2 and Xi2 represent adjustment covariates. Correspondingly, we let β0=(β01,β02) and θ0=(θ01,θ02) represent the regression coefficients for Zi and Xi. In addition, for convenience we denote the covariate vectors for a treated subject by Zi1=(1,Zi2) and Xi1=(1,Xi2). Similarly, for a placebo subject, Zi0=(0,Zi2) and Xi0=(0,Xi2). The objective is to compare the treatment and placebo with respect to the marginal mean number of recurrent events. That is, the outcome being modeled is the mean number of recurrent events E{NiR*(t)|Zi}, which is the integral of recurrent event rate function over time E{NiR*(t)|Zi}=0tE{dNiR*(s)|Zi}. It should be mentioned that we are not modeling the recurrent event intensity function, E{dNiR*(t)|Zi,𝒩iR*(t)}, where 𝒩iR*(t)={NiR*(s);s(0,t]}, representing the event history up to time t, an approach considered for example by Andersen and Gill (1982).

As described in Section 1, in the presence of a terminating event, the recurrent event mean is a function of survival probability and the conditional recurrent event rate given survival. In estimating the mean function, it is natural to model these two entities separately. We denote the conditional recurrent event rate given survival by

dRi(t)=E{dNiR*(t)|Di>t},

with the recurrent event mean function then given by

μi(t)=0tSi(u)dRi(u),

where Si(t) = Pr(Di > t). We assume that the terminating event hazard function follows a proportional hazards model,

dΛi(t)=dΛ(t|Zi)=dΛ0(t)eβ0Zi, (1)

where dΛ0(t) is the unspecified baseline hazard function and the vector β0 represents the true value of the regression coefficients. The parameter β0 is estimated by β̂, the solution to the partial likelihood score function, UD(β) = 0, where

UD(β)=i=1n0τ{ZiZ¯(t;β)}dNiD(t)
Z¯(β;t)=S(1)(t;β)S(0)(t;β)
S(k)(t:β)=n1i=1nYi(t)eβZiZik,

with Zi0=1,Zi1=Zi and Zi2=ZiZi. The Breslow-Aalen baseline hazard estimator, Λ̂0(t; β̂), is employed, where

Λ^0(t;β)=n1i=1n0tS(0)(s;β)1dNiD(s).

The subject-specific survival function is then estimated by Ŝi(t) = exp{−Λ̂0(t)eβ̂′Zi}.

Both multiplicative and additive models have been used in modeling the recurrent event mean. For certain outcomes such as costs or number of hospitalizations, investigators are typically more interested in the absolute (as opposed to relative) dollar amount or hospital admission numbers. As such, we use an additive model to estimate the recurrent event rates among surviving subjects,

dRi(t)=dR(t|Xi)=dR0(t)+θ0Xidt, (2)

where dR0(t) is the baseline recurrent event rate function and the vector θ0 represents the true additive effects of the corresponding covariate vector Xi. Usually, there are common covariates affecting both the terminating event hazard and recurrent event rates. As such, the covariate vectors Zi and Xi usually overlap and will often be identical. Adapting the model of Schaubel et al. (2006) to the recurrent/terminal event setting, we estimate θ0 as follows:

θ^=B^1UR
B^=i=1n0τYi(s){XiX¯(s)}2ds
X¯(s)=π^(s)1n1i=1nYi(s)Xi
UR=i=1n0τ{XiX¯(t)}dNiR(t).

Models for the conditional event rate, dRi(t), and the terminating event hazard, dΛi(t), assume covariate effects which are constant over time, t. Two points are important in this regard. First, unlike μi(t), covariate effects on both dRi(t) and dΛi(t) will often be constant over time in practice. In a sense, the recurrent event rate conditional on survival, dRi(t), is an easier measure to model directly, compared to the marginal recurrent event mean, since the survival population is more homogeneous than the combined population of both living and deceased subjects. The model dΛi(t) is a standard Cox model frequently employed in biomedical studies, while the quantity dRi(t) is analogous to the cause-specific hazard in competing risk studies. Second, models (1) and (2) could always be extended to allow for time-varying effects, and the procedures proposed in this paper would still be applicable.

2.2 Proposed Measure and its Estimator

We propose a treatment effect measure which is the difference in treatment-specific marginal recurrent event means. The means are given by

μ1(t)=E[NiR(t)|Zi1=1]
μ0(t)=E[NiR(t)|Zi1=0].

We model dRi(t) and dΛi(t) separately, compute fitted means, then average over the observed adjustment covariates. From this perspective, it is useful to write the treatment-specific marginal means as

μj(t)=E[E[NiR(t)|Zij,Xij]]=E[0tS(r|Zij)dR(r|Xij)],

for j = 0, 1. For the iterated expectations above, the inner one is the expected value of NiR(t) conditional on a specific set of Zij,Xij values and the outer marginal expectation is taken with respect to the distribution of adjustment covariates (Zi2,Xi2). By substituting in survival and conditional rate function estimators, we obtain the proposed treatment-specific mean estimators:

μ^1(t)=n1i=1n0tŜ(r|Zi1){dR^0(r)+(θ^1+Xi2θ^2)dr}
μ^0(t)=n1i=1n0tŜ(r|Zi0){dR^0(r)+Xi2θ^2dr},

where the baseline rate estimator is given by R^0(t)=n1i=1n0tπ^(s)1Yi(s){dNiR(s)θ^Xids}. The treatment effect on the recurrent event mean can be measured by

ψ(t)=μ1(t)μ0(t),

which can be estimated as

ψ^(t)=μ^1(t)μ^0(t)

for t ∈ (0, τ] using Ŝ(t|Zi) from the proportional hazards model and dR̂0(t), θ̂1 and θ̂2 from the additive conditional rates model.

2.3 Analyzing Treatment Effects

Based on the proposed measure alone, it is not clear if an estimated treatment effect is the result of treatment-specific differences in survival and/or from treatment-specific differences in the conditional event rate given survival. For example, the marginal recurrent event mean could be greater for treated patients than placebo patients because either (i) conditional on survival the event rates are equal, but treated subjects live longer; or (ii) treated and placebo subjects have approximately equal survival, but dR(t|Xi1)>dR(t|Xi0). Therefore, the hazard ratio for the terminating event (eβ1) and the rate difference for recurrent events among survivors (θ1) provide important supplemental information. Collectively, the three estimators ψ̂(t), eβ̂1 and θ̂1 describe treatment effects from each of these angles.

3. Asymptotic Properties

In this section, we describe the essential asymptotic properties of the proposed estimators. We begin by listing the assumed regularity conditions, for i = 1, …, n.

  1. {NiR*(·),Di,Ci,Zi,Xi} are independent and identically distributed.

  2. E[dNiR*(t)|Di>t,Ci>t,Xi]=E[dNiR*(t)|Di>t,Xi].

  3. limδ01δPr{tDi<t+δ|Di>t,Ci>t,Zi}=limδ01δPr{tDi<t+δ|Di>t,Zi}.

  4. Pr(Yi(τ) = 1) > 0.

  5. 0τdΛ0(t)<,0τdR0(t)< and NiR(τ)<.

  6. Elements of Zi2 and Xi2 are bounded almost surely.

  7. Positive-definiteness of the matrices, A(β) and B, where
    A(β)=E[0τ{Ziz¯(t;β)}2Yi(t)eβZidΛ0(t)]
    Z¯(t;β)a.s.z¯(t;β)
    B=E[0τYi(s){Xix¯(s)}2ds]
    X¯(t)a.s.x¯(t).

Condition (a) is the basis for the Central Limit Theorem and is usually satisfied; an exception would be clustered data. Conditions (b) and (c) correspond to independent censoring assumptions for the recurrent and terminating event processes, respectively. Condition (d) is a standard identifiability condition.

It should be noted that our methods also assume that there are no unmeasured factors which predict both dRi(t) and dΛi(t). The potential impact resulting from violations of the no-unmeasured-predictors assumption is discussed and evaluated empirically in Section 5.

We now describe the essential asymptotic results for the proposed estimators, with proofs outlined in Web Appendix A–B.

Theorem 1

Under conditions (a) to (g), ψ̂ is a uniformly consistent estimator of ψ; that is, ψ^(t)a.s.ψ(t) for t ∈ (0, τ].

The proof of Theorem 1 proceeds through a Taylor Series expansion, followed by repeated application of the Uniform Strong Law of Large Numbers (USLLN).

Theorem 2

Under conditions (a) to (g), n12{ψ^(t)ψ(t)} converges weakly to a zero-mean Gaussian process with covariance function E[{ξi(s; β0, θ0) − ξi(s; β0, θ0)}{ξi(t; β0, θ0) − ξi(t; β0, θ0)}], where

ξij·(t;β,θ)=ξij1(t;β,θ)+ξij2(t;β,θ)+ξij3(t;β,θ)+ξij4(t;β,θ)
ξij1(t;β,θ)=E[eβZk00tS(u|Zk0)0u{Zk0z¯(r;β)}dΛ0(r)dR(u|Xkj)]A(β)1UiD(β)
ξij2(t;β,θ)=E[0tS(u|Zkj){Xkjx¯(u)}du]B1UiR(θ)
ξij3(t;β,θ)=0tE{S(u|Zkj)}π(u)1dMiR(u;θ)
ξij4(t;β,θ)=0tE[eβZkj{μ(t|Zkj,Xkj)μ(r|Zkj,Xkj)}]dMiD(r;β)s(0)(r;β),

where k indexes a subject, with subject-specific asymptotic score contributions given by

UiD(β)=0τ{Ziz¯(t;β)}dMiD(t;β)
dMiD(t;β)=dNiD(t)Yi(t)eβZidΛ0(t)
UiR(θ)=0t{Xix¯(u)}dMiR(u;θ)
MiR(t;θ)=NiR(t)0tYi(u){dR0(u)+θXidu} (3)
s(d)(t;β)=limnS(d)(t;β),

for d = 0, 1, 2. Since n12(ψ^ψ)=n12(μ^1μ1)n12(μ^0μ0), we work on n12(μ^1μ1) and n12(μ^0μ0) separately as follows:

n12{μ^j(t;β^,θ^)μj(t)}=n12{μ^j(t;β^,θ^)μ^j(t;β0,θ^)}+n12{μ^j(t;β0,θ^)μ^j(t;β0,θ0)}+n12i=1n0tŜ(r|Zij){dR^0(r;θ0)dR0(r)}+n12i=1n0t{Ŝ(r;β0|Zij)S(r|Zij)}{dR0(r)+θ0Xijdr}.

for j = 0, 1. Taylor series expansions are applied, along with several applications of the Strong Law of Large Numbers (Sen and Singer 1993). Results from each of the four parts are combined to show that n12{μ^j(t)μj(t)} is asymptotically equivalent to n12i=1nξij·(t;β0,θ0). After taking the difference of n12(μ^1μ1) and n12(μ^0μ0),n12i=1n{ξi1·(t;β0,θ0)ξi0·(t;β0,θ0)} can be shown to converge to a zero-mean Normal distribution for fixed t by the Central Limit Theorem. A demonstration of tightness completes the proof of weak convergence using results from empirical process theory (Pollard, 1990; van der Vaart and Wellner, 1996). The ξij·(s; β0, θ0) quantities can be consistently estimated by replacing all limiting values by their empirical counterparts, then averaging across i = 1, ⋯, n.

4. Simulation Study

For each data configuration, we generated n = 200 independent and identically distributed subjects with both terminating and recurrent events. The terminating event hazard follows the following proportional hazards model,

dΛ(t|Zi)=dΛ0(t)eβ1Zi1+β2Zi2,

where Zi1 (treatment) is distributed as Bernoulli (0.5), the adjustment covariate Zi2 follows a Uniform (0, 10) distribution and λ0 = 0.04. We set the coefficient β1 at 0.5 and 1, to examine scenarios of low or high treatment effect on survival. Censoring is uniformly distributed on (0, 20) which leads to an average censoring percentage of approximately 42%. The recurrent events follow a Poisson process, with rate function

dR(t|Xi,Qi)=dR0(t)+Qidt+θXidt, (4)

where Qi follows a Gamma distribution with mean 0.25 and variance V (Qi), and V (Qi) = 0.5 or 1. The Qi variate represents a frailty term shared by all the recurrent event times for the same subject and may be thought of as an unmeasured predictor. Note that the frailty term Qi does not affect dΛ(t | Zi) and the assumptions of our proposed methods hold under this set-up. The above model was simulated by generating gap times between successive events as:

Ti,j+1=Ti,jlog(Uij){dR0(t)+Qi+θXi}1,

for j = 0, ⋯, 50, where Uij ~ Unif(0, 1), Xi = Zi1 and Ti,0 ≡ 0. We varied the baseline recurrent event rate from 0.125 to 0.25. The covariate Xi is the same as Zi1 in the proportional hazards model for the terminating event, both representing the treatment or exposure of interest. The regression coefficient for Xi is set at θ = 0.5.

Table 1 lists the performance of our proposed estimator ψ̂(t) in eight scenarios of different β1, V (Qi) and dR0(t) combinations. The average observed number of recurrent events per subject ranges from 3.0 to 4.4. Three evenly spaced time points 5, 10 and 15 are picked to examine the performance of the estimators at early, middle and late follow-up times. For all configurations examined, ψ̂(t) is very close to their true values (obtained by numerical integration), and average asymptotic standard errors (ASE) agree well with empirical standard deviations (ESD). Correspondingly, empirical coverage probabilities (CP) are close to the nominal value of 0.95. Overall, the asymptotic properties appear to be applicable to moderate size samples based on the simulation results.

Table 1.

Simulation Results: Performance of Proposed Estimator

β1 V(Qi) dR0(t) t ψ(t) bias ESD ASE CP
1 0.5 0.25 5 1.16 0.02 0.34 0.34 0.95
10 1.01 0.05 0.57 0.56 0.95
15 0.53 0.06 0.74 0.74 0.95
0.125 5 1.28 0.01 0.34 0.33 0.95
10 1.34 0.02 0.54 0.54 0.95
15 1.05 0.04 0.70 0.71 0.95
1 0.25 5 1.16 −0.01 0.45 0.43 0.95
10 1.01 0.00 0.74 0.71 0.94
15 0.53 0.03 0.96 0.91 0.94
0.125 5 1.28 0.00 0.44 0.43 0.96
10 1.34 0.02 0.71 0.69 0.95
15 1.05 0.05 0.89 0.89 0.95
0.5 0.5 0.25 5 1.71 −0.02 0.36 0.35 0.96
10 2.42 −0.03 0.62 0.61 0.96
15 2.65 −0.03 0.83 0.84 0.97
0.125 5 1.76 −0.02 0.33 0.34 0.96
10 2.58 −0.04 0.58 0.59 0.96
15 2.91 −0.03 0.77 0.80 0.97
1 0.25 5 1.71 0.02 0.47 0.45 0.95
10 2.42 0.03 0.80 0.77 0.95
15 2.65 0.05 1.04 1.02 0.95
0.125 5 1.76 −0.03 0.47 0.44 0.95
10 2.58 −0.05 0.80 0.75 0.95
15 2.91 −0.04 1.05 0.99 0.95

ESD = empirical standard deviation

ASE = average asymptotic standard error

CP = coverage probability.

Next, we evaluated the sensitivity of our proposed methods to the no-unmeasured-predictors assumption. Specifically, we set up a model with the gamma frailty, Qi, affecting both dΛi(t) and dRi(t), in violation of our underlying assumptions. The simulated proportional hazards model (1) changes to

dΛ(t|Zi)=QieβZidΛ0(t), (5)

such that Qi represents an unmeasured covariate affecting, now, both the death hazard and conditional event rate given survival. In this set-up, parameters in the conditional recurrent event rate model are no longer estimated consistently under the proposed approach. The estimators θ̂ and 0(t) are roots of the estimating functions i=1n0τXidMiR(t) and i=1n0τdMiR(s), respectively, where MiR(t) is as defined in (3). These estimating functions are unbiased and lead to consistent estimation when E[dMiR(t)|Xi,Di>t]=0. However, under models (4) and (5), E[dMiR(t)|Xi,Di>t]=P(Ci>t|Xi)E[Qi|Xi,Di>t]dt, which need not equal 0, hence leading to inconsistent estimation.

The impact of such violations is evaluated through simulation. In addition to the scenarios examined in the simulation study of the correct model, an extreme case where V (Qi) = 2, which is 8 times E(Qi), was also examined to assess the impact of highly variable frailty terms on the performance of ψ̂(t). Table 2 demonstrates the robustness of ψ̂(t) under this misspecified model. The estimator is biased, although the bias is not large, particularly in relative terms. The proposed standard error estimators remain close to the empirical standard deviations and hence coverage probabilities are still close to 0.95. Generally, the proposed methods appear fairly robust to the no-unmeasured-predictors assumption.

Table 2.

Simulation Results: Robustness of Proposed Estimator under a Misspecified Model

V(Qi) β1 dR0(t) ψ(t) bias ESD ASE CP
2 1 0.25 4.18 0.01 0.39 0.38 0.95
0.125 4.22 0.01 0.33 0.33 0.94
0.5 0.25 4.30 0.00 0.37 0.38 0.94
0.125 4.31 0.01 0.33 0.32 0.94
1 1 0.25 3.65 0.04 0.43 0.41 0.94
0.125 3.71 0.03 0.35 0.34 0.95
0.5 0.25 3.86 0.03 0.41 0.40 0.95
0.125 3.88 0.05 0.34 0.34 0.96
0.5 1 0.25 2.91 0.01 0.44 0.43 0.94
0.125 3.01 0.04 0.37 0.36 0.94
0.5 0.25 3.25 0.01 0.43 0.42 0.94
0.125 3.30 −0.01 0.35 0.35 0.96

Estimators are evaluated at t = 10, the mean of the censoring distribution.

5. Application

We applied our proposed method to the study of renal transplant patients. Patients began follow-up (t = 0) at the time of kidney transplantation. Some patients received an ECD transplant, while others got a non-ECD kidney. Patients who receive a kidney transplant are suspected of being subject to an increased mortality hazard and hospitalization rate in the weeks immediately following the transplant, due to the risk of surgical complications, which are suspected to be more serious in ECD recipients. From a public health perspective, it is of interest to compare ECD and non-ECD transplanted patients with respect to mean number of hospitalizations. Due to the strong time-dependence in the effect of ECD kidney transplantation, the instantaneous effect on the hospitalization rate is generally of less interest than the cumulative effect. Moreover, survival probabilities are known to be reduced for ECD relative to non-ECD recipients (Port et al., 2002). Given these facts, the ECD effect on the mean number of hospitalizations would not be expected to be constant over time.

We combine demographic, clinical and mortality data from the Scientific Registry of Transplant Recipients (SRTR) and hospitalization history information from Centers for Medicare and Medicaid Services (CMS). The study population is restricted to patients whose primary payer was Medicare. To increase homogeneity, we also exclude from our target population repeat and multi-organ transplant recipients. All Medicare patients age ≥ 18 who received a kidney transplant from a deceased donor in year 2000 were included in our study sample. In total, 3,816 recipients with follow-up and complete covariate information were included and tracked from the time of transplant until death, loss to follow-up, or at the end of the observation period (December 31, 2005). Among the 3,816 patients, 970 (25.42%) were observed to die during the 6-year follow-up period, with the remaining 2,846 (74.58%) recipients censored, either due to loss to follow-up or administratively at 12/31/2005. Among the n =3816 patients in our analysis, 3213 have no hospitalization in the follow up period. Among the 603 hospitalized patients, 255 had 1–3 hospitalizations, and 279 had 4–10 hospitalizations. There were three patients with more than 30 hospitalizations, with the maximum number of hospitalizations being 34. On average, each recipient experienced 0.85 hospitalizations during the follow-up period.

As stated, the treatment of interest is deceased donor source: ECD (Zi1 = 1) or non- ECD (Zi1 = 0). We adjusted for the same set of covariates in the proportional hazards and additive rates models: candidate demographics (gender, age, race), pre-transplant years on dialysis, various comorbid conditions (drug treated chronic obstructive pulmonary disease (COPD), angina, symptomatic cerebral vascular disease, symptomatic peripheral vascular disease, pretransplant malignancy), primary diagnosis (polycystic kidney disease, diabetes, hypertension), functional status (fully active to severely disabled), and stay in the intensive care unit (ICU).

Hazard ratios and conditional recurrent event rate differences for each covariate are listed in Table 3, as well as their corresponding p-values. Recipients of an ECD kidney have a 1.3 times higher hazard of death compared to non-ECD recipients. At the same time, ECD kidney recipients experienced 84.4 more hospital admissions per 1,000 patients per year survived. Lower survival probabilities for ECD patients could lead to a reduced mean number of hospitalizations, since hospitalizations are terminated by death. However, ECD patients also experience a significantly elevated conditional hospitalization rate given survival. The combination of these two effects results in the marginal effect of ECD on mean hospitalizations.

Table 3.

Analysis of Kidney Transplant Data: Estimated Regression Parameters from Proportional Hazards and Additive Rates Models

Covariate, Zik = Xik eβ̂k p θ̂k
SE^(θ^k)
p
ECD 1.28 0.002 84.4 29.4 0.004
Female 1.00 0.951 −6.6 19.2 0.732
Age 18 – 24 0.34 0.003 −80.7 41.1 0.045
Age 25 – 34 0.55 0.0001 −87.6 29.0 0.003
Age 35 – 44 0.83 0.092 −56.2 26.4 0.033
Age 45 – 54 1 - 0 - -
Age 55 – 64 1.62 < 0.0001 50.3 28.9 0.082
Age 65 – 70 2.12 < 0.0001 109.5 40.9 0.007
Age ≥ 70 2.88 < 0.0001 146.5 47.8 0.002
African American 0.99 0.908 5.7 24.3 0.813
COPD 1.25 0.457 7.1 136.5 0.960
Angina 1.40 0.0002 71.1 39.5 0.072
Pretransplant Malignancy 2.67 0.092 750.7 517.9 0.147
Cerebral vascular disease 1.19 0.323 95.4 71.5 0.182
Peripheral vascular disease 1.03 0.848 31.7 56.1 0.571
Polycystic kidneys 0.56 0.0006 −138.7 22.7 < 0.001
Diabetes 1.59 < 0.0001 123.3 32.2 < 0.001
Hypertension 1.19 0.067 16.5 24.3 0.500
Years on dialysis 1.04 0.002 −0.3 3.0 0.932
Functional status: minor disability 0.31 < 0.0001 −114.2 106.3 0.282
Intensive Care Unit 1.24 0.631 −220.1 136.2 0.106

θ: additional hospitalizations per 1,000 per year survived.

Based the proposed estimator ψ̂(t), ECD recipients on average experience 79 more hospital admissions per 1000 persons compared to a non-ECD recipients at the end of one year. This difference increases with time post-transplant and reaches 318 per 1,000 patients at t = 5 years, as shown in Table 4. At each time point, the difference between ECD transplant patients and non-ECD transplant patients is highly significant. In summary, receiving an ECD kidney leads to a significantly greater number of hospitalizations post-transplant.

Table 4.

Analysis of Kidney Transplant Data (ECD vs. non-ECD): Difference in Mean Numbers of Hospitalizations

t ψ̂(t)
SE^{ψ^(t)}
p
1 year 79 28 0.005
2 year 153 54 0.005
3 year 217 79 0.006
4 year 270 104 0.010
5 year 318 128 0.014

ψ: differences in hospitalization means per 1,000 patients between ECD and non-ECD recipients at the end of 1 to 5 years.

The difference in mean number of hospitalizations (averaging over the observed adjustment covariate pattern) for ECD and non-ECD transplant recipients is plotted in Figure 1. We can see that ECD recipients have more hospitalizations immediately after transplant. The difference keeps increasing with time and reaches 318 hospital admissions per 1,000 patients at the end of the 5-year follow-up period. The increase in hospitalization rate due to ECD dominates the decrease in survival probability for ECD recipients and leads to positive ECD effects on the marginal event mean.

Figure 1.

Figure 1

ECD effect on mean number of hospitalizations

Linear trend: The dashed line represents the difference in means which would result if the difference across the (0, 400] day interval persisted across the (400, 1825] day interval.

We present ψ̂(t) and point-wise 95% confidence intervals in Figure 2. It is clear that the increase in mean number of hospitalizations associated with ECD transplantation is highly significant.

Figure 2.

Figure 2

95% point-wise confidence intervals for the difference between ECD and non-ECD mean number of hospitalizations

The two outside lines represent 95% point-wise confidence intervals for the year 0 – 5 interval.

6. Discussion

We propose semiparametric methods to compare marginal treatment-specific mean numbers of recurrent events in the presence of a terminating event. The proposed methods involve modeling terminating event hazard and the conditional recurrent event rate given survival separately, then integrating to estimate treatment-specific marginal recurrent event means. A measure of the combined effects is proposed, with its asymptotic properties derived and evaluated in moderate size samples. We demonstrate that the estimator works reasonably well under misspecified models; that is, models failing to incorporate unmeasured predictors of both the terminating event hazard function and conditional recurrent event rate. The proposed estimator is applied to national kidney transplant data to study the effect of ECD transplantation on post-transplant hospitalization admission numbers. ECD recipients are found to have consistently more hospitalizations during the whole follow-up period, with the difference between ECD and non-ECD recipients increasing with time.

The proposed method estimates the difference in treatment-specific marginal means, while incorporating treatment-specific differences in survival. Thus, ψ(t) combines the actual survival probabilities and recurrent event rates for the treatment and placebo respectively and reflects the difference in marginal recurrent event numbers which would be observed between treatment and placebo patients with the same adjustment covariates. At the same time, researchers may also be interested in contrasts between treatment-specific survival and/or recurrent event rates among survivors. For a complete interpretation of their findings, researchers should carefully consider all three estimators, ψ̂(t), β̂1 and θ̂1.

Our strongest assumption is that there are no unmeasured predictors of both the terminating event hazard and conditional event rate and this assumption will fail frequently, particularly in observational studies. Several issues are important in this regard. First, the unmeasured predictor must be a risk factor for both the terminating and recurrent event conditional on all measured covariates. If the most important predictors are included in the adjustment covariate vector, bias may be minimized and the estimated treatment effect may be a fairly accurate approximation to the reality. Second, the most popular alternative to a no-unmeasured-predictor assumption is to incorporate a frailty. However, most frailty approaches for recurrent event data assume that the events follow a Poisson process, which is restrictive in its own right. To the best of our knowledge, there is only one method (Ye, Kalbfleisch and Schaubel; unpublished manuscript) which employs a frailty in the absence of the Poisson-process assumption, and this method does not propose cumulative effect measures. Third, our sensitivity analysis reveals that if an unmeasured frailty affects both the terminating event hazard and conditional rate, bias is relatively small.

Supplementary Material

Supp

Acknowledgements

This research was supported in part by National Institutes of Health grant R01 DK-70869 (DES). The authors thank the Editor, the Associate Editor and the Referees for helpful comments which lead to substantial improvement of the manuscript. They also thank Jack Kalbfleisch for his thoughtful suggestions. They wish to thank the Scientific Registry of Transplant Recipients (SRTR) for access to the renal transplant data, as well as the programmers and analysts at the Arbor Research Collaborative for Health and the University of Michigan Kidney Epidemiology and Cost Center for assistance with preparing the analysis files. The SRTR is funded by contract number 231-00-0116 from the Health Resources and Services Administration (HRSA), U.S. Department of Health and Human Services.

Footnotes

Supplementary Materials

Web Appendix A–B, referenced in Section 3, are available under the Paper Information link at the Biometrics website http://www.tibs.org/biometrics.

References

  1. Andersen PK, Gill RD. Cox’s regression model for counting processes: A large sample study. Annals of Statistics. 1982;10:1110–1120. [Google Scholar]
  2. Bilias Y, Gu M, Ying Z. Towards a general asymptotic theory for the Cox model with staggered entry. Annals of Statistics. 1997;25:662–682. [Google Scholar]
  3. Cook RJ, Lawless JF. Marginal analysis of recurrent events and a terminating event. Statist. Med. 1997;16:911–924. doi: 10.1002/(sici)1097-0258(19970430)16:8<911::aid-sim544>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]
  4. Cox DR. Regression models and life-tables (with discussion) J. R. Statist. Soc. B. 1972;34:187–220. [Google Scholar]
  5. Cox DR. Partial likelihood. Biometrika. 1975;62:262–276. [Google Scholar]
  6. Fleming TR, Harrington DP. Counting Processes and Survival Analysis. New York: Wiley; 1991. [Google Scholar]
  7. Ghosh D, Lin DY. Nonparametric analysis of recurrent events and death. Biometrics. 2000;56:554–562. doi: 10.1111/j.0006-341x.2000.00554.x. [DOI] [PubMed] [Google Scholar]
  8. Ghosh D, Lin DY. Marginal regression models for recurrent and terminal event. Statist. Sinica. 2002;12:663–688. [Google Scholar]
  9. Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. New York: Wiley; 2002. [Google Scholar]
  10. Lawless JF, Nadeau C. Some simple robust methods for the analysis of recurrent event. Technometrics. 1995;37:158–168. [Google Scholar]
  11. Li Q, Lagakos S. Use of the Wei-Lin-Weissfeld method for the analysis of a recurring and a terminating event. Statist. Med. 1997;16:925–940. doi: 10.1002/(sici)1097-0258(19970430)16:8<925::aid-sim545>3.0.co;2-2. [DOI] [PubMed] [Google Scholar]
  12. Lin DY, Fleming TR, Wei LJ. Confidence Bands for Survival Curves under the Proportional Hazards Model. Biometrika. 1994;81:73–81. [Google Scholar]
  13. Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. J. R. Statist. Soc. B. 2000;62:711–730. [Google Scholar]
  14. Huang X. Shared frailty models for recurrent events and a terminal event. Biometrics. 2004;60:747–756. doi: 10.1111/j.0006-341X.2004.00225.x. [DOI] [PubMed] [Google Scholar]
  15. Pepe MS, Cai J. Some graphical displays and marginal regression analyses for recurrent failure times and time-dependent covariates. J. Am. Statist. Assoc. 1993;88:811–820. [Google Scholar]
  16. Pollard D. Empirical Processes: Theory and Applications. Hayward: Institute of Mathematical Statistics; 1990. [Google Scholar]
  17. Prentice RL, Williams B, Peterson AV. On the regression analysis of multivariate failure time data. Biometrika. 1981;68:373–379. [Google Scholar]
  18. Schaubel DE, Cai J. Analysis of clustered recurrent event data with application to hospitalization rates among renal failure patients. Biostatistics. 2005;6:404–419. doi: 10.1093/biostatistics/kxi018. [DOI] [PubMed] [Google Scholar]
  19. Schaubel DE, Zeng D, Cai J. A semiparametric additive rates model for recurrent event data. Lifetime Data Analysis. 2006;12:389–406. doi: 10.1007/s10985-006-9017-x. [DOI] [PubMed] [Google Scholar]
  20. Sen PK, Singer JM. Large Sample Methods in Statistics. New York: Chapman and Hall; 1993. [Google Scholar]
  21. van der Vaart AW. Asymptotic Statistics. Melbourne: Cambridge; 2000. [Google Scholar]
  22. van der Vaart AW, Wellner JA. Weak Convergence and Empirical Processes. New York: Springer; 1996. [Google Scholar]
  23. Ye Y, Kalbfleisch JD, Schaubel DE. Semiparametric analysis of correlated recurrent and terminal events. Biometrics. 2007;63:78–87. doi: 10.1111/j.1541-0420.2006.00677.x. [DOI] [PubMed] [Google Scholar]
  24. Ye Y, Kalbfleisch JD, Schaubel DE. An estimating function approach to the analysis of recurrent and terminal events. doi: 10.1111/biom.12025. (submitted manucript). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp

RESOURCES