Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Oct 23.
Published in final edited form as: Lifetime Data Anal. 2010 Mar 14;16(3):353–373. doi: 10.1007/s10985-010-9160-2

Additive–multiplicative rates model for recurrent events

Yanyan Liu 1, Yuanshan Wu 2,3, Jianwen Cai 4,, Haibo Zhou 5
PMCID: PMC3199147  NIHMSID: NIHMS327688  PMID: 20229314

Abstract

Recurrent events are frequently encountered in biomedical studies. Evaluating the covariates effects on the marginal recurrent event rate is of practical interest. There are mainly two types of rate models for the recurrent event data: the multiplicative rates model and the additive rates model. We consider a more flexible additive–multiplicative rates model for analysis of recurrent event data, wherein some covariate effects are additive while others are multiplicative. We formulate estimating equations for estimating the regression parameters. The estimators for these regression parameters are shown to be consistent and asymptotically normally distributed under appropriate regularity conditions. Moreover, the estimator of the baseline mean function is proposed and its large sample properties are investigated. We also conduct simulation studies to evaluate the finite sample behavior of the proposed estimators. A medical study of patients with cystic fibrosis suffered from recurrent pulmonary exacerbations is provided for illustration of the proposed method.

Keywords: Recurrent events, Rate regression, Additive–multiplicative rates model, Counting process, Empirical process

1 Introduction

Recurrent event data are common in biomedical studies. For example, patients with cystic fibrosis may suffer from repeated pulmonary exacerbations of respiratory symptoms (Therneau and Grambsch 2000); HIV patients may experience recurrent opportunistic infections (Li and Lagakos 1997). Other examples include myocardial infarctions, tumor metastases etc.

Modeling the occurrence of recurrent events has been a much discussed topic in the last few years and recurrent event data can be viewed as a special case of multivariate failure time data since the different event times within the same subject are ordered and thus correlated. Therefore, these data can be analyzed by well-established marginal intensity model approaches (e.g., Wei et al. 1989; Lee et al. 1992) and conditional intensity model approaches (e.g., Prentice et al. 1981; Andersen and Gill 1982; Chang and Wang 1999). These models are commonly used due to the convention and availability of statistical software. However, when the true underlying covariate effects may add to, rather than multiply, the baseline event intensity, a plausible alternative would be the additive hazards model (e.g., Aalen 1980; Breslow and Day 1980; Cox and Oakes 1984; Buckley 1984; Breslow and Day 1987; Aalen 1989; Huffer and McKeague 1991; Andersen et al. 1993). Most of these approaches are fully parametric except the nonparametric methods of Aalen (1980, 1989) and Huffer and McKeague (1991). Lin and Ying (1994) proposed a semiparametric additive model and gave an explicit estimator of the model parameters through the techniques analogous to partial-likelihood-based method (Andersen and Gill 1982). The additive and multiplicative hazards models postulate two rather different relationship between the covariate and the hazard function. However, these two types of effects could exist for different convariates. Lin and Ying (1995) proposed a general additive–multiplicative intensity model which allows some covariate effects to be additive while others to be multiplicative, and studied the asymptotic properties of their proposed estimates.

In the context of recurrent event data, because the mean number of events is more interpretable than the event intensity, some authors have proposed to model the mean function under the assumption that the covariates act multiplicatively on the unspecified baseline rate function (e.g., Pepe and Cai 1993; Lawless and Nadeau 1995; Lin et al. 2000; Cai and Schaubel 2004). Schaubel et al. (2006) developed a semiparametric additive model for the marginal recurrent event rate, wherein they assume the covariates effects are added to the unspecified baseline rate function. In this article, we consider a general additive–multiplicative rates model in the spirit of the work of Lin and Ying (1995) which includes the multiplicative model and the additive model as special cases. The model assumes that some of the covariates act additively while others multiplicatively on the unspecified baseline rate function.

The remainder of this article is organized as follows. In Sect. 2, we introduce the proposed models and derive a class of estimating equations for the estimates of regression parameters and the baseline rate function. The asymptotic properties of the proposed estimators are investigated in Sect. 3 with the proofs deferred to the Appendix. Simulation studies are conducted in Sect. 4 to evaluate the finite-sample behavior of the asymptotic approximation. The proposed methods are applied to a cystic fibrosis clinical trial in Sect. 5. Some discussions are given in Sect. 6.

2 Model and method

In this section, we describe the inference procedure for the proposed model, beginning with the establishment of some notation.

2.1 Notation and model

Let Ni*(t)(i=1,,n) be the number of events that occur over the interval [0, t] for the ith subject under study. The corresponding right censoring time is denoted by Ci. Due to the censoring, the number of observed events at time t is Ni(t)=Ni*(tCi), where ab equals the minimum of a and b. Let Yi(t) = I(Cit) denote the at-risk process, where I(·) is the indicator function. Associated with each Ni(t) is a p-dimensional covariate process Wi(t)={ZiT(t),XiT(t)}T, with any time-dependent elements to be external as defined by Kalbfleisch and Prentice (2002); i.e., to have paths which are known at time t = 0, where aT is the transpose of a vector or matrix a. Furthermore, the censoring mechanism is assumed to be independent in the sense that

E{dNi*(t)|Wi(t),Cit}=E{dNi*(t)|Wi(t)}.

Assume that the follow-up interval is [0, τ], where τ is prespecified constant typically meaning the terminal time of study. Thus, the observable data consist of {Ni(t), Yi(t), Wi(t); t ∈ [0, τ]}(i = 1, …, n). Subjects are assumed to be statistically independent, although events within the same subject are ordered and thus naturally expected to be correlated.

We consider the following additive–multiplicative rates model:

E{dNi*(t)|Wi(t)}=g{γ0TZi(t)}dt+h{β0TXi(t)}dμ0(t), (1)

where θ0=(γ0T,β0T)T is p-vector of unknown regression parameters, g and h are knownlink functions and μ0(·) is an unspecified continuous baseline mean function for subjects with covariates zi(t) and xi(t) such that g{γ0Tzi(t)}=0 and h{β0Txi(t)}=1. Let γ0 = (γ01, …, γ0p1)T and β0 = (β01, …, β0p2)T, then γ0j can be interpreted as the rate difference for one unit change in Zji while holding Xi and other components in Zi the same when g(x) = x. As suggested by the associate editor, by holding Zi and other components in Xi the same, β0j can be interpreted as the logarithm of the ratio of residual rate of risk for one unit change in Xji when h(x) = ex, where the residual rate of risk is defined as the overall rate E{dNi*(t)|Wi(t)} minus the additive rate of risk. Apparently, model (1) defines a very rich family of models through the link functions g and h, which contains the additive rates model and the multiplicative rates model as its special cases. Selection of these appropriate link functions may be based on prior data or the resulting interpretation of the regression parameters. If the rate of interest is conditioning on the time-independent covariates, i.e., E{dNi*(t)|Wi}, then by integration or summing one can obtain E{Ni*(t)|Wi}, to which we refer as means model. Since the means model is a special case of rates model, we focus on the rates model (1) hereafter.

2.2 Inference procedure

We now describe the estimating procedure under model (1), beginning by defining the process:

Mi(t,θ)=Ni(t)0tYi(u){g{γTZi(u)}du+h{βTXi(u)}dμ0(u)},

where θ (γT, βT)T is included in the parameter space denoted by Θ, which contains the true value θ0 as its interior point and is assumed to be compact for technical proof. Abbreviate Mi(t, θ0) as Mi(t). Since E{dMi(t)|Wi(t)} = 0 under model (1) and independent censoring, an approach analogous to generalized estimating equations (Liang and Zeger 1986) suggests the following estimating equations for μ0(t) and θ0, respectively:

i=0n0tdMi(u)=0, (2)
i=0n0τQi(u,θ)dMi(u,θ)=0, (3)

where Qi(t, θ) is a smooth p-vector-value function of Wi(t) and θ not involving μ0(t).

Solving (2), a class of the baseline mean function estimators for μ0(t) (with given θ0) is directly obtained as follows:

μ^0(t,θ0)=0ti=1n{dNi(u)Yi(u)g{γ0TZi(u)}du}i=1nYi(u)h{β0TXi(u)}.

Substituting μ̂0(t, θ) for μ0(t) into (3), followed by some algebraic manipulation, yields an estimating function U(τ, θ) for θ0 which is free of μ0(t), where

U(t,θ)=i=1n0t{Qi(u,θ)Q¯(u,θ)}dMi(u,θ), (4)

and

Q¯(t,θ)=i=1nYi(t)Qi(t,θ)h{βTXi(t)}i=1nYi(t)h{βTXi(t)}.

Hence, we obtain an estimator, denoted by θ̂, for θ0, which solves equation U(τ, θ) = 0, and the corresponding estimator for μ0(t) is then denoted by μ̂0(t, θ̂). To ensure the monotonicity of the estimate of the baseline mean function, the estimator for μ0(t) is modified to be:

μ˜0(t,θ^)=max0utμ^0(u,θ^).

Instead of constraining θ̂ to force the baseline rate estimator to be positive, the baseline mean estimator μ̃0(t, θ̂) is constrained to be monotone non-decreasing, analogous to the approach of Lin and Ying (1994).

Similarly, one possible choice of Qi(t, θ) discussed by Lin and Ying (1995) is obviously suitable here to encompass inference procedures arising from multiplicative rates model and additive rates model, respectively, as its special cases. To be specific, let

Qi(t,θ)=[g{γTZi(t)}Zi(t)/h{βTXi(t)}h{βTXi(t)}Xi(t)/h{βTXi(t)}], (5)

where and in what follows f′(x) = df(x)/dx. Putting (5) into (4), we have the estimating equations U(τ, θ) and therefore give the corresponding estimate of θ0.

3 Asymptotic properties

In this section, we establish the large sample properties of the proposed estimators, beginning with the following regularity conditions, analogous to those of Andersen and Gill (1982, Theorem 4.1) and assumed to be held throughout our discussion for i = 1, …, n.

  • C1. {Ni*(·),Yi(·),Wi(·),Qi(·,θ)} are independent and identically distributed;

  • C2. P(Ci ≥ τ) > 0;

  • C3. Ni(τ) is bounded by a constant;

  • C4. Wi(·) has bounded total variation, i.e., for the j-th element of Wi(·), |Wij(0)|+0τ|dWji(u)|<C* almost surely, where C* is a constant; moreover, Qi(·, θ) has bounded total variation, uniformly in θ ∈ Θ;

  • C5. The following defined matrix A is nonsingular:
    A=E{0τY1(u){Q1(u,θ0)q¯(u,θ0)}[g{γ0TZ1(u)}Z1(u)duh{β0TX1(u)}X1(u)dμ0(u)]T},
    where q¯(t,θ)=E[Y1(t)Q1(t,θ)h{βTX1(t)}]E[Y1(t)h{βTX1(t)}];
  • C6. g is nonnegative and hT Xi(t)} is locally bounded away from 0 for β in a small neighborhood of β0; g and h are continuous differentiable and
    {Qi(t,θ)/θ,h{βTXi(t)},g{γTZi(t)}Zi(t),h{βTXi(t)}Xi(t):t[0,τ],i=1,,n}
    are equicontinuous and bounded uniformly in parameter space Θ.

Conditions C1, C3, C4, and C6 simplify our derivation of the asymptotic results but do not impose practical limitations. Condition C2 can be enforced by choosing τ to be not greater than the maximum observation time. Condition C5 is a technique assumption. Note that (t, θ) is well defined since E[Y1(t)hT X1(t)}] is bounded away from zero whenever t ∈ [0, τ] under conditions C2 and C6. We describe the asymptotic behavior of estimates of the regression parameters in the following theorem.

Theorem 1 Under conditions C1 to C6, θ̂ converges almost surely to θ0, while n(θ^θ0) converges weakly to a zero-mean normal distribution with covariance A−1 Σ(τ, τ)(AT)−1, where

Σ(s,t)=E[0s{Q1(u,θ0)q¯(u,θ0)}dM1(u)0t{Q1(υ,θ0)q¯(υ,θ0)}TdM1(υ)]

for s and t in [0, τ].

Theorem 1 can be proved by combining the multivariate central limit theorem and some results from empirical processes theory (Pollard 1990; Bilias et al. 1997) and a lemma from Lin et al. (2000). We present the outline of the proof in the Appendix.

Furthermore, using the Strong Law of Large Numbers (SLLN) and Lemma 1 from Lin et al. (2000) and combining with the results of Theorems 1 and 2 followed below imply that the asymptotic covariance can be consistently estimated by Â−1 Σ̂(τ, τ) (ÂT)−1, where

A^=1ni=1n0τYi(u){Qi(u,θ^)Q¯(u,θ^)}[g{γ^TZi(u)}Zi(u)duh{β^TXi(u)}Xi(u)dμ^0(u,θ^)]T,Σ^(s,t)=1ni=1n0s{Qi(u,θ^)Q¯(u,θ^)}dM^i(u)0t{Qi(υ,θ^)Q¯(υ,θ^)}TdM^i(υ),

with

dM^i(t)=dNi(t)Yi(t){g{γ^TZi(t)}dt+h{β^TXi(t)}dμ^0(t,θ^)}. (6)

The essential asymptotic results for the baseline mean function estimator are summarized by the following theorem. We firstly introduce some notation:

sg(t,θ)=E[Y1(t)g{γTZ1(t)}Z1(t)],sh(t,θ)=E[Y1(t)h{βTX1(t)}X1(t)],sh(t,θ)=E[Y1(t)h{βTX1(t)}],s˜(t,θ)=[0tsg(u,θ)sh(u,θ)du0tsh(u,θ)sh(u,θ)dμ0(u)].

Theorem 2 Under conditions C1 to C6, {μ̃0(t, θ̂) − μ0(t)} converges almost surely to 0, uniformly in t ∈ [0, τ], while n(μ˜0(t,θ^)μ0(t)) converges weakly to a zero-mean Gaussian process with covariance function ξ(s, t) = E1(s1(t)], where

Ψi(t)=0t1sh(u,θ0)dMi(u){s˜(t,θ0)}TA10τ{Qi(u,θ0)q¯(u,θ0)}dMi(u).

Using arguments analogous to those of Lin and Ying (1994), we can prove n{μ˜0(t,θ^)μ^0(t,θ^)}=op(1), where op(1) is uniformly in t ∈ [0, τ]. Therefore we work with μ̃0(t, θ̂) instead of μ̃0(t, θ̂) in the proof of Theorem 2. We then apply the central limit theorem and results from empirical processes theory to the decomposition, {μ̂0(t, θ̂) − μ0(t)} = {μ̂0(t, θ̂) − μ̂0(t, θ0)} + {μ̂0(t, θ0) − μ0(t)}, to finish the proof of Theorem 2. An outline of the proof is also provided in the Appendix.

The covariance function ξ(s, t) can be consistently estimated by replacing limiting quantities in Ψi(t) with their respectively empirical counterparts. To be specific, ξ^(s,t)=1ni=1nΨ^i(s)Ψ^i(t), where

Ψ^i(t)=0t1Sh(u,θ^)dM^i(u){S~(t,θ^)}TA^10τ{Qi(u,θ^)Q¯(u,θ^)}dM^i(u)

with Sh(t, θ) and (t, θ) listed in the Appendix.

4 Simulation studies

Simulation studies were conducted to assess the adequacy of the proposed large-sample approximations for practical sample size. Specially, event times were generated from the following mixed effects rate model:

E{dNi*(t)|Zi(t),Xi(t),ηi}=ηi[g{γ0TZi(t)}dt+h{β0TXi(t)}dμ0(t)],i=1,,n,

where the frailty variable ηi induces positive correlation among the within-subject events and follows a gamma distribution with unit-mean and variance ση. Note that larger ση corresponds to higher positive correlation among intra-subject event times and ση = 0, i.e., ηi = 1 for all i, indicates that within-subject event times are independent. We considered the simple cases: g{γ0TZi(t)}=γ0Zi,h{β0TXi(t)}=exp(β0Xi) and the baseline rate m0 ≡ dμ0(t)/dt is a constant. The (j + 1)th event time for the ith subject was generated by Ti,j + 1 = Ti,j − {ηi0Zi + exp(β0Xi)m0]}−1 log(Ui,j+1), where Ui,j were generated from uniform (0, 1) distribution independently, and Ti,0 = 0. The estimating function U(τ, θ) was derived by choosing Qi(t, θ) according to (5).

Covariates Zi’s were independently generated from uniform (0, 1). We generated Xi’s independently from Bernoulli distribution with success probability 0.5. The censoring times Ci’s were generated from uniform (0, 3). The baseline rate dμ0(t)/dt varied from 0.125 to 0.25. The expected number of observed events varied from 0.42 to 1.67 and 0.57 to 1.98 at (γ0, β0) = (0, 0.2) and (γ0, β0) = (0.2, 0.2), respectively. We set sample size n =50, 100, 200 and each simulation was repeated 2000 times. The sample mean and sample standard deviation of the 2000 estimates are given in the Mean and SD columns, respectively. The SE columns give the average of the estimated standard errors and CP columns give the coverage probability of the nominal 95% confidence interval for the true parameter using the estimated standard error.

Table 1 displays the simulation results. From Table 1, we make the following observations: (i) the estimators proposed are approximately unbiased when γ0 = 0 and slightly underestimated when γ0 = 0.2; (ii) the average asymptotic standard error estimators (SE) are approximately equal to the empirical standard deviations (SD); (iii) the corresponding 95% confidence intervals based on the estimated standard errors provide reasonable coverage probabilities; (iv) for fixed n and μ0, both SD and SE increase as the intra-subject event times correlation ση increases; (v) as expected, as the sample size increases, SE is decreasing and closer to SD.

Table 1.

Summary statistics for the simulation studies based on 2000 replicated data set when the baseline rate function dμ0(t)/dt is time-independent

n μ0(t) ση γ0 = 0 γ0 = 0.2


Mean SD SE CP Mean SD SE CP
50 0.125t 0 0.002 0.149 0.153 0.954 0.204 0.191 0.198 0.954
0.25 −0.001 0.158 0.159 0.950 0.190 0.206 0.206 0.947
0.25t 0 0.008 0.206 0.216 0.957 0.208 0.232 0.246 0.960
0.25 −0.003 0.225 0.227 0.940 0.178 0.257 0.258 0.947
100 0.125t 0 0.000 0.108 0.106 0.944 0.196 0.139 0.137 0.949
0.25 −0.001 0.107 0.108 0.950 0.192 0.143 0.142 0.950
0.25t 0 −0.002 0.149 0.149 0.949 0.188 0.170 0.169 0.948
0.25 0.000 0.154 0.154 0.950 0.184 0.176 0.177 0.954
200 0.125t 0 0.001 0.075 0.074 0.947 0.198 0.098 0.096 0.945
0.25 −0.001 0.076 0.076 0.952 0.190 0.101 0.100 0.949
0.25t 0 0.000 0.106 0.104 0.949 0.193 0.120 0.118 0.951
0.25 −0.002 0.108 0.109 0.953 0.178 0.125 0.124 0.949

The additive–multiplicative means model E(N*(t)|Z, X) = γ0 Zt + exp(0.2X0(t) is considered with γ0 = 0 or 0.2, where μ0(t) is the baseline mean function, Z is generated from uniform (0, 1), and X is a Bernoulli variable with success probability 0.5. ση describes the positive correlation within the recurrent events and n is the number of subjects under study. Mean is the sample mean of the estimator γ̂, SD is the sampling standard deviation of γ̂, SE is the sampling mean of the standard error estimator, and CP is the coverage probability of the 95% confidence interval for γ̂

As suggested by the referees, additional simulation studies were also conducted to investigate the effect of time-dependent baseline rate function dμ0(t)/dt on the performances of the proposed estimator. Two cases dμ0(t)/dt = 0.25t and dμ0(t)/dt = 0.5t are considered with other parameter values not changed. We then use the thinning algorithm (Ross 2006, p. 83) to generate the recurrent event times when the baseline rate function dμ0(t)/dt is time-dependent. To be specific, let m0(t) = dμ0(t)/dt and mi(t) = ηi0Zi + exp(β0Xi)m0(t)], then the recurrent event times {Ti,j} for the ith subject over [0, τ] can be generated as follows:

  • Step 1: t = 0, j = 0, and Ti,j = 0.

  • Step 2: Generate a random number U from uniform (0, 1) distribution.

  • Step 3: t=t1mi(τ)logU. If t > τ, stop.

  • Step 4: Generate a random number V from uniform (0, 1) distribution.

  • Step 5: If Vmi(t)mi(τ), set j = j + 1 and Ti,j = t.

  • Step 6: Go to Step 2.

The related results are summarized in Table 2. It is evident that the same conclusions can be drawn as that from Table 1.

Table 2.

Summary statistics for the simulation studies based on 2000 replicated data set when the baseline rate function dμ0(t)/dt is time-dependent

n μ0(t) ση γ0= 0 γ0 = 0.2


Mean SD SE CP Mean SD SE CP
50 0.125t2 0 −0.004 0.149 0.150 0.949 0.199 0.195 0.192 0.950
0.25 0.001 0.154 0.155 0.947 0.201 0.209 0.201 0.951
0.25t2 0 0.001 0.204 0.210 0.951 0.184 0.232 0.240 0.947
0.25 −0.009 0.213 0.218 0.953 0.182 0.248 0.250 0.953
100 0.125t2 0 0.000 0.101 0.105 0.951 0.199 0.134 0.137 0.949
0.25 −0.001 0.106 0.109 0.953 0.185 0.141 0.142 0.948
0.25t2 0 0.000 0.145 0.147 0.949 0.192 0.160 0.166 0.953
0.25 −0.002 0.147 0.153 0.952 0.178 0.169 0.173 0.953
200 0.125t2 0 0.002 0.073 0.075 0.943 0.192 0.093 0.095 0.956
0.25 −0.001 0.074 0.077 0.957 0.194 0.102 0.100 0.942
0.25t2 0 0.003 0.099 0.103 0.949 0.193 0.113 0.116 0.947
0.25 0.000 0.109 0.108 0.943 0.179 0.121 0.121 0.950

Note: See Table 1

5 A real example

We now apply the methods developed in Sect. 2 to data from a randomized clinical trial which was conducted to assess the efficacy of treatment rhDNase (TRT), a highly purified recombinant enzyme, in reducing pulmonary exacerbations of respiratory symptoms for patients with cystic fibrosis (Therneau and Grambsch 2000). In the study, 325 patients in the placebo arm and 322 patients in the rhDNase arm were monitored for pulmonary exacerbations, along with measuring of baseline level of forced expiratory volume per second (FEV). Most patients were followed for about 170 days. Table 3 shows the frequency of the number of exacerbations. The endpoint of this study is to evaluate the effects of rhDNase (possibly adjusted by covariate FEV) on the recurrences of pulmonary exacerbations over times.

Table 3.

Frequency distribution of pulmonary exacerbations in the rhDNase trial

Group Patients Number of exacerbations

0 1 2 3 4 5
rhDNase 322 218 65 30 6 3 0
placebo 325 186 97 24 13 4 1

We will fit the additive rates (AR) model:

E{dN*(t)|T RT,F EV}=(βT T RT+βF F EV)dt+dμ0(t),

the multiplicative rates (MR) model:

E{dN*(t)|T RT,F EV}=exp(βT T RT+βF F EV)dμ0(t),

and the additive–multiplicative rates models:

E{dN*(t)|T RT,F EV}=(βF F EV)dt+exp(βT T RT)dμ0(t)

and

E{dN*(t)|T RT,F EV}=(βT T RT)dt+exp(βF F EV)dμ0(t),

which are denoted by model AMR1 and model AMR2, respectively.

Mimicking Lin et al. (2001), we use mean-square-type distance between the observed and the expected recurrences, defined as

D*=l=1Li=1n[M^i(tl)]2/l=1Li=1nYi(tl),

to measure the overall lack of fit in the above four rates models, where i(t) is defined by (6) and 0 ≡ t0 < t1 < ⋯ < tL < tL+1 ≡ τ is the set that consists of all the jump points of {Ni(t), Yi(t); t ∈ [0, τ)}(i = 1, …, n) in addition to 0 and τ.

In order to examine graphically which model fit the rhDNase data better, we replace the continuous covariate FEV with its discretized version, denoted by dFEV=I(FEV>57.6), where 57.6 is the median of FEV, in the models AR, MR, AMR1, and AMR2, and thus we partition the subjects under study into four strata. The bold and thin solid curves in Fig. 1 display the Nelson-Aalen-type nonparametric and the model-based estimators for the mean numbers of pulmonary exacerbations, respectively, in each stratum under various models. Figure 2 presents the difference of nonparametric estimator for the mean numbers of pulmonary exacerbations in the stratum (TRT = 0, dFEV = 0) over that in the stratum (TRT = 1, dFEV = 0) indicated by bold solid curve and the corresponding difference of the model-based estimators indicated by the dash curve (for model AR), the dotted-dashed curve (for model AMR1), and the thin solid curve (for model AMR2). Likewise, the difference of the nonparametric and model-based estimators for the mean numbers of pulmonary exacerbations in the stratum (TRT = 0, dFEV = 1) over that in the stratum (TRT = 1, dFEV = 1) are displayed in Fig. 3. As indicated in these figures, the model AR is unlikely to fit the rhDNase data very well since the difference of the nonparametric means functions are not straight lines, and the model AMR2 may provide a reasonable description for the data set.

Fig. 1.

Fig. 1

Mean frequency functions of pulmonary exacerbations for each stratum in the rhDNase trial. The Nelson-Aalen-type nonparametric estimators for the mean numbers of pulmonary exacerbations are shown by bold solid curves in the strata (TRT = 0, dFEV = 0), (TRT = 1, dFEV = 0), (TRT = 0, dFEV = 1), and (TRT = 1, dFEV = 1), from top to bottom, respectively. The corresponding model-based estimators are pertained by thin solid curves

Fig. 2.

Fig. 2

Difference of the mean frequency functions for the stratum (TRT = 0, dFEV = 0) versus (TRT = 1, dFEV = 0). The bold solid curve pertains to the nonparametric estimator; the dashed curve pertains to the model AR based estimator; the dotted-dashed curve pertains to the model AMR1 based estimator; the thin solid curve pertains to the model AMR2 based estimator

Fig. 3.

Fig. 3

Difference of the mean frequency functions for the stratum (TRT=0, dFEV=1) versus (TRT=1, dFEV=1). The bold solid curve pertains to the nonparametric estimator; the dashed curve pertains to the model AR based estimator; the dotted-dashed curve pertains to the model AMR1 based estimator; the thin solid curve pertains to the model AMR2 based estimator

For further comparisons, we consider the models AR, MR, AMR1, and AMR2 using FEV instead of dFEV, and we also use D* as a criterion to select model. The analysis results are summarized in Table 4. First, under the measure of D*, the model AMR2 is the best among the four models AR, MR, AMR1, and AMR2, which has been suggested graphically as reflected in Fig. 1. This indicates that the effect of FEV on the recurrence of pulmonary exacerbations is more likely to be multiplicative while the effect of treatment rhDNase is more likely to be additive. Second, the treatment rhDNase is effective in reducing the recurrence of pulmonary exacerbations. Higher FEV level is associated with lower frequency of the recurrences. Figure 4 displays the mean frequency estimators for the rhDNase and placebo patients with two different levels of FEV under the selected model AMR2.

Table 4.

Analysis results for the rhDNase trial

Model β̂T SE (β̂T) p-Value β̂F SE (β̂F) p-Value D*
AR −0.311 0.145 0.032 −0.0179 0.0027 <0.001 0.8907
MR −0.256 0.121 0.034 −0.0162 0.0028 <0.001 0.8889
AMR1 −0.135 0.065 0.037 −0.0178 0.0027 <0.001 0.8901
AMR2 −0.313 0.140 0.026 −0.0142 0.0027 <0.001 0.8887

Fig. 4.

Fig. 4

Estimated mean frequency functions of pulmonary exacerbations in the rhDNase trial under the model AMR2. The curves from top to bottom pertain to the patients with placebo and FEV = 25, the patients with rhDNase and FEV = 25, the patients with placebo and FEV = 100, and the patients with rhDNase and FEV = 100, respectively

6 Discussions

We propose a semiparametric regression method for analyzing recurrent event data and consider an unified model in the sense that it allows for both multiplicative and additive covariate effects. The resulting regression parameter estimator is shown to be consistent and asymptotically normally distributed and a baseline mean estimator is proposed, which is also shown to be uniformly consistent and weakly convergent. Numerical results indicate that the proposed method performs well in finite sample. A medical study is provided as an illustration.

As the associate editor pointed out, one potential advantage of the additive–multiplicative rates model (1) is that the additive term is appropriate for detecting large difference in absolute risk, while the multiplicative term is sensitive for detecting large difference in relative risk (i.e., rate ratio). Thus, in some situations where both types of covariates exist at the same time, one can consider model (1) as a candidate regression model for recurrent event data. Several considerations can be used as guidances in the initial stage. The first factor in deciding the classification is the objectives of the investigator and the underlying biological process. If the investigator is interested in studying the risk difference of some risk factors, then those variables should be added to the additive part. If the investigator is interested in studying the risk ratio of some other risk factors, then those variables should be added to the multiplicative part. Based on the biological process, the covariates anticipated to have a large impact in absolute risks should be included in the additive part and those which could have a large impact in risk ratios should be included in the multiplicative part. When the biological process is not clear, some data-driven methods could be used for the classification of covariates. For example, when the number of covariates is small, all the possible models from different combinations of covariates in the additive and the multiplicative parts can be considered. To facilitate the selection of the models, we have adopted a mean-square-type distance measure D* between the observed and the expected recurrences as model selection criterion. The assignment of the covariates to the additive part or the multiplicative part will follow the model with the smallest D*. When the number of covariates is moderately large, this comprehensive method could be time-consuming. We recommend to use a two-step approach. First, one can plot the nonparametric estimators for the risk difference and the risk ratio for every covariate. The covariate which has constant risk difference over time should be included in the additive part and the covariate which has constant risk ratio should be included in the multiplicative part. Second, for those covariates which are not clear whether they have constant risk difference or risk ratio, we can consider models from different combinations of these covariates in the additive and the multiplicative parts. The assignment of the covariates to the additive part or the multiplicative part will follow the model with the smallest D*.

Obviously, a suitable choice of Qi(t, θ) would lead to appropriate interpretation of the regression parameters and one form of Qi(t, θ) is proposed, which includes multiplicative and additive covariate effects as its special cases. The estimating equations are proposed to allow arbitrary dependence structures within recurrent events. This is analogous to the use of a generalized estimating equation with an independence working assumption for longitudinal data (Liang and Zeger 1986). However, such approach may not necessarily be efficient. One possible way to improve the efficiency of estimate for the regression parameter is to follow the idea in Cai and Prentice (1995) and consider the weighted estimating function

Uω(τ,θ)=i=1n0tωi(u,θ){Qi(u,θ)Q¯(u,θ)}dMi(u,θ),

where ω = (ωi; i = 1, …, n) is a weight. For more discussions on the weight and its choice refer to Cai and Prentice (1995). One can also follow arguments in Lin and Ying (1995) to derive the information bound I01 for estimating θ0 through defining an appropriate parametric submodel and then construct the efficient estimating function Sopt(θ) using sample-splitting technique such that 1nSopt(θ)θpI0and1nSopt(θ)dN(0,I0). The zero root of such Sopt(θ) is expected to achieve the information bound at n rate. An alternative useful approach to accommodating the dependence of recurrent event times within the same subject is to incorporate a random effect or frailty η into model (1) as indicated in Sect. 4:

E{dN*(t)|W(t),η}=η[g{γ0TZ(t)}dt+h{β0TX(t)}dμ0(t)], (7)

where η follows a specified distribution with some unknown parameters, representing the intra-class correlation. Although there is some literature (e.g., Nielsen et al. 1992; Oakes 1992; Zeng and Lin 2007) on multiplicative intensity-based models with random effect of the form:

E{dN*(t)|t,η}=η exp{θ0TW(t)}λ0(t)dt,

where ℱt is the σ-field generated by {N* (u), W(u+) : 0 ≤ ut} with W(t+) = limst W(s), and λ0(t) is the baseline hazard function, methodology development on model (7) is lacking, although would be valuable, particularly in settings where the association between events within the same subject is of interest in addition to the additive and multiplicative covariate effects.

In the presence of a terminal event, such as death, which precludes further recurrent events, several methods have been proposed for analyzing such recurrent event data (e.g., Cook and Lawless 1997; Ghosh and Lin 2002; Huang and Wang 2004; Liu et al. 2004; Ye et al. 2007). As indicated in Lin et al. (2000) and Schaubel et al. (2006), our proposed model and method could potentially be modified to accommodate such case by substituting DC for C in the above inference procedures, where D is the time to the terminal event, and model the conditional rate function as follows:

E{dN*(t)|W(t),Dt}=g{γ0TZ(t)}dt+h{β0TX(t)}dμ0(t), (8)

and we can show that the basic results in this paper still hold provided that independent censoring is redefined in the sense that

E{dN*(t)|W(t),DCt}=E{dN*(t)|W(t),Dt}.

However, this approach pertains to the cause-specific rate function, which is analogous to the cause-specific hazard function (Kalbfleisch and Prentice 2002, p. 251). If the terminal event is independent of the recurrent event process, then the cause-specific rate function is the same as the marginal rate function for the recurrences. Otherwise, an alternative method is expected to jointly model the recurrent event process and terminal event process since these two processes are usually highly correlated in applications so that it is unrealistic to take them as separate matters and model them separately (e.g., Cook and Lawless 1997). It may be the combination of censoring and death that poses considerable challenges in statistical inference and would be worthwhile to develop methodology for such data.

In some applications, censoring may depend on the underlying recurrent event process even after conditioning on the covariates in the model (1) or (8). Under the informative censoring together with multiplicative rates model without terminal event, Wang et al. (2001) described the correlation structure between the recurrent event process and dependent censoring time through latent variables and provided an estimator for cumulative rate function by treating the distributions of the censoring and latent variables as nuisance parameters; Ghosh and Lin (2003) proposed a joint model that formulates the marginal distributions of the recurrent event process and dependent censoring time through scale-change models; Miloslavsky et al. (2004) proposed to adjust for dependent censoring through the inverse probability of censoring weighting. Generalization of this approach beyond the multiplicative rates model would be useful.

Acknowledgements

The authors thank the editor, the associate editor, and two anonymous referees for their valuable comments and suggestions. This work was partly supported by National Nature Science Fund of China grant 10771163 (Liu) and U.S. NIH grants R01 CA 79949 (Wu and Zhou) and R01 HL 57444 (Cai).

Appendix

To begin, we state two useful lemmas which are the bases for asymptotic approximation.

Lemma 1 Under conditions C1 to C6, the processes 1nU(t,θ0) converges weakly to a zero-mean Gaussian process with covariance Σ(s, t).

Lemma 2 Under conditions C1 to C6,

A^(θn+)a.s.A,  for any estimator θn+a.s.θ0,

where A^(θ)=1nU(τ,θ)θ.

The proofs for Lemmas 1 and 2 follow from Lin et al. (2000) by employing various empirical process results and the detailed proofs omitted here are available from the authors.

Proof of Theorem 1

Consistency

Define ‖a‖ as the max norm for a vector or matrix a. In view of condition C5, let d = 1/(4‖A−1‖) and dn = 1/(4‖Â0)−1‖) whenever Â0) is nonsingular. Select δ sufficiently small such that ‖Â(θ) − Â0)‖ < d whenever ‖θ − θ0‖ < δ, for all n. Since dn almost surely converges to d under Lemma 2, we can conclude that ‖Â(θ) − Â0)‖ < 2dn for large n, where n does not depend on θ, i.e., one can find a commonly large n for all θ under condition C6.

Write Oδ = {θ : ‖θ − θ0‖ < δ} and it follows from the Inverse Function Theorem (Rudin 1964; Foutz 1977) that 1nU(τ,·) is a one-to-one mapping from Oδ onto 1nU(τ,Oδ) and the image set 1nU(τ,Oδ) contains the open neighborhood 1nU(τ,θ0) with radius dnδ. Hence, when n is taken sufficiently large, image set 1nU(τ,Oδ) contains the open neighborhood 1nU(τ,θ0) with radius dδ/2. On the other hand, the convergence of 1nU(τ,θ0) to zero can be derived obviously from Lemma 1. Therefore, θ̂ exists and is unique in Oδ and θ̂ converges to θ0 almost surely since δ can be taken arbitrarily small. In addition, the argument of Jacobsen (1989) can be used to show the global uniqueness of θ̂ for large n.

Asymptotic normality

Clearly, it follows from the Taylor expansion, Lemmas 1 and 2 that

n(θ^θ0)=A11ni=1n0τ{Qi(u,θ0)q¯(u,θ0)}dMi(u)+op(1).

Consequently, n(θ^θ0) behaves asymptotically as a scaled average of independent and identically distributed random vectors, which converges to a normal distribution with mean 0 and covariance A−1Σ(τ, τ)(A−1)T.

Proof of Theorem 2

Consistency

For simplicity, we introduce the following notation:

Sg(t,θ)=1ni=1nYi(t)g{γTZi(t)}Zi(t),Sh(t,θ)=1ni=1nYi(t)h{βTXi(t)}Xi(t),Sh(t,θ)=1ni=1nYi(t)h{βTXi(t)},S~(t,θ)=[0tSg(u,θ)Sh(u,θ)du0tSh(u,θ)Sh(u,θ)dμ^0(u,θ)].

Using the Uniform Strong Law of Large Numbers (Pollard 1990, p. 41), we can obtain that as n → ∞, supt∈[0,τ] supθ∈ΘSg(t, θ) − sg (t, θ)‖ →a.s. 0, so are the convergences of Sh (t, θ) and Sh(t, θ) to sh (t, θ) and sh(t, θ), respectively.

We make a simple decomposition:

{μ^0(t,θ^)μ0(t)}={μ^0(t,θ0)μ0(t)}+{μ^0(t,θ^)μ^0(t,θ0)}

and then some simple calculations entail that

μ^0(t,θ0)μ0(t)=0t1Sh(u,θ0)d1ni=1nMi(u),

which converges almost surely to zero, uniformly in t.

On the other hand, by the Taylor expansion,

{μ^0(t,θ^)μ^0(t,θ0)}={μ^0(t,θ0)θ}T(θ^θ0)+op(θ^θ0),

where

μ^0(t,θ)θ=[0tSg(u,θ)Sh(u,θ)du0tSh(u,θ)Sh(u,θ)dμ0(u)+0tSh(u,θ)[Sh(u,θ)]2d1ni=1nMi(u,θ)].

It can be shown that μ^0(t,θ0)θ converges almost surely to −(t, θ0), uniformly in t. Since θ̂ →a.s. θ0, the convergence of μ̂0(t, θ̂) to μ̂0(t, θ0) is almost sure, uniformly in t. Hence, it is obvious that μ̂0(t, θ̂) converges almost surely to μ0(t), uniformly in t.

Weak convergence

With respect to the weak convergence of μ̂0(t, θ̂), we display the following asymptotic approximation:

n{μ^0(t,θ^)μ0(t)}=0t1Sh(u,θ0)d1ni=1nMi(u)+{μ^0(t,θ0)θ}Tn(θ^θ0)+op(nθ^θ0)=1ni=1nΨi(t)+op(1),

where op(1) is uniformly in t. Thus, the convergence of n{μ^0(t,θ^)μ0(t)} in finite dimensional distributions follows from the multivariate central limit theorem. Tightness can be shown by demonstrating the manageability of the components of Ψi(t). Hence, we prove that n{μ^0(t,θ^)μ0(t)} converges weakly to a zero-mean Gaussian process with covariance function ξ(s, t). Then the required weak convergence of n{μ~0(t,θ^)μ0(t)} follows directly. Furthermore, the consistency of the covariance function estimator follows from the consistency of θ̂ and μ̂0(t, θ̂), and repeated applications of the Uniform SLLN.

Contributor Information

Yanyan Liu, School of Mathematics and Statistics, Wuhan University, Wuhan, Hubei 430072, China.

Yuanshan Wu, School of Mathematics and Statistics, Wuhan University, Wuhan, Hubei 430072, China; Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7420, USA.

Jianwen Cai, Email: cai@bios.unc.edu, Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7420, USA.

Haibo Zhou, Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7420, USA.

References

  1. Aalen OO. A model for nonparametric regression analysis of counting processes. In: Klonecki N, Kosek A, Rosinski J, editors. Lecture notes in statistics. vol 2. New York: Springer; 1980. pp. 1–25. [Google Scholar]
  2. Aalen OO. A linear regression model for the analysis of the life times. Stat Med. 1989;8:907–925. doi: 10.1002/sim.4780080803. [DOI] [PubMed] [Google Scholar]
  3. Andersen PK, Gill RD. Cox’s regression model for counting processes: a large sample study. Ann Stat. 1982;10:1100–1120. [Google Scholar]
  4. Andersen PK, Borgan O, Gill RD, Keiding N. Statistical models based on counting processes. New York: Springer-Verlag; 1993. [Google Scholar]
  5. Bilias Y, Gu M, Ying Z. Towards a general asymptotic theory for Cox model with staggered entry. Ann Stat. 1997;25:662–682. [Google Scholar]
  6. Breslow NE, Day NE. The design and analysis of case-control studies. Lyon: IARC; 1980. Statistical methods in cancer research 1. [PubMed] [Google Scholar]
  7. Breslow NE, Day NE. The design and analysis of cohort studies. Lyon: IARC; 1987. Statistical methods in cancer research 2. [PubMed] [Google Scholar]
  8. Buckley JD. Additive and multiplicative models for relative survival rates. Biometrics. 1984;40:51–62. [PubMed] [Google Scholar]
  9. Cai J, Prentice RL. Estimating equations for hazard ratio parameters based on correlated failure time data. Biometrika. 1995;82:151–164. [Google Scholar]
  10. Cai J, Schaubel DE. Marginal means/rates models for multiple type recurrent event data. Lifetime Data Anal. 2004;10:121–138. doi: 10.1023/b:lida.0000030199.23383.45. [DOI] [PubMed] [Google Scholar]
  11. Chang SH, Wang MC. Conditional regression analysis for recurrence time data. J Am Stat Assoc. 1999;94:1221–1230. [Google Scholar]
  12. Cook RJ, Lawless JF. Marginal analysis of recurrent events and a terminating event. StatMed. 1997;16:911–924. doi: 10.1002/(sici)1097-0258(19970430)16:8<911::aid-sim544>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]
  13. Cox DR, Oakes D. Analysis of survival data. London: Chapman and Hall; 1984. [Google Scholar]
  14. Foutz RV. On the unique consistent solution to the likelihood equations. J Am Stat Assoc. 1977;72:147–148. [Google Scholar]
  15. Ghosh D, Lin DY. Marginal regression methods for recurrent and terminal events. Stat Sin. 2002;12:663–688. [Google Scholar]
  16. Ghosh D, Lin DY. Semiparametric analysis of recurrent events data in the presence of dependent censoring. Biometrics. 2003;59:877–885. doi: 10.1111/j.0006-341x.2003.00102.x. [DOI] [PubMed] [Google Scholar]
  17. Huang CY, Wang MC. Joint modeling and estimation of recurrent event processes and failure time data. J Am Stat Assoc. 2004;99:1153–1165. doi: 10.1198/016214504000001033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Huffer FW, McKeague IW. Weighted least squares estimation for Aalen’s additive risk model. J Am Stat Assoc. 1991;86:114–129. [Google Scholar]
  19. Jacobsen M. Existence and unicity of MLEs in discrete exponential family distributions. Scand J Stat. 1989;16:335–349. [Google Scholar]
  20. Kalbfleisch JD, Prentice RL. The statistical analysis of failure time data. New York: Wiley; 2002. [Google Scholar]
  21. Lawless JF, Nadeau C. Some simple robust methods for the analysis of recurrent events. Technometrics. 1995;37:158–168. [Google Scholar]
  22. Lee EW, Wei LJ, Amato DA. Cox-type regression analysis for large numbers of small groups of correlated failure time observations. In: Klein JP, Goel PK, editors. Survival analysis: state of the art. Dordrecht: Kluwer Academic; 1992. pp. 237–247. [Google Scholar]
  23. Li Q, Lagakos S. Use of the Wei-Lin-Weissfeld method for the analysis of a recurring and a terminating event. Stat Med. 1997;16:925–940. doi: 10.1002/(sici)1097-0258(19970430)16:8<925::aid-sim545>3.0.co;2-2. [DOI] [PubMed] [Google Scholar]
  24. Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73:13–22. [Google Scholar]
  25. Lin DY, Wei LJ, Yang I, Ying Z. Semiparametric regression for the mean and rate functions of recurrent events. J R Stat Soc B. 2000;62:711–730. [Google Scholar]
  26. Lin DY, Wei LJ, Ying Z. Semiparametric transformation models for point processes. J Am Stat Assoc. 2001;96:620–628. [Google Scholar]
  27. Lin DY, Ying Z. Semiparametric analysis of the additive risk model. Biometrika. 1994;81:61–71. [Google Scholar]
  28. Lin DY, Ying Z. Semiparametric analysis of general additive-multiplicative hazard models for counting processes. Ann Stat. 1995;23:1712–1734. [Google Scholar]
  29. Liu L, Wolfe RA, Huang X. Shared frailty models for recurrent events and a terminal event. Biometrics. 2004;60:747–756. doi: 10.1111/j.0006-341X.2004.00225.x. [DOI] [PubMed] [Google Scholar]
  30. Miloslavsky M, Keles S, van der Laan MJ, Butler S. Recurrent events analysis in the presence of time-dependent covariates and dependent censoring. J R Stat Soc B. 2004;66:239–257. [Google Scholar]
  31. Nielsen GG, Gill RD, Andersen PK, Sorensen TIA. A counting process approach to maximum likelihood estimation in frailty models. Scand J Stat. 1992;19:25–44. [Google Scholar]
  32. Oakes D. Frailty models for multiple event times. In: Klein JP, Goel PK, editors. Survival analysis: state of the art. Dordrecht: Kluwer Academic; 1992. pp. 371–379. [Google Scholar]
  33. Pepe MS, Cai J. Some graphical displays and marginal regression analyses for recurrent failure times and time-dependent covariates. J Am Stat Assoc. 1993;88:811–820. [Google Scholar]
  34. Pollard D. Empirical processes: theory and applications. Hayward: Institute of Mathematical Statistics; 1990. [Google Scholar]
  35. Prentice RL, Williams BJ, Peterson AV. On the regression analysis of multivariate failure time data. Biometrika. 1981;68:373–379. [Google Scholar]
  36. Ross SM. Simulation. 4th edn. New York: Academic Press; 2006. [Google Scholar]
  37. Rudin W. Principles of mathematical analysis. New York: McGraw-Hill; 1964. [Google Scholar]
  38. Schaubel DE, Zeng D, Cai J. Asemiparametric additive rates model for recurrent event data. Lifetime Data Anal. 2006;12:389–406. doi: 10.1007/s10985-006-9017-x. [DOI] [PubMed] [Google Scholar]
  39. Therneau TM, Grambsch PM. Modeling survival data: extending the Cox model. New York: Springer; 2000. [Google Scholar]
  40. Wang MC, Qin J, Chiang CT. Analyzing recurrent event data with informative censoring. J Am Stat Assoc. 2001;96:1057–1065. doi: 10.1198/016214501753209031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Wei LJ, Lin DY, Weissfeld L. Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. J Am Stat Assoc. 1989;84:1065–1073. [Google Scholar]
  42. Ye Y, Kalbfleisch JD, Schaubel DE. Semiparametric analysis of correlated recurrent and terminal events. Biometrics. 2007;63:78–87. doi: 10.1111/j.1541-0420.2006.00677.x. [DOI] [PubMed] [Google Scholar]
  43. Zeng D, Lin DY. Semiparametric transformation models with random effects for recurrent events. J Am Stat Assoc. 2007;102:167–180. [Google Scholar]

RESOURCES