Abstract
This paper deals with the analysis of recurrent event data subject to censored observation. Using a suitable adaptation of generalized estimating equations for longitudinal data, we propose a straightforward methodology for estimating the parameters indexing the conditional means and variances of the process interevent (i.e. gap) times. The proposed methodology permits the use of both time-fixed and time-varying covariates, as well as transformations of the gap times, creating a flexible and useful class of methods for analyzing gap-time data. Censoring is dealt with by imposing a parametric assumption on the censored gap times, and extensive simulation results demonstrate the relative robustness of parameter estimates even when this parametric assumption is incorrect. A suitable large-sample theory is developed. Finally, we use our methods to analyze data from a randomized trial of asthma prevention in young children.
Keywords: Asthma, Censoring, Generalized estimating equation, Intensity model, Longitudinal data, Marginal model
1. INTRODUCTION
In longitudinal studies, each subject may experience several consecutive events of the same basic type. Such “recurrent event” outcome data now constitute a heavily studied area in statistics, with applications ranging from economics to engineering to biomedicine. Examples common in medicine and public health applications include recurrent infections and other diseases, hospitalizations, and seizures. The development of useful regression models for recurrent outcome data is therefore a problem of significant practical and methodological interest. Beginning with the extension of the proportional hazards regression model of Cox (1972) to the case of multivariate counting processes by Andersen and Gill (1982), a significant literature on this topic has developed over the past 25 years. In general, methods for analyzing recurrent event data can be cross-classified into one of the 4 categories determined by the following: (i) the choice of “calendar” versus “gap” times as the fundamental temporal scale and (ii) the use of “marginal” versus “intensity” models for analyzing the data. Several classes of marginal and intensity models have been proposed for analyzing recurrent event outcomes on each timescale, and an extensive, contemporary review of existing methods is available in Cook and Lawless (2007).
When the events are all considered to be of the same type, the gap timescale is arguably the most natural and informative timescale for analysis. Examples of marginal models developed for the gap-time setting include Chang and Wang (1999), Chang (2004), Chen and Wang (2004), Huang (2002), Huang and Chen (2004), Lin and others (1999), and Prentice and others (1981). Gap-time-focused intensity models are considered in Aalen and Husebye (1991), Oakes and Cui (1994), Peña and others (2001), Duchateau and others (2003), and Strawderman (2005), Strawderman (2006), among others. In an interesting paper, Murphy and others (1995) propose a methodology that is difficult to wholly classify into a single category. Specifically, Murphy and others (1995) introduce a variant of the estimating equations considered in Murphy and Li (1995) in order to develop a model appropriate for describing the conditional mean length of women's menstrual cycles. In this model, the mean of the current cycle is allowed to depend on aspects of past cycle behavior and/or other time-fixed and time-dependent covariates. The methodology developed in Murphy and others (1995) forms a starting point for developing a large and useful class of methods for conducting gap-time analyses that relaxes the stringent restrictions imposed by simpler marginal models while avoiding the need to fully specify how the probability of subsequent recurrence depends on the prior event and covariate histories.
This paper expands upon the work of Murphy and others (1995) in several useful ways, including making allowances for transformations of gap times, providing clarification of the censoring conditions under which the proposed estimating equations are unbiased, correcting 2 errors in the original paper, further illuminating the connections to generalized estimating equations (GEEs), and the provision of an appropriate large-sample theory. The paper will proceed as follows. The proposed methodology is first developed in Sections 2.1–2.3, and Sections 2.4 and 2.5 then, respectively, establish the connections to the methodology originally proposed in Murphy and others (1995) and GEE. Section 3 contains an expanded version of the simulation study summarized in Tables II and III of Murphy and others (1995). In Section 4, the proposed methodology is used to reanalyze the asthma prevention trial data of Duchateau and others (2003). We close the paper in Section 5 with a discussion of several interesting directions for further study. Technical details are provided in an appendix; the supplementary material available at Biostatistics online contains further results and details that are referenced but not included in the main article.
2. METHODOLOGY
2.1. Notation and model
For simplicity, we introduce the notation for just one subject; the inclusion of an additional subscript permits immediate extension to the case of multiple subjects, as will be required in Section 2.3. We assume that the time origin for analysis is , with subsequent events occurring at times until observation terminates at an observed time C > 0. It is assumed that S1 represents a complete observation time, thereby covering those cases in which observation begins with the occurrence of an event. In settings where observation starts in between 2 events, S0 is taken to represent the time of the first event subsequent to the start of observation. Despite a mild loss of information, such a convention does not cause bias provided that the decision to delete this initial time period is made independently of its length (e.g Aalen and Husebye 1991; Murphy and others, 1995).
Let count the number of events up to and including time u, and let . The observed data on this subject are assumed to take the form
| (2.1) |
where
j denotes the covariate information available at time for and may include baseline covariates, covariates measured at or before each event time, and summaries of the past event and covariate history. It is not assumed that
j necessarily captures the full covariate or event history up to time . Let denote the jth gap time and define where is a specified monotone nondecreasing transformation. Also, let
denote the cumulative information concerning the event and covariate histories assumed available through time ; note that for .
The fundamental modeling assumption of this paper is
| (2.2) |
where and are known scalar functions of the parameter vector θ and . Importantly, these means and variances are defined conditionally upon and are therefore independent of censoring information. Further assumptions are needed in order to properly deal with the presence of censoring; see, for example, Sections 2.2 and 2.3 as well as conditions (A0) and (A1) of the Appendix. Three examples of interesting model choices include the following:
| (2.3) |
| (2.4) |
| (2.5) |
Murphy and others (1995) consider the model (2.3), allowing for the possibility of a general variance function . Models (2.4) and (2.5) provide useful generalizations of the accelerated gap times (AGT) model proposed in Strawderman (2005). The AGT model assumes that the gap times of the recurrent event process satisfy , where are independent and identically distributed with a distribution independent of θ and μ(θ) accelerates or decelerates the baseline gap times for a subject based on the values of time-independent covariates. Assuming that , model (2.4) is observed to be a direct generalization of this model upon taking , and for . Taking logs, the alternative model is obtained, and (2.5) evidently covers this form of the AGT model with , , and for .
2.2. Estimation with “full” and observed data: single subject
For convenience, define for j ≥ 1 the following notation:
| (2.6) |
For simplicity, we make no distinction between and the data-generating parameter η0 throughout this section. Under (2.2), it follows that and for each j ≥ 1. A naïve approach to the estimation of η might therefore begin with consideration of the estimating equations
![]() |
where bj(η) is a scalar weight function satisfying for each . However, these estimating equations generally fail to be unbiased because each utilizes only the complete gap times , each of which satisfies
Let
denote the observed data (2.1), augmented with the additional information on the first event time SN + 1 following time C. Since SN + 1 is not generally observable, one may view
as a suitable representation of “full data” in this setting. Consider the pair of
-dependent estimating equations (cf. Murphy and Li, 1995)
![]() |
(2.7) |
Theorem 2.1, proved in Section S.1.1 of the supplementary material available at Biostatistics online, shows that the
-dependent estimating equations (2.7) are unbiased under condition (A0) of the Appendix.
THEOREM 2.1
Under condition (A0), the estimating equations (2.7) are unbiased.
While unbiased, the estimating equations (2.7) cannot be used directly for estimating η from the observed data (2.1) because each depends on information not available under (2.1). Similarly to Murphy and others (1995), one starting point for developing practically useful estimating equations is to project (2.7) onto the observed data:
![]() |
(2.8) |
and
![]() |
(2.9) |
Using iterated expectation, an easy calculation shows that the
-dependent estimating equations (2.8) and (2.9) remain unbiased under the conditions of Theorem 2.1.
Defining and for , the expectations appearing on the right-hand side of (2.8) and (2.9) can be rewritten as follows:
![]() |
where is considered fixed in each expression.
The dependence of on implies that represents missing data under (2.1). The projections (2.8) and (2.9) therefore correspond to using an obvious form of conditional imputation and further modeling assumptions that permit computation of these conditional expectations are needed. Under condition (A0), and hence may be considered missing at random (MAR; e.g. Little and Rubin, 2002), and immediate progress is possible under a parametric specification for the conditional distribution . Noting that is a sequence of dependent, standardized (i.e. mean 0, variance 1) random variables, we propose to proceed under the simplifying assumption that is distributed according to a fully specified parametric distribution having mean 0 and variance 1. This immediately implies that
| (2.10) |
where
| (2.11) |
Under (2.10) and (2.11), (2.8) and (2.9) reduce to
![]() |
(2.12) |
and
![]() |
(2.13) |
For example, with
whereas the choice for leads to and Each of (2.12) and (2.13) has mean 0 provided that (2.10) holds and is correctly specified.
The moment assumptions on are natural in view of the mean–variance specification of the model. In addition, this parsimonious model facilitates straightforward implementation and justification of inference procedures, as will be seen in Section 2.3. We emphasize here that the parametric specification of is only introduced for the purposes of dealing with the censored time . We have not assumed that each member of the sequence has distribution ; in addition, we have not introduced any assumptions that impose a fully parametric dependence structure on . The easiest way to see this is to note that such distributional assumptions are neither needed nor used in the development of (2.7); see also Murphy and Li (1995) for related results and discussion.
REMARK:
One may write where W is a random variable with distribution function and is the corresponding mean residual life function. An immediate consequence of this relationship is that one cannot specify a valid parametric model for without also specifying a valid parametric model for e.g. (Cox, 1962; Oakes and Dasu, 1990; kotz and Shanbhag, 1980).
2.3. Estimation and inference with observed data: n subjects
Suppose there are data available on n independent subjects, say
i, i = 1, …, n, where
is the data (2.1) on the ith subject. Then, (2.12) and (2.13) immediately generalize, yielding the pair of estimating equations
![]() |
(2.14) |
and
![]() |
(2.15) |
where
Assuming that (2.10) holds and has been correctly specified, (2.14) and (2.15) together form a collection of unbiased estimating equations for η. In order to solve these equations and expect to obtain a unique solution
for finite n, smoothness assumptions on are required. Conditions (A3)–(A5) of the Appendix impose sufficient smoothness restrictions on ; see the Appendix and Section S.1.4 of the supplementary material available at Biostatistics online for further details and discussion.
Let then, we may write
| (2.16) |
where ψ(η,
i) = (ψ1(η,
i)T, ψ2(η,
i))T is a vector of known functions of η and
i, i = 1, …,n. Define S(η) = Eη0[ψ(η,
1)] and
Theorems 2.2 and 2.3 show that
is both consistent and asymptotically normal as ; see the Appendix for the statement of regularity conditions and further details on proof.
THEOREM 2.2
Under conditions (A0)–(A4) and as there exists a sequence
and a unique such that
with probability going to 1, and
THEOREM 2.3
Under conditions (A0)–(A7), the sequence
is asymptotically normal with mean 0 and covariance matrix
(2.17) It further follows that one can consistently estimate
via
(2.18)
REMARK:
The asymptotic results for
rely on the assumption that (2.10) holds with correctly specified. As pointed out earlier in Section 2.2, the parametric imputation assumption (2.10) has been introduced in order to deal with the censoring of . Other models for imputation, as well as methods for handling the missing data problem, are possible. However, under a MAR specification, all such methods rely on further modeling assumptions and, similarly to the problem of misspecifying , incorrect specifications create bias. We refer the reader to Section 5 for further discussion.
2.4. Connections to Murphy and others
Similarly to Section 2.3, Murphy and others (1995) focus on estimating η from the observed data
i, i = 1, …, n, when suggesting an adaptation of the expectation–maximization algorithm of Dempster and others (1977) for use with estimating equations, an idea recently explored in greater generality by Elashoff and Ryan (2004). Specifically, for each and given current estimates
and
of θ and σ, Murphy and others (1995) suggest imputing
| (2.19) |
where , are independent and identically distributed mean 0, variance 1 random variables. These B variables are then used to compute both
and 
![]() |
The estimates
and
are then updated according to the following procedure. Expressed in our notation,
is first computed by solving
![]() |
As indicated in the Appendix of Murphy and others (1995), one then computes
![]() |
(2.20) |
This iteration continues until the relative change in each estimated model parameter is small.
We now illuminate the connections to the methodology summarized in Section 2.3, as well as an important problem with the methodology described above. The “E-type” step described above involves computing Monte Carlo approximations to the conditional cumulants and evaluated at the current parameter values
and
Under the imputation assumption (2.19), it is apparent that is a Monte Carlo approximation to , , where is defined in (2.11) and . The use of (2.14) in place of the Monte Carlo approximation used by Murphy and others (1995) reduces computational demands and leads to a stable estimation procedure independent of Monte Carlo error. The use of (2.15) corrects a fundamental error in Murphy and others (1995). Specifically, as shown in the supplementary material available at Biostatistics online, the estimator (2.20) is biased. Moreover, the degree of bias increases with fewer complete observations per subject because the censored cycles contribute an increased proportion of the information to the estimating equation.
REMARK:
A less subtle error corrected by this paper involves the variance estimate (2.18). The corresponding estimate proposed in Murphy and others (Appendix 1995) assumes independence of the gap times within subjects and is therefore inconsistent when this assumption fails.
2.5. Connections with GEE
Similarly to Section 2.2, define
Then, assuming
i, i = 1, … , n, represent the available data and also that subjects are independent of each other, the results of Theorem 2.1 imply that
![]() |
(2.21) |
and
![]() |
(2.22) |
form a system of unbiased (full data) estimating equations for η. Under the same assumptions leading to (2.14) and (2.15), it follows that the projections of (2.21) and (2.22) onto the observed data
i, i = 1 … n, reproduce (2.14) and (2.15).
The estimating equations (2.21) and (2.22) represent a particular example of a full data GEE system. Specifically, define for , the matrices
In addition, let denote the identity matrix of dimension of . Then, one may write as , where is a vector with elements . The correspondence between (2.21) and a GEE system is now evident, the use of in further imposing a “working independence” correlation structure on (cf. Molenberghs and Verbeke, 2005, Section 8.2). A similar construction is possible for (2.22); moreover, (2.21) and (2.22) together form a particular example of a GEE2 system that imposes a block diagonal covariance structure on (cf. Molenberghs and Verbeke, 2005, Section 8.5). In the supplementary material available at Biostatistics online, we consider the use of alternative working correlation structures, demonstrating in particular that valid structures must respect the conditional specification of the model in order for (2.21) and (2.22) to remain unbiased.
3. SIMULATIONS
The work of Murphy and others (1995) was motivated by the analysis of menstrual cycle patterns. More specifically, the authors were interested in developing insight into the relationship between cycle length and covariates such as location, body mass index (BMI), and age. Murphy and others (1995) also carried out a small simulation study modeled after these data in order to evaluate the robustness of their methods to assumptions regarding the nature of the censored cycle length. The following conditional mean and variance specifications were, respectively, used in both the analysis and the simulation study: where
![]() |
(3.1) |
and and where
| (3.2) |
The use of (3.1) and (3.2) evidently corresponds to a special case of (2.3). Assuming an equal number of cycles per woman, the proposed mean function corresponds to a simple linear mixed effects model using woman as a random effect. The parameter ρ can thus be interpreted as an intrawoman correlation coefficient, with denoting that past cycle lengths are not useful in predicting current cycle length.
In this section, we consider an expanded version of the simulations considered in Murphy and others (1995). In addition to allowing for the possibility of a misspecified censored cycle length distribution, we consider the impact of both varying and shorter observation periods (i.e. fewer observed events), varying number of subjects, and 2 specifications of . More specifically, cycle data on either 50 or 200 independent women are simulated by generating cycle times according to the model , , , where and are independent, identically distributed observations from either a standard normal density or a shifted exponential density with mean 0 and unit variance. As in Murphy and others (1995), we assume that is specified according to the following trivial modification of (3.1):
![]() |
(3.3) |
where , , and . The single time-varying covariate
, where is assumed to decrease linearly from 22 kg/m2 on day 1 to 20 kg/m2 on day 195, increase linearly back to 21 on day 225, and then remain constant thereafter. As in Murphy and others (1995), we consider the specification (3.2) for in conjunction with ; in addition, we also consider the specification where is given in (3.3) and . These 2 choices of correspond to the model specifications (2.3) and (2.4); our choices of approximately equalize the variances of the average gap times across the 2 settings. Finally, we assume the observation period for subject i is , where and is normally distributed. Four possible settings are considered, with set to either 125 or 225 days and Var set to either 0 or 50, respectively. The average number of events per subject under an observation period with () is approximately 3.9 (7.4). All simulations were run using , a choice that corresponds to the use of generalized least squares.
Tables 1 and 2, respectively, summarize the results for and a comparison of these tables demonstrates the impact of the expected number of events. The remaining simulation results are summarized in Tables 5–10 of the supplementary material available at Biostatistics online. The top panel of Table 1 corresponds to the simulation summarized in Table II of Murphy and others (1995). Each table corresponds to one combination of censoring distribution and sample size and summarizes the results for the 4 possible combinations of true and assumed error distributions for each choice of variance function (i.e. ). In the tables, the absolute relative bias () and empirical standard errors (ESEs) are, respectively, the average and standard deviation computed from 1000 simulated data sets of the indicated sample size. The average estimated asymptotic standard errors (ASE) are obtained by averaging the square root of the diagonal of (2.18) over the 1000 simulated data sets. Relative bias is reported because the magnitude of differs greatly across the top and bottom halves of each table.
Table 1.
Simulation Results for n = 50, Ci ∼ N(225, 0)
| Expected number of events ≐ 7.4 |
True F0 |
|||||||
| Model | Imputed F0 | Parameter | Normal |
Exponential |
||||
| |rBias| | ESE | ASE | |rBias| | ESE | ASE | |||
| μij(θ) = (3.3) | Normal | γ0 | 0.003 | 0.193 | 0.188 | 0.009 | 0.189 | 0.191 |
| Vij(θ) = (3.2) | γ1 | 0.014 | 0.276 | 0.272 | 0.034 | 0.267 | 0.260 | |
| ρ | 0.113 | 0.032 | 0.032 | 0.007 | 0.034 | 0.032 | ||
| σ2 | 0.004 | 0.865 | 0.848 | 0.035 | 1.524 | 1.486 | ||
| Exponential | γ0 | 0.006 | 0.189 | 0.187 | 0.010 | 0.187 | 0.190 | |
| γ1 | 0.006 | 0.285 | 0.272 | 0.013 | 0.269 | 0.261 | ||
| ρ | 0.193 | 0.034 | 0.032 | 0.023 | 0.034 | 0.032 | ||
| σ2 | 0.025 | 0.931 | 0.879 | 0.003 | 1.655 | 1.561 | ||
| μij(θ) = (3.3) | Normal | γ0 | 0.023 | 0.194 | 0.189 | 0.013 | 0.193 | 0.191 |
| Vij(θ) = |(3.3)| | γ1 | 0.035 | 0.278 | 0.274 | 0.006 | 0.263 | 0.264 | |
| ρ | 0.098 | 0.033 | 0.032 | 0.036 | 0.035 | 0.032 | ||
| σ2 | 0.008 | 0.001 | 0.001 | 0.031 | 0.002 | 0.002 | ||
| Exponential | γ0 | 0.007 | 0.198 | 0.189 | 0.007 | 0.196 | 0.189 | |
| γ1 | 0.010 | 0.295 | 0.275 | 0.012 | 0.272 | 0.264 | ||
| ρ | 0.105 | 0.034 | 0.032 | 0.136 | 0.035 | 0.032 | ||
| σ2 | 0.015 | 0.001 | 0.001 | 0.003 | 0.002 | 0.002 | ||
Table 2.
Simulation Results for n = 50, Ci ∼ N(125, 0)
| Expected number of events ≐ 3.9 |
True F0 |
|||||||
| Model | Imputed F0 | Parameter | Normal |
Exponential |
||||
| |rBias| | ESE | ASE | |rBias| | ESE | ASE | |||
| μij(θ) = (3.3) | Normal | γ0 | 0.075 | 0.469 | 0.463 | 0.012 | 0.455 | 0.458 |
| Vij(θ) = (3.2) | γ1 | 0.152 | 0.713 | 0.701 | 0.011 | 0.664 | 0.687 | |
| ρ | 0.069 | 0.058 | 0.057 | 0.066 | 0.062 | 0.061 | ||
| σ2 | 0.004 | 1.206 | 1.236 | 0.037 | 2.173 | 1.985 | ||
| Exponential | γ0 | 0.025 | 0.489 | 0.475 | 0.017 | 0.489 | 0.474 | |
| γ1 | 0.078 | 0.736 | 0.706 | 0.005 | 0.708 | 0.710 | ||
| ρ | 0.104 | 0.060 | 0.058 | 0.024 | 0.062 | 0.061 | ||
| σ2 | 0.015 | 1.331 | 1.274 | 0.006 | 2.247 | 2.136 | ||
| μij(θ) = (3.3) | Normal | γ0 | 0.044 | 0.475 | 0.466 | 0.032 | 0.471 | 0.460 |
| Vij(θ) = |(3.3)| | γ1 | 0.094 | 0.727 | 0.702 | 0.067 | 0.702 | 0.688 | |
| ρ | 0.092 | 0.064 | 0.059 | 0.182 | 0.067 | 0.061 | ||
| σ2 | 0.014 | 0.001 | 0.001 | 0.047 | 0.002 | 0.002 | ||
| Exponential | γ0 | 0.028 | 0.472 | 0.470 | 0.031 | 0.463 | 0.474 | |
| γ1 | 0.060 | 0.720 | 0.698 | 0.046 | 0.698 | 0.701 | ||
| ρ | 0.172 | 0.064 | 0.058 | 0.047 | 0.068 | 0.063 | ||
| σ2 | 0.008 | 0.001 | 0.001 | 0.015 | 0.003 | 0.002 | ||
In general, the simulation results demonstrate that the model specifications (2.3) and (2.4) produce estimates of θ and with comparable relative biases. The standard errors for
are also comparable across models and ASE provides an acceptable approximation to ESE in all cases. While a strong effect of misspecifying is absent, the impact of doing so on ρ and does become more apparent when the sample size is increased to . In addition, comparing the results for Tables 7 and 9 of the supplementary material available at Biostatistics online, one can additionally see that biases increase under both incorrect and correct model misspecifications when the average number of expected events decreases. In general, though, the relative biases remain modest in most cases and the signs of all estimated parameters are also correct (results not shown).
Other interesting patterns also arise in these tables. For example, the bias of ρ is typically the largest, especially so when the true is normally distributed. In addition, the sign of the bias is frequently negative (results not shown), indicating that ρ is often underestimated. However, as expected, this bias drops dramatically with an increase in sample size. Comparing results for versus , we further observe that a decrease in the number of complete times generally leads to substantially increased standard errors, as might be expected. Interestingly, with a mean censoring time of 125, the use of a nonconstant censoring time often leads to lower standard errors in comparison with a fixed censoring time; however, with a mean censoring time of 225, the situation is reversed. The exact reasons for this change in behavior are unclear, though may have something to do with the fact that the BMI variable changes from a decreasing function at 195 days, a time that lies in between these 2 censoring times.
Finally, we remark that Table 1 demonstrates a substantial increase in the standard error of
in comparison with Table II in Murphy and others (1995). We repeated the simulation corresponding to Table 1, replacing the analytical computation (2.11) with the Monte Carlo approximation described in Section 2.4 using . The empirical and estimated asymptotic standard errors were very similar to those in Table 1; hence, such a large discrepancy in the estimated standard error of
almost certainly reflects the use of an incorrect standard error formula by Murphy and others (1995), as discussed earlier in Section 2.4.
4. DATA ANALYSIS: RECURRENT ASTHMA IN CHILDREN
In this section, we use our methodology to reanalyze patterns of recurrent asthma events occurring in young children. Briefly, 232 children aged 6 months at high risk of experiencing an asthmatic event, but who have not yet done so, were randomized to receive either drug or placebo and then followed for up to 18 months. The data are available from the data archive at http://blackwellpublishing.com/rss. Duchateau and others (2003, Table 1) analyze these data using various frailty models and timescales for recurrent event counting processes and find a statistically significant treatment effect. Such a difference seems apparent in Figure 2 (supplementary material available at Biostatistics online), which, respectively, summarizes the number of asthmatic events experienced in the drug and placebo groups over the course of the trial.
The gap times of particular interest in this study are the “asthma-free” periods, namely, (i) the time elapsed between randomization and the beginning of the first asthmatic episode and (ii) the time elapsed between the end of one asthmatic episode and the beginning of the next attack. A complicating factor in this analysis is the fact that a child is not considered to be at risk for another asthmatic event until the current episode ends, with such periods possibly lasting several days. However, with a median length of 4 days and 95% of the asthmatic events having durations less than 20 days, such times are generally short in comparison with the asthma-free periods (median of 39 days, with 95% of the times less than 430 days). For the purposes of this analysis, we therefore focus on the asthma-free gap times, accounting for the potential impact of asthmatic episodes through covariate adjustment. A secondary analysis, conducted by redefining the gap times of interest as the time elapsed between the start of each asthma-free period, led to the same qualitative and similar quantitative conclusions (results not shown).
In addition to evaluating the treatment effect using various frailty models, Duchateau and others (2003) comment that there is interest in “the evolution of the asthma recurrent event rate over time,” “how the appearance of an event influences the event rate,” and “how the asthma event rate changes with age.” As suggested in Aalen and others (2004), heterogeneity (i.e. frailty) can sometimes be accounted for using covariate information that changes over the course of observation. Thus, in the context of modeling gap-time data as proposed here, one might investigate how the average length of the current asthma-free episode depends on treatment, the occurrence and length of prior asthmatic and asthma-free episodes, and child age. Due to a significant right skew in the complete gap times, we model the conditional mean of (i.e. ), as in (2.5). Two models, described later, are fit assuming depends on some function of the covariates
, where ,
(i.e. “0” for the first event, 1 otherwise),
is the length of the most recent asthmatic episode (
is the length of the most recent asthma-free episode (
), and
is the age of the child (in days) at the beginning of the jth asthma-free period. To account for the possibility that the s might be heavy tailed, the cumulative distribution function is chosen as a standardized t with 3 degrees of freedom. The use of results in no qualitative and minimal quantitative changes (results not shown). The results, reported in Tables 3 and 4, use in general, selecting leads to nearly identical answers for the regression coefficients θ but rather different estimates of . The reported standard errors (in brackets) are based on (2.18).
Table 3.
Estimated regression coefficients and standard errors, Models 1 and 2
| θ0 | θ1 | θ2 | θ3 | θ4 | θ5 | θ6 | θ7 | |
| Model 1a | 4.874 | – 0.758 | – 0.012 | 0.846 | – 0.204 | – 0.846 | ||
| (0.163) | (0.281) | (0.155) | (0.188) | (0.180) | (0.234) | |||
| Model 1b | 4.404 | – 0.771 | 0.312 | 0.748 | – 0.434 | – 0.230 | ||
| (0.194) | (0.382) | (0.139) | (0.287) | (0.171) | (0.144) | |||
| Model 2 | 4.420 | 0.437 | – 0.747 | 0.313 | – 0.449 | 0.766 | – 0.325 | – 0.457 |
| (0.161) | (0.175) | (0.257) | (0.134) | (0.202) | (0.185) | (0.125) | (0.143) |
Table 4.
Estimated regression coefficients and standard errors, Model 3
| θ0 | θ1 | θ2 | θ′3 | θ′4 | θ′5 | θ′6 | |
| Model 3 | 4.376 | 0.529 | – 1.263 | 0.121 | – 0.258 | 0.369 | – 0.778 |
| (0.162) | (0.167) | (0.349) | (0.071) | (0.107) | (0.063) | (0.136) |
Define
and
to, respectively, represent the lengths of the most recent asthmatic and asthma-free episodes as being above and below the sample median values. Also, let
and
denote indicators of a child's age to be 1–1.5 years or greater than 1.5 years. Then, the 2 mean models summarized in Table 3 are described below:
.
In Model 2, the intercept has a useful interpretation as the average of the logarithm of the first gap time for subjects taking placebo. Notably, Model 1 is not fit directly; rather, this model is fit separately by treatment group in order to help illuminate important interactions with treatment, resulting in Models 1a (drug group) and 1b (control group). The intercepts in Models 1a and 1b, respectively, represent the mean log first gap times for subjects taking drug and placebo. Model 2 assumes a parametric form for the interaction between and
and is therefore a more restricted version of Models 1a and 1b.
Respectively, the results for under Model 1 and under Model 2 summarized in Table 3 indicate the presence of a treatment effect on the initial asthma-free episode, the average gap time being larger in the drug group. Through , both models also indicate that the mean gap time decreases after the occurrence of the first asthmatic episode, the magnitude of this effect varying little by treatment status. Qualitatively, these results are consistent with those of the gap-time-intensity models reported in Table 3 of Duchateau and others (2003, Models 2 and 3), where statistically significant differences by treatment and between the effects of the first and subsequent events are reported.
Together, Models 1a and 1b further suggest that the mean gap times for the drug and control groups tend to be considerably closer to each other for
= 1 (i.e. among children whose most recent asthmatic attack is on the longer side) than for
= 0. These patterns are present whether or not the effects corresponding to are included in the model (results not shown). Models 1a and 1b further suggest that a longer previous asthma-free episode tends to result in a longer current asthma-free episode, regardless of treatment. We also observe that the average length of asthma-free episodes tends to decrease with increasing age, with the drug having a relatively protective effect in younger children (i.e. less than 1.5 years old) that may start to wear off with increasing age.
Models 1a and 1b also demonstrate the existence of a modest level of interaction between certain patient history variables and treatment. Specifically, there are noticeable changes in (length of most recent asthmatic episode) and (children older than 1.5 years) and more minor changes in (length of most recent asthma-free episode) and (children aged 1–1.5 years). However, with the exception of , these effects only exhibit changes in magnitude not in direction. Therefore, Model 2 can be expected to provide a parsimonious description of the trends exhibited in Models 1a and 1b, as well as a more direct evaluation of the treatment effect. In fact, under Model 2, we observe that the treatment effect and its interaction with
are both statistically significant ( and , respectively).
Model 3, summarized in Table 4, is specified similarly to Model 2, except that continuous versions of certain covariates are used in order to investigate the impact of discretizing covariates. Let
if and
define
similarly. Also, let
be the log age of a child, centered at its minimum value of 182 days. Then, the mean function for Model 3 is the following:
The results of fitting this model reflect the same trends observed in Model 2.
Residual analysis for GEE models with longitudinal data, particularly in the presence of missing data, is not a well-developed field. In the current setting, one might consider using
that is, the estimated “standardized residuals” derived from the complete gap-time information. Figure 1 provides histograms of these quantities, respectively, obtained under Models 1–3. While such plots cannot be used to validate that the individual mean and variance functions have been well specified, the lack of unusually large standardized residuals is an indication that the model has done a reasonable overall job in describing the observed gap-time data. However, care must be exercised in the interpretation of such plots due to the presence of correlation between residuals and also the fact that
. The last result is a consequence of using residuals derived only from complete gap-time information and likely explains the mild left skew observed in each plot.
Fig. 1.
Standardized complete gap-time residuals for Models 1–3.
5. DISCUSSION
The use of GEE-type methods for analyzing recurrent event counting processes is now common. In contrast, the use of such methods for directly analyzing gap-time data has not been systematically investigated. The methodology proposed here extends and corrects methodology originally proposed in Murphy and others (1995) for the purposes of analyzing menstrual cycle data. The result is a simple yet flexible class of models for analyzing gap-time data. An especially attractive feature is the ability to specify rich models through mean and variance structures, leading to direct interpretability of regression effects on the gap times.
The main limitation of the proposed methodology is the reliance of (2.14) and (2.15) on the parametric model specified by (2.11) and (2.10). This parametric assumption is used for the sole purpose of dealing with censored gap-time information, and our simulation results also demonstrate a degree of robustness to the misspecification of However, for reasons explained earlier, both consistency and asymptotic normality of
do rely on the correct specification of the imputation model (2.10). Consequently, it is worthwhile to consider estimating η using alternative methods for handling the censored gap times.
For example, one might try and estimate the required conditional moments for the censored gap times under (2.10) without imposing a specific parametric model in (2.11). Let be an arbitrarily large fixed integer and define for and the probabilities
| (5.1) |
Assume that for and, in addition, that and . Under mild regularity conditions
![]() |
is a pointwise-consistent estimator of for . This estimator avoids the need to use a parametric specification for in (2.11) at the price of assuming that , for all standardized gap times. This latter assumption is stronger than that made in connection with (2.10), which is only imposed on the last incomplete gap time on each subject. In practice, use of this inverse-probability-of-censoring-weighted (IPCW) estimator also requires estimating and the censoring probabilities in (5.1). In the special case where censoring is completely independent of the event process and covariates, it is possible to consistently estimate the via
, where
is the empirical cumulative distribution function of the censoring times . More generally, a model for the censoring mechanism must be imposed; if this model is misspecified, bias can be expected for reasons analogous to the misspecification of . For these and other reasons (e.g. the need for uniform asymptotic results, proper methods of variance estimation), we have not investigated the utility of this estimator any further.
An alternative use of IPCW estimation is to construct direct analogs of (2.14) and (2.15). Specifically, one might proceed by estimating η using
![]() |
While such an approach obviates the need to impose assumptions (2.11) and (2.10), the avoidance of bias continues to require that one be able to correctly model and consistently estimate the censoring probabilities (5.1). In addition, because the information on censored gap times is no longer utilized directly, the efficiency of such an approach may suffer. This efficiency loss may be offset by including information on the censored gap times via augmented estimating equations (e.g. Rotnitzky and others, 1998; Scharfstein and others, 1999; Bang and Robins, 2005). We hope to explore the utility of this approach for analyzing gap-time data in subsequent work.
Finally, the methods developed in this paper can in principle be extended to multivariate recurrent event processes arising either as a result of having clustered data or due to the presence of multiple recurrent event outcomes on each subject. However, if the dependence structure between processes or especially between censored multivariate gap times must be modeled, the robustness of the present approach is likely to suffer and it may be advantageous to use a proper extension of the IPCW-type estimation scheme described above.
FUNDING
National Institutes of Health (R01 GM056182) to D.Y.C. and R.L.S.
SUPPLEMENTARY MATERIAL
Supplementary Material is available at http://www.biostatistics.oxfordjournals.org.
Acknowledgments
Conflict of interest: None declared.
Appendix
Regularity conditions sufficient for Theorems 2.1–2.3 to hold are summarized below:
(A0) The parameter lies in some compact subset , and the data-generating parameter is assumed to lie interior to The known transformation is monotone nondecreasing and bounded for . Subjects are independent and identically distributed. Noninformative censoring holds in the sense that for we have and for .
(A1) Assumption (2.10) holds, with correctly specified. In addition, , , and are each bounded and twice continuously differentiable for , .
(A2) is continuous for , and converges uniformly in probability to S(η) := Eη0[ψ(η,
1)] in some open neighborhood containing , where .(A3)
exists and is continuous for , and converges uniformly in probability to
in some open neighborhood containing .(A4) is nonsingular.
(A5) ψ(η,
1) satisfies the Lipschitz condition
, where and both lie in a neighborhood containing and
is a measurable, scalar-valued function with
.(A6)
.(A7)
.
Conditions (A0) and (A1) impose assumptions specific to the problem at hand, and conditions (A2)–(A7) are more general, consisting of a combination of regularity conditions taken from Yuan and Jennrich (1998) and van der Vaart (Section 5.3 1998) adapted to the current problem. Yuan and Jennrich (1998), extending results originally due to Foutz (1977), use (A2)–(A4) and the inverse function theorem to prove the consistency of a sequence of solutions obtained via an unbiased estimating equation. van der Vaart (Section 5.3 1998) uses (A4)–(A7) and the assumption that a consistent estimator exists in order to prove asymptotic normality under an i.i.d. sampling assumption. In particular, under conditions (A0)–(A7), the proofs of Theorems 2.2 and 2.3 are, respectively, direct consequences of Theorem 3 of Yuan and Jennrich (1998) and Theorem 5.21 of van der Vaart (1998). The remaining details are therefore omitted.
Condition (A1) says very little about the nature of , appearing in (2.14) and (2.15), and the requisite assumptions are embedded in (A2)–(A7). For example, the derivatives appearing in (A3)–(A5) involve the functions
. Using integration by parts and assuming that is continuously differentiable, we see that
| (A.1) |
for , where is the hazard function corresponding to . Thus, conditions (A3)–(A5) impose implicit smoothness assumptions on and . These sufficient conditions can be refined in a way that makes the required smoothness assumptions more transparent, and further details may be found in Section S.1.4 of the supplementary material available at Biostatistics online.
References
- Aalen O, Husebye E. Statistical analysis of repeated events forming renewal processes. Statistics in Medicine. 1991;10:1227–1240. doi: 10.1002/sim.4780100806. [DOI] [PubMed] [Google Scholar]
- Aalen OO, Fosen J, Weedon-Fekjær H, Borgan Ø, Husebye E. Dynamic analysis of multivariate failure time data. Biometrics. 2004;60:764–773. doi: 10.1111/j.0006-341X.2004.00227.x. [DOI] [PubMed] [Google Scholar]
- Andersen PK, Gill RD. Cox's regression model for counting processes: a large sample study. Annals of Statistics. 1982;10:1100–1120. [Google Scholar]
- Bang H, Robins JM. Doubly robust estimation in missing data and causal inference models. Biometrics. 2005;61:962–972. doi: 10.1111/j.1541-0420.2005.00377.x. [DOI] [PubMed] [Google Scholar]
- Chang S. Estimating marginal effects in accelerated failure time models for serial sojourn times among repeated events. Lifetime Data Analysis. 2004;10:175–190. doi: 10.1023/b:lida.0000030202.20842.c9. [DOI] [PubMed] [Google Scholar]
- Chang SH, Wang MC. Conditional regression analysis for recurrence time data. Journal of the American Statistical Association. 1999;94:1221–1230. [Google Scholar]
- Chen Y, Wang M. Semiparametric regression analysis on longitudinal pattern of recurrent gap times. Biostatistics. 2004;5:277–290. doi: 10.1093/biostatistics/5.2.277. [DOI] [PubMed] [Google Scholar]
- Cook R, Lawless J. The Statistical Analysis of Recurrent Events. New York: Springer; 2007. [Google Scholar]
- Cox DR. Renewal Theory. London: Methuen; 1962. [Google Scholar]
- Cox DR. Regression models and life-tables (with discussion by F. Downton, Richard Peto, D.J. Bartholomew, D.V. Lindley, P.W. Glassborow, D.E. Barton, Susannah Howard, B. Benjamin, John J. Gart, L.D. Meshalkin, A.R. Kagan, M. Zelen, R.E. Barlow, Jack Kalbfleisch, R.L. Prentice and Norman Breslow, and a reply by D.R. Cox) Journal of the Royal Statistical Society Series B. 1972;34:187–220. [Google Scholar]
- Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm (with discussion) Journal of the Royal Statistical Society Series B. 1977;39:1–38. [Google Scholar]
- Duchateau L, Janssen P, Kezic I, Fortpied C. Evolution of recurrent asthma event rate over time in frailty models. Journal of the Royal Statistical Society Series C. 2003;52:355–363. [Google Scholar]
- Elashoff M, Ryan L. An EM algorithm for estimating equations. Journal of Computational and Graphical Statistics. 2004;13:48–65. [Google Scholar]
- Foutz RV. On the unique consistent solution to the likelihood equations. Journal of the American Statistical Association. 1977;72:147–148. [Google Scholar]
- Huang Y. Censored regression with the multistate accelerated sojourn times model. Journal of the Royal Statistical Society Series B. 2002;64:17–29. [Google Scholar]
- Huang Y, Chen Y. Marginal regression of gaps between recurrent events. Lifetime Data Analysis. 2004;9:293–303. doi: 10.1023/a:1025892922453. [DOI] [PubMed] [Google Scholar]
- Kotz S, Shanbhag DN. Some new approaches to probability distributions. Advances in Applied Probability. 1980;12:903–921. [Google Scholar]
- Lin DY, Sun W, Ying Z. Nonparametric estimation of the gap time distributions for serial events with censored data. Biometrika. 1999;86:59–70. [Google Scholar]
- Little RJA, Rubin DB. Statistical Analysis with Missing Data. 2nd edition. New York: Wiley Series in Probability and Statistics, Wiley-Interscience; 2002. [Google Scholar]
- Molenberghs G, Verbeke G. Models for Discrete Longitudinal Data. New York: Springer; 2005. [Google Scholar]
- Murphy SA, Bentley GR, O'Hanesian MA. An analysis for menstrual data with time-varying covariates. Statistics in Medicine. 1995;14:1843–1857. doi: 10.1002/sim.4780141702. [DOI] [PubMed] [Google Scholar]
- Murphy SA, Li B. Projected partial likelihood and its application to longitudinal data. Biometrika. 1995;82:399–406. [Google Scholar]
- Oakes D, Cui L. On semiparametric inference for modulated renewal processes. Biometrika. 1994;81:83–90. [Google Scholar]
- Oakes D, Dasu T. A note on residual life. Biometrika. 1990;77:409–410. [Google Scholar]
- Peña EA, Strawderman RL, Hollander M. Nonparametric estimation with recurrent event data. Journal of the American Statistical Association. 2001;96:1299–1315. [Google Scholar]
- Prentice RL, Williams BJ, Peterson AV. On the regression analysis of multivariate failure time data. Biometrika. 1981;68:373–379. [Google Scholar]
- Rotnitzky A, Robins JM, Scharfstein DO. Semiparametric regression for repeated outcomes with nonignorable nonresponse. Journal of the American Statistical Association. 1998;93:1321–1339. [Google Scholar]
- Scharfstein DO, Rotnitzky A, Robins JM. Adjusting for nonignorable drop-out using semiparametric nonresponse models (with comments and a rejoinder by the authors) Journal of the American Statistical Association. 1999;94:1096–1146. [Google Scholar]
- Strawderman RL. The accelerated gap times model. Biometrika. 2005;92:647–666. [Google Scholar]
- Strawderman RL. A regression model for dependent gap times. International Journal of Biostatics. 2006:34. 2, Artcle 1(electronic) [Google Scholar]
- van der Vaart A. Asymptotic Statistics. New York: Cambridge University Press; 1998. [Google Scholar]
- Yuan K-H, Jennrich RI. Asymptotics of estimating equations under natural conditions. Journal of Multivariate Analysis. 1998;65:245–260. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


























