Joint modeling of recurrent events and a terminal event adjusted for zero inflation and a matched design

Cong Xu; Vernon M Chinchilli; Ming Wang

doi:10.1002/sim.7682

. Author manuscript; available in PMC: 2020 May 26.

Published in final edited form as: Stat Med. 2018 Apr 22;37(18):2771–2786. doi: 10.1002/sim.7682

Joint modeling of recurrent events and a terminal event adjusted for zero inflation and a matched design

Cong Xu ¹, Vernon M Chinchilli ¹, Ming Wang ¹

PMCID: PMC7249437 NIHMSID: NIHMS1588868 PMID: 29682772

Abstract

In longitudinal studies, matched designs are often employed to control the potential confounding effects in the field of biomedical research and public health. Because of clinical interest, recurrent time-to-event data are captured during the follow-up. Meanwhile, the terminal event of death is always encountered, which should be taken into account for valid inference because of informative censoring. In some scenarios, a certain large portion of subjects may not have any recurrent events during the study period due to nonsusceptibility to events or censoring; thus, the zero-inflated nature of data should be considered in analysis. In this paper, a joint frailty model with recurrent events and death is proposed to adjust for zero inflation and matched designs. We incorporate 2 frailties to measure the dependency between subjects within a matched pair and that among recurrent events within each individual. By sharing the random effects, 2 event processes of recurrent events and death are dependent with each other. The maximum likelihood based approach is applied for parameter estimation, where the Monte Carlo expectation-maximization algorithm is adopted, and the corresponding R program is developed and available for public usage. In addition, alternative estimation methods such as Gaussian quadrature (PROC NLMIXED) and a Bayesian approach (PROC MCMC) are also considered for comparison to show our method's superiority. Extensive simulations are conducted, and a real data application on acute ischemic studies is provided in the end.

Keywords: death, frailty models, joint modeling, Monte Carlo expectation-maximization algorithm, recurrent events, zero inflation

1 ∣. INTRODUCTION

Recurrent events occur frequently during the follow-up period in longitudinal clinical studies. For example, patients may experience relapsed tumors, recurrent strokes, or hypoglycemia events on multiple occasions. In many instances, the terminal event of death is commonly encountered during follow-up, which prevents the observations and even the occurrence of any further recurrent events, but not vice versa.¹ Thus, the common assumption of independent censoring for recurrent events is violated because of the competing risk of death because these 2 event processes are often correlated. For instance, if recurrent events (eg, heart attacks) have a substantially negative effect on health condition, then the hazard for death could be diminished. The association between recurrent events and death has attracted increasing interest recently, and a joint analysis taking their correlation into account is needed for valid inference.²

Regarding the joint analysis of recurrent events and death, 2 general approaches can be adopted to accommodate the dependent censoring, namely, marginal models and frailty models. Marginal models regard the terminating event of death as a censoring event for each recurrent event, or estimate the marginal mean of the cumulative number of recurrent events over time.^3-5 However, the dependence between recurrent events and death cannot be specified by a marginal model.⁶ Moreover, copula models could be considered because of their convenience, but undesirable issues still exist, such as strong parametric assumptions on the association structure, and misspecification of the copula may lead to unreliable estimates.⁷ Frailty models or shared random effects models, not only capture the correlation among recurrent events but also incorporate the dependence between 2 event processes of recurrent events and death. Lancaster and Intrator² initially considered a correlation between recurrent events and death via a person-specific frailty term by proposing a joint parametric modeling of the repeated inpatient episodes (via Poisson process) and survival time in a human immunodeficiency virus study. Huang and Wolfe⁸ proposed a joint frailty model for clustered data with informative censoring, where the risk to be censored was affected by the risk of failure by sharing the common log normal frailty. Liu et al⁶ proposed joint models by incorporating a shared gamma frailty that was included in both intensity functions of recurrent events and death to account for their dependency. Zeng and Lin⁹ generalized joint frailty models of recurrent events and death using a broad class of transformation models, rather than Cox proportional hazards (PH) models.

In this paper, our motivation example explores recurrent acute ischemic stroke. As is known, stroke leads to huge economical burden with the estimated direct and indirect cost as 33 billion dollars from 2011 to 2012, and also stroke is the third leading cause of death in the United States.¹⁰ Literatures have shown that recurrent stroke can be one of the major causes of morbidity and mortality among stroke victims, and provided valuable information about the risk of stroke recurrence in stroke survivors and the associated effects of comorbidities.^11,12 However, most studies adopted the basic Cox PH model to analyze the stroke recurrence without considering the competing risk of death or a matched design because the majority are randomized trials or epidemiological studies. Our motivation data are a matched cohort of patients with and without acute ischemic stroke at baseline from MarketScan research database year 2011 to 2014.¹³ The International Classification of Diseases, Ninth Revision code is used to identify the episode of acute ischemic stroke, and the 1:1 exact matching strategy is performed based on several important confounding variables, such as age, gender, admission date, and the follow-up window. Our objective is to investigate the effect of initial stroke detected at baseline and comorbidity (eg, diabetes mellitus, hypertension, and heart disease) on the risk of recurrent stroke occurrence and death during index hospitalization, and also explore the dependency between recurrent stroke events and death.

Regarding this joint analysis, 2 main issues require attention, namely, zero-inflated recurrent events and the matched design. In our study, recurrent stroke events are observed in 597 out of 2122 patients; thus, a substantial portion of patients (71.8%) have no recurrent stroke events during the follow-up period due to nonsusceptibility to events or censoring; thus, the zero-inflated nature of data should be considered in analysis. Zero-inflated Poisson or negative binomial models have been proposed and popularly used for count data to handle the overdispersion.^14,15 Regarding zero-inflated recurrent events data, the models have been extended from the cure models for single-event data, where the population consists of a mixture of noncured (susceptible to the event) and cured subjects (nonsusceptible to the event). For estimation of the cure rate and survival distribution of the noncured subjects, a logistic regression for mixture proportions has been widely considered.^16-19 Recently, Rondeau et al²⁰ and Liu et al¹⁵ extended the Cox PH cure models to recurrent events by incorporating the frailties. The other issue is the matched design, where the correlation within each matched pair should not be ignored; otherwise, the inference will be invalid. The common strategy is to incorporate random effects (ie, frailties) to capture it.¹⁴ Therefore, a hierarchical or nested structure of correlation will be employed, where the frailties that takes the correlation among recurrent stroke events are nested to the ones that capture matched-pair correlation. In recent years, such kind of clinical studies are commonly encountered indicating the needs of advanced model development, for instance, a child survival study in Northeast Brazil with data being collected according to families and communities,²¹ and the assessment, serial evaluation, and subsequent sequelae of acute kidney injury study in which patients are matched by baseline acute kidney injury status within each stratum according to baseline chronic kidney disease and clinical research center.²² Several studies using nested frailty models to account for the hierarchical clustering of the data have been discussed, but limited work exist for recurrent time-to-event data.^21,23,24

In this article, due to specific issues on zero-inflated nature of data and matched designs, we propose a novel joint frailty modeling for recurrent events and a terminal event of death by accommodating a conditional logistic model for subjects with zero recurrent events and nested frailties accounting for the hierarchical correlation structure. In particular, 2 nested frailties are to measure the dependency (1) between stroke and nonstroke subjects within a matched pair and (2) among recurrent stroke events within one individual. By sharing these frailties, death is dependent on recurrent stroke event process. The remainder of this article is organized as follows. In Section 2, we introduce the joint modeling of recurrent events and death in a matched cohort study. We provide the theoretical work on parameter estimation via the Monte Carlo expectation-maximization (MCEM) algorithm in Section 3. Results from simulation and a real data analysis are presented in Sections 4 and 5, respectively. We summarize the conclusions and a discussion of future work in Section 6.

2 ∣. JOINT MODELING

2.1 ∣. Notations

Let i denote the cluster index (ie, matched pairs based on our motivation example), and j denote the subject index, i = 1, 2, … ,I; j = 1, 2, … J_i. Because of the simplicity and our motivation data structure, we consider balanced cluster size; thus, J_i = J, where J is a constant number (ie, J = 2 based on our motivation example); thus, the total sample size N = I × J; however, the generalization of our approach to accommodate unbalanced cluster size is straightforward. For the ijth subject, let C_ij, M_ij, and D_ij be the follow-up time, drop-out time, and the death time, respectively. Let T_ij = min(D_ij, C_ij, M_ij) be the observed follow-up time with Δ_ij = I(D_ij ≤ min(C_ij, M_ij)) as the death indicator, where I(·) is the indicator function. Denote Ψ_ij(t) = I(T_ij ≥ t) as the “at-risk" indicator to show whether the subject is still under observation at time t or not. X_ij denotes the vector of the observed covariates including baseline risk factors, such as gender, race, and smoking status and so on. Currently, we consider time-independent covariates, but the time-varying covariates can be incorporated in a straightforward manner.

Define $N_{i j}^{D} (t) = I (D_{i j} \leq t, Δ_{i j} = 1)$ as the observed death process, and $N_{i j}^{D *} (t) = I (D_{i j} \leq t)$ as the actual death process by time point t. In addition, denote $N_{i j}^{R *} (t) = N {min (D_{i j}, t)}$ and $N_{i j}^{R} (t) = N {min (T_{i j}, t)} = N_{i j}^{R *} (min (T_{i j}, t))$ as the actual and observed number of recurrent events by time point t separately. $N_{i j}^{D}$ and $N_{i j}^{R}$ are both the observed parts of the counting processes $N_{i j}^{D *}$ and $N_{i j}^{R *}$ . The number of recurrent events that occur for the ijth subject over the small interval [t, t + dt) is $d N_{i j}^{R *} (t) = N_{i j}^{R *} ((t + d t)^{-}) - N_{i j}^{R *} (t^{-})$ as dt → 0 and $d N_{i j}^{R} = Ψ_{i j} (t) d N_{i j}^{R *}$ . $R_{i j} = I (N_{i j}^{R} > 0)$ denotes the observed indicator for whether the ijth subject has at least one recurrent stroke during the follow-up. Let t_ijk be the kth recurrent event time and δ_ijk denote the indicator of recurrent events at time t_ijk, $k = 1, 2, \dots, N_{i j}^{R}$ . Therefore, the data of the ijth individual at time t is then O_ij(t) = {Ψ_ij(u), $N_{i j}^{D} (u)$ , $N_{i j}^{R} (u)$ , 0 < u ≤ t}. Then the entire set of observed data for individual ij is O_ij = {O_ij(u), 0 < u ≤ T_ij}.

2.2 ∣. The joint frailty model of zero-inflated recurrent events and death

As we mentioned previously, there may be a high proportion patients (ie, > 50%) who didn't experience any recurrent events by the end of follow-up period. Let Y_ij denote the indicator that the ijth subject will eventually (Y_ij = 1) or never (Y_ij = 0) experience the recurrent events, with probability p_ij = Pr(Y_ij = 1). Define y_ij as the value taken by the random variable Y_ij. It follows that, if R_ij = 1, y_ij = 1 and if R_ij = 0, y_ij is unobserved, then this individual could fall in either of the 2 groups. A logistic regression model is used to model the probability of susceptible or not cured p_ij:

p_{i j} = \frac{exp (β_{1}^{T} {\tilde{X}}_{ij} + μ_{i})}{1 + exp (β_{1}^{T} {\tilde{X}}_{ij} + μ_{i})},

(1)

where β₁ is the vector of regression coefficients, indicating the effect of potential covariates ${\tilde{X}}_{ij}$ . Note that the intercept term is absorbed into ${\tilde{X}}_{ij}$ , which could be overlapped with X_ij.

Let μ_i denote the frailty measuring the dependency between subjects within a matched pair. $Y_{i j}^{'}$ from individuals in the same matched pair tend to be correlated and that also can be captured by μ_i. Denote ω_ij as the frailty measuring the dependency among the recurrent events within the ijth individual. μ_i and ω_ij are shared by the recurrent and terminal events to induce their dependency. Assume that (μ_i, ω_ij)^T ~ MVN(0, Σ). For simplicity, we can assume that μ_i ⊥ ω_ij, where $μ_{i} \sim N (0, σ_{μ}^{2})$ and $ω_{i j} \sim N (0, σ_{ω}^{2})$ . If $σ_{μ}^{2}$ and $σ_{ω}^{2}$ equal 0, then it implies there is no dependency between recurrent events and death, and the heterogeneity in both event processes is solely explained by covariate X_ij. Conditional on frailties μ_i, ω_ij, and covariates X_ij, the zero-inflated model for recurrent events is defined as follows, which is a special case of latent models with 2 classes:

λ_{i j}^{R} (t ∣ X_{ij}, μ_{i}, ω_{i j}) = {\begin{matrix} λ_{0}^{R} (t) exp (β_{2}^{T} X_{ij} + μ_{i} + ω_{i j}) & with probability p_{i j} (Y_{i j} = 1) \\ 0 & with probability 1 - p_{i j} (Y_{i j} = 0), \end{matrix}

(2)

where β₂ is the vector of regression coefficients and $λ_{0}^{R} (t)$ is the baseline intensity function. Conditional on μ_i and ω_ij, the recurrent event process and death are independent of each other. The association between death and recurrent events is quantified by the shared frailties μ_i and ω_ij through ϕ_μ and ϕ_ω in Equation 3, and a higher intensity of recurrent events is associated with a higher hazard rate for mortality. Thus, the hazard function for death is given by

λ_{i j}^{D} (t ∣ X_{ij}, μ_{i}, ω_{i j}) = λ_{0}^{D} (t) exp (β_{3}^{T} X_{ij} + ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}),

(3)

where β₃ is the vector of regression coefficients and $λ_{0}^{D} (t)$ is the baseline hazard function of death. Of note, X_ij is assumed to be the same for $λ_{i j}^{R}$ and $λ_{i j}^{D}$ for simplicity, but can be different due to clinical perspectives in data applications.

Combining Equations 2 and 3, we have a joint model of recurrent events and death adjusted for zero inflation and a matched design. Given $θ = (λ_{0}^{R} (\cdot)$ , $λ_{0}^{D} (\cdot)$ , β₁, β₂, β₃, ϕ_μ, ϕ_ω, $σ_{ω}^{2}$ , $σ_{μ}^{2})^{T}$ , the marginal likelihood is

L (θ) = \int_{μ_{i}} \int_{ω_{i j}} \prod_{i = 1}^{I} \prod_{j = 1}^{J} L (θ ∣ μ_{i}, ω_{i j}) f (μ_{i}) f (ω_{i j}) d μ_{i} d ω_{i j} = \prod_{i = 1}^{I} \int_{μ_{i}} {\prod_{j = 1}^{I} \int_{ω_{i j}} ((L_{i j}^{0})^{1 - R_{i j}} (L_{i j}^{1})^{R_{i j}} L_{i j}^{D} f (ω_{i j}) d ω_{i j} f (μ_{i})} d μ_{i},

(4)

with

L_{i j}^{0} = 1 - p_{i j} + p_{i j} e x p [- \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{R} (t) d t exp (β_{2}^{T} X_{ij} + μ_{i} + ω_{i j})], L_{i j}^{1} = p_{i j} \prod_{k} [λ_{0}^{R} (t) exp (β_{2}^{T} X_{ij} + μ_{i} + ω_{i j})]^{δ_{i j k}} exp [- \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{R} (t) d t exp (β_{2}^{T} X_{ij} + μ_{i} + ω_{i j})], L_{i j}^{D} = [(λ_{0}^{D} (T_{i j}) exp (β_{3}^{T} X_{ij} + ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j})]^{Δ_{i j}} exp (- \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{D} (t) d t exp (β_{3}^{T} X_{ij} + ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j})),

where $L_{i j}^{0}$ is the likelihood of observing zero recurrent events (R_ij = 0), which includes those participants who are at risk for recurrent events but censored or died before the firs event. $L_{i j}^{1}$ , is the likelihood of observing at least one recurrent event (R_ij = 1). $L_{i j}^{D}$ is the likelihood for the terminal event of death. f(μ_i) and f(ω_ij) are normal densities. In addition, the cumulative baseline intensities of recurrent events and terminal event denoted by $Λ_{0}^{R} (t)$ and $Λ_{0}^{D} (t)$ are given by $Λ_{0}^{R} (t) = \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{R} (t) d t$ and $Λ_{0}^{D} (t) = \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{D} (t) d t$ .

3 ∣. ESTIMATION

The parameter estimation can be achieved by maximizing L(θ). There are several commonly approaches used for estimation in the literature. The first approach is based on numerical integration techniques, such as Gaussian quadrature, to integrate out the frailties and then maximize the integrated likelihood. This approach can be easily implemented in SAS PROC NLMIXED and has been adopted by many researchers.^15,20,25 Another approach is a Bayesian framework with Markov chain Monte Carlo (MCMC) technique, which has emerged as a popular tool of joint frailty analysis.^26-28 The MCMC algorithm is able to draw inferences from a complex posterior distribution on a high-dimensional parameter space. Another alternative approach, the most popular for estimation in joint random effects models, is the EM algorithm.^6,9,18,29-31 The EM algorithm iteratively computes maximum likelihood estimates from an incomplete data set by treating the frailties as missing data. In some cases, the joint likelihood functions involving multiple integrals do not yield closed-form expressions or analytic solutions for parameter estimation. Therefore, several numerical integration techniques, such as Monte Carlo^6,18,31 or Laplace approximation,^23,32 can be used.

In this paper, we adopt the MCEM algorithm for estimation. In the E-step, we find the expectation of the conditional log likelihood, and integrate out the frailties by numerical integration with the Metropolis-Hasting algorithm. We define $q_{1 i j} = exp (β_{1}^{T} {\tilde{X}}_{ij} + μ_{i})$ , $q_{2 i j} = exp (β_{2}^{T} X_{ij} + μ_{i} + ω_{i j})$ , and $q_{3 i j} = exp (β_{3}^{T} X_{ij} + ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j})$ . In addition, the cumulative baseline intensities of recurrent events and the terminal event, denoted by $Λ_{0}^{R} (t)$ and $Λ_{0}^{D} (t)$ , are given by $Λ_{0}^{R} (t) = \int_{0}^{\infty} Ψ_{i j} (t) λ_{o}^{R} (t) d t$ and $Λ_{0}^{D} (t) = \int_{0}^{\infty} Ψ_{i j} (t) λ_{o}^{D} (t) d t$ . Thus given the values of latent variable y, frailties μ, and ω, the complete log-likelihood function denoted by l(θ) can be split into 4 parts:

l (θ) = \sum_{i = 1}^{I} \sum_{j = 1}^{J} \log {p_{i j}^{y_{i j}} (1 - p_{i j})^{1 - y_{i j}}} + \sum_{i = 1}^{I} \sum_{j = 1}^{J} \log {{(\prod_{k} (λ_{0}^{R} (t_{i j k}) \cdot q_{2 i j})^{δ_{i j k}})}^{R_{i j} y_{i j}} (exp (- Λ_{0}^{R} (t) \cdot q_{2 i j}))^{y_{i j}}} + \sum_{i = 1}^{I} \sum_{j = 1}^{J} \log {(λ_{0}^{D} (T_{i j}) \cdot q_{3 i j})^{Δ_{i j}} exp (- Λ_{0}^{D} (t) \cdot q_{3 i j})} + \sum_{i = 1}^{I} \sum_{j = 1}^{J} \log {\frac{1}{\sqrt{2 π σ_{μ}^{2}}} exp (- \frac{μ_{i}^{2}}{2 σ_{μ}^{2}}) \frac{1}{\sqrt{2 π σ_{ω}^{2}}} exp (- \frac{ω_{i j}^{2}}{2 σ_{ω}^{2}})} = l_{1} (β_{1}) + l_{2} (β_{2}, λ_{0}^{R} (t)) + l_{3} (β_{3}, λ_{0}^{D} (t), ϕ_{μ}, ϕ_{ω}) + l_{4} (σ_{μ}^{2}, σ_{ω}^{2}) .

(5)

The expectation of the log-likelihood l(θ) conditional on the observed data and the current parameter estimate ${\hat{θ}}^{(k)}$ is

Q (θ ∣ {\hat{θ}}^{(k)}) = E [l (θ) ∣ {\hat{θ}}^{(k)}] = E [l_{1} (β_{1}) ∣ {\hat{θ}}^{(k)}] + E [l_{2} (β_{2}, λ_{0}^{R} (t)) ∣ {\hat{θ}}^{(k)}] + E [l_{3} (β_{3}, λ_{0}^{D} (t), ϕ_{μ}, ϕ_{ω}) ∣ {\hat{θ}}^{(k)}] + E [l_{4} (σ_{μ}^{2}, σ_{ω}^{2}) ∣ {\hat{θ}}^{(k)}] = Q_{1} (β_{1} ∣ {\hat{θ}}^{(k)}) + Q_{2} (β_{2}, λ_{0}^{R} (t) ∣ {\hat{θ}}^{(k)}) + Q_{3} (β_{3}, λ_{0}^{D} (t), ϕ_{μ}, ϕ_{ω} ∣ {\hat{θ}}^{(k)}) + Q_{4} (σ_{μ}^{2}, σ_{ω}^{2} ∣ {\hat{θ}}^{(k)}),

(6)

where

Q_{1} (β_{1} ∣ {\hat{θ}}^{(k)}) = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {E [y_{i j} μ_{i} ∣ {\hat{θ}}^{(k)}] + E [y_{i j} ∣ {\hat{θ}}^{(k)}] β_{1}^{T} {\tilde{X}}_{ij} - E [\log (1 + q_{1 i j}) ∣ {\hat{θ}}^{(k)}]}, Q_{2} (β_{2}, λ_{0}^{R} (t) ∣ {\hat{θ}}^{(k)}) = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {R_{i j} (\sum_{m = 1}^{N_{i j}^{R}} (\log (λ_{0}^{R} (t_{i j k})) + β_{2}^{T} X_{ij} + E [μ_{i} ∣ {\hat{θ}}^{(k)}] + E [ω_{i j} ∣ {\hat{θ}}^{(k)}])), - Λ_{0}^{R} (T_{i j}) exp (β_{2}^{T} X_{ij} + \log E [y_{i j} exp (μ_{i} + ω_{i j}) ∣ {\hat{θ}}^{(k)}])}, Q_{3} (β_{3}, λ_{0}^{D} (t), ϕ_{μ}, ϕ_{ω} ∣ {\hat{θ}}^{(k)}) = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {Δ_{i j} (\log (λ_{0}^{D} (T_{i j})) + β_{3}^{T} X_{ij} + ϕ_{μ} E [μ_{i} ∣ {\hat{θ}}^{(k)}] + ϕ_{ω} E [ω_{i j} ∣ {\hat{θ}}^{(k)}]) - Λ_{0}^{D} (T_{i j}) exp (β_{3}^{T} X_{ij}) E [exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)}]}, Q_{4} (σ_{μ}^{2}, σ_{ω}^{2} ∣ {\hat{θ}}^{(k)}) = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- \frac{1}{2} (\log (2 π) + \log σ_{ω}^{2} + \frac{E [ω_{i j}^{2} ∣ {\hat{θ}}^{(k)}]}{σ_{ω}^{2}}) - \frac{1}{2} (\log (2 π) + \log σ_{μ}^{2} + \frac{E [μ_{i}^{2} ∣ {\hat{θ}}^{(k)}]}{σ_{μ}^{2}})} .

More details on the computation of the conditional expectation above and E-step are given in Appendix A.

In the M-step, we use a Newton-Raphson procedure to maximize $Q_{1} (β_{1} ∣ {\hat{θ}}^{(k)})$ , $Q_{2} (β_{2}, λ_{0}^{R} (t) ∣ {\hat{θ}}^{(k)})$ , $Q_{3} (β_{3}, λ_{0}^{D} (t)$ , $ϕ_{μ}, ϕ_{ω} ∣ {\hat{θ}}^{(k)})$ , $Q_{4} (σ_{μ}^{2}, σ_{ω}^{2} ∣ {\hat{θ}}^{(k)})$ to estimate θ. For the estimation of baseline intensity, we suggest the piecewise constant baseline intensity function, which retains majority model structure, because it provides more flexibility over the a priori choice of a baseline intensity distribution (eg, exponential and Weibull). Partition of M intervals with cutpoints 0 ≤ c₀ < c₁ < … < c_M = ∞ on the time duration can be based on the quantiles of event times, where c₀ = 0 or the smallest event time. The baseline intensity function is assumed to be constant within each of the M intervals, so that

{\tilde{λ}}_{0} (t) = \sum_{m = 1}^{M} λ_{m} I (c_{m - 1} < t \leq c_{m}) .

(7)

The cumulative baseline intensity is

{\tilde{Λ}}_{0} (t) = \sum_{m = 1}^{M} λ_{m} \max (0, min (c_{m} - c_{m - 1}, t - c_{m - 1})) .

(8)

The variance of the maximum likelihood estimation (MLE) $\hat{θ} = {\hat{\tilde{λ}}}_{0}^{^{^{R}}} (t)$ , ${\hat{\tilde{λ}}}_{0}^{^{^{D}}} (t)$ , ${\hat{β}}_{1}$ , ${\hat{β}}_{2}$ , ${\hat{β}}_{3}$ , ${\hat{ϕ}}_{μ}$ , ${\hat{ϕ}}_{ω}$ , ${\hat{σ}}_{ω}^{2}$ , ${\hat{σ}}_{μ}^{2})^{T}$ in the MCEM algorithm cannot be obtained directly from the algorithm. Louis's formula was used to obtain it.³³ The variance-covariance matrix can be estimated using the inverse of an observed information matrix. The observed information matrix $I (\hat{θ})$ is given by

I (\hat{θ}) = - E [\frac{\partial^{2} l (θ)}{\partial θ \partial θ^{T}} ∣ \hat{θ}] - E [\frac{\partial l (θ)}{\partial θ} \frac{\partial l (θ)}{\partial θ^{T}} ∣ \hat{θ}] + E [\frac{\partial l (θ)}{\partial θ} ∣ \hat{θ}] + E [\frac{\partial l (θ)}{\partial θ^{T}} ∣ \hat{θ}] .

(9)

The last term becomes zero because of the MLE $\hat{θ}$ , and all of these terms are evaluated at the last iteration of the EM algorithm. The details of applying the Newton-Raphson algorithm are given in Appendix B.

4 ∣. SIMULATION

In this section, we conduct simulation studies to evaluate our proposed joint frailty model. For simplicity, we only consider one covariate for X, which takes values of 0 or 1 each with probability 0.5, and also only the intercept term for the logistic regression model. Considering exponential distributions for both recurrent event times and death, the following models are shown below:

logit P (Y_{i j} = 1 ∣ μ_{i}) = β_{1} + μ_{i}, λ_{i j}^{R} (t ∣ μ_{i}, ω_{i j}, Y_{i j} = 1) = λ_{0}^{R} (t) exp (β_{2} X_{i j} + μ_{i} + ω_{i j}), λ_{i j}^{D} (t ∣ μ_{i}, ω_{i j}) = λ_{0}^{D} (t) exp (β_{3} X_{i j} + ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) .

(10)

The censoring time is taken as C_ij = 6 * Unif(0, 1), where Unif(0, 1) is a random number generated from a uniform distribution in [0, 1]. Note that the number of pairs (ie, clusters) I = 250, and the cluster size J = 2; thus, the sample size is 500 for each scenario. We assume $μ_{i} \sim N (0, σ_{μ}^{2})$ , $ω_{i j} \sim N (0, σ_{ω}^{2})$ and set $σ_{μ}^{2} = 1$ , $σ_{ω}^{2} = 0.25$ . We set the parameters as β₂ = 1.2, β₃ = 1, ϕ_μ = 1, ϕ_ω = 0.5, but invoke different values for β₁ and recurrent events baseline intensity $λ_{0}^{R}$ . There are 2 setups: (1) β₁ = 1, $λ_{0}^{R} = 0.5$ ; 48% subjects are observed with at least one recurrent event; 22% subjects are susceptible but have not yet experienced any events during the follow-up period; (2) β₁ = 0.5, $λ_{0}^{R} = 0.25$ ; 33% subjects are observed with at least one recurrent event; 27% subjects are susceptible but have not yet experienced any events during the follow-up period. In both setups, $λ_{0}^{D} = 0.1$ , and the censoring rate for death is 40%. The major difference between the 2 setups is the percentage of subjects who have at least one recurrent events (48% and 33%), we recognize these 2 percentages are reasonable to consider the zero inflation, and obviously if the percentage is too low, the convergence issue could be challenging. A total of 500 Monte Carlo replicates are generated for results summary. A nonhomogeneous Poisson process is adopted to simulate recurrent event times, and the detailed procedures are provided in Appendix C. Of note, for comparison, we also consider the estimation approaches based on Gaussian quadrature and a Bayesian approach through SAS PROC MCMC to further evaluate our algorithm.

The piecewise constant approach is adopted to estimate the baseline intensity for recurrent events and baseline hazard for death, where 5 intervals are chosen. The proposed MCEM algorithm is implemented in R software, and the functions are available upon request. Regarding the Gaussian quadrature method, we invoke the adaptive Gaussian quadrature option with 5 quadrature points in SAS PROC NLMIXED. The Bayesian method in SAS PROC MCMC is implemented by the random walk Metropolis algorithm to obtain posterior samples with random walk step size 2.38. A Normal (0, 1000) prior is adopted for β₁, β₂, β₃, ϕ_μ, ϕ_ω. An Inverse gamma (0.01, 0.01) prior is adopted for $σ_{μ}^{2}$ and $σ_{ω}^{2}$ . A Gamma (0.01,0.01) prior is adopted for 5 piecewise baseline intensities for both ${\tilde{λ}}_{0}^{R} (t)$ and ${\tilde{λ}}_{0}^{D} (t)$ . The posterior sample size is 50 000. Also, due to the poor mixing and slow convergence, we thin a chain by keeping every 50th simulated draw from each sequence.

The resulting parameter estimates are shown in Tables 1 and 2. It can be seen that in both setups for the full model, MCEM estimation method yield satisfactory results and all parameter estimates have very small empirical biases. Monte Carlo expectation-maximization has better performance than the Gaussian quadrature and the Bayesian in SAS, especially on the estimation of parameter ϕ_ω, which has the smallest bias. Also, Monte Carlo standard error (MCSE) and MSE estimates from the EM algorithm are smaller than those from PROC NLMIXED and PROC MCMC. Comparing the parameter estimates between settings I and II, the empirical biases tend to increase when the percentage of recurrent events decreases from 48% to 33%. We also compute the mean of the standard error estimate for the full model based on our derivation, which are very close to the MCSE. Therefore, here, we only report MCSE in summary results for simulation.

TABLE 1.

Simulation results of setting I: β₁ = 1, $λ_{0}^{R} = 0.5$ , $λ_{0}^{D} = 0.1 : 48 %$ subjects are observed with at least one recurrent event^a

Setting I			Model I			Model II			Full Model
N = 500	Parameters	True	AB	MCSE	MSE	AB	MCSE	MSE	AB	MCSE	MSE
MCEM	β₁	1	0.203	0.140	0.061				0.000	0.195	0.038
	β₂	1.2	0.020	0.088	0.008	0.049	0.096	0.012	0.002	0.090	0.008
	β₃	1	0.014	0.147	0.022	0.013	0.164	0.027	0.020	0.145	0.021
	ϕ_μ	1				0.216	0.109	0.059	0.033	0.129	0.018
	ϕ_ω	0.5	0.538	0.141	0.309	0.372	0.175	0.169	0.002	0.368	0.135
	$σ_{μ}^{2}$	1				0.757	0.278	0.650	0.025	0.156	0.025
	$σ_{ω}^{2}$	0.25	0.926	0.204	0.900	0.360	0.205	0.171	0.001	0.044	0.002
GQ	β₁	1	0.192	0.205	0.079				0.023	0.223	0.050
	β₂	1.2	0.013	0.124	0.016	0.010	0.132	0.018	0.004	0.116	0.013
	β₂	1	0.056	0.180	0.036	0.011	0.163	0.026	0.041	0.169	0.030
	ϕ_μ	1				0.228	0.119	0.066	0.033	0.154	0.025
	ϕ_ω	0.5	0.541	0.157	0.317	0.293	0.196	0.124	0.040	0.469	0.221
	$σ_{μ}^{2}$	1				0.762	0.296	0.668	0.005	0.187	0.035
	$σ_{ω}^{2}$	0.25	0.986	0.241	1.030	0.466	0.184	0.252	0.002	0.093	0.009
MCMC	β₁	1	0.178	0.218	0.079				0.031	0.230	0.054
	β₂	1.2	0.008	0.154	0.024	0.010	0.152	0.024	0.001	0.118	0.014
	β₃	1	0.076	0.204	0.057	0.022	0.194	0.038	0.062	0.177	0.035
	ϕ_μ	1				0.207	0.134	0.061	0.035	0.312	0.099
	ϕ_ω	0.5	0.565	0.175	0.350	0.262	0.296	0.156	0.097	0.705	0.507
	$σ_{μ}^{2}$	1				0.764	0.305	0.677	0.022	0.196	0.039
	$σ_{ω}^{2}$	0.25	1.05	0.238	1.159	0.462	0.205	0.255	0.005	0.108	0.012

Open in a new tab

Abbreviations: AB, absolute value of the difference between Monte Carlo mean of the parameter estimates (bases on 500 replicates) and the true value; GQ, Gaussian quadrature approach; MCEM, Monte Carlo expectation-maximization algorithm; MCMC, Markov chain Monte Carlo method; MCSE, Monte Carlo standard error; MSE, mean square error.

Another 22% subjects are susceptible but have not yet experienced any events during the follow-up period. The censoring rate for death is 40%. Full model denotes the proposed joint model of zero-inflated recurrent events and death event; Model I denotes the model without considering the matched-pair correlation, where matched pair random effect μ is removed; Model II denotes the model without considering zero inflation for recurrent events.

TABLE 2.

Simulation results of setting II: β₁ = 0.5, $λ_{0}^{R} = 0.25$ , $λ_{0}^{D} = 0.1 : 33 %$ subjects are observed with at least one recurrent event^a

Setup II			Model I			Model II			Full Model
N=500	Parameters	True	AB	MCSE	MSE	AB	MCSE	MSE	AB	MCSE	MSE
MCEM	β₁	0.5	0.102	0.128	0.027				0.016	0.196	0.038
	β₂	1.2	0.040	0.116	0.015	0.038	0.143	0.022	0.009	0.137	0.019
	β₃	1	0.042	0.163	0.028	0.026	0.159	0.026	0.012	0.143	0.021
	ϕ_μ	1				0.282	0.111	0.092	0.012	0.133	0.018
	ϕ_ω	0.5	0.643	0.182	0.446	0.328	0.198	0.146	0.021	0.344	0.119
	$σ_{μ}^{2}$	1				0.928	0.355	0.987	0.013	0.185	0.034
	$σ_{ω}^{2}$	0.25	0.935	0.289	0.958	0.300	0.227	0.142	0.002	0.032	0.001
GQ	β₁	0.5	0.080	0.237	0.062				0.032	0.283	0.081
	β₂	1.2	0.014	0.152	0.023	0.002	0.156	0.024	0.012	0.144	0.021
	β₃	1	0.084	0.195	0.045	0.005	0.167	0.028	0.036	0.175	0.032
	ϕ_μ	1				0.258	0.129	0.083	0.058	0.188	0.039
	ϕ_ω	0.5	0.597	0.187	0.391	0.319	0.254	0.166	0.090	0.499	0.257
	$σ_{μ}^{2}$	1				0.920	0.399	1.005	0.002	0.251	0.063
	$σ_{ω}^{2}$	0.25	1.114	0.305	1.334	0.529	0.270	0.363	0.033	0.137	0.020
MCMC	β₁	0.5	0.070	0.292	0.090				0.028	0.304	0.093
	β₂	1.2	0.003	0.177	0.031	0.007	0.181	0.032	0.002	0.148	0.022
	β₃	1	0.114	0.226	0.064	0.025	0.190	0.037	0.060	0.184	0.038
	ϕ_μ	1				0.210	1.281	1.685	0.053	0.748	0.562
	ϕ_ω	0.5	0.652	0.234	0.479	0.305	0.870	0.849	0.098	0.968	0.947
	$σ_{μ}^{2}$	1				0.961	0.453	1.129	0.031	0.275	0.076
	$σ_{ω}^{2}$	0.25	1.169	0.318	1.468	0.506	0.352	0.380	0.004	0.177	0.031

Open in a new tab

Another 27% subjects are susceptible but have not yet experienced any events during the follow-up period. The censoring rate for death is 40%. Full model denotes the proposed joint model of zero-inflated recurrent events and death event. Model I denotes the model without considering the matched-pair correlation, where matched pair random effect μ is removed. Model II denotes the model without considering zero inflation for recurrent events.

For comparisons with misspecified models, if we do not consider the correlation μ in the matched pair (denoted by model I), the estimation of $σ_{ω}^{2}$ and ϕ_ω are overestimated. The correlation that was induced by matching is accounted for by recurrent events. This indicates that the hierarchical structure of the data needs to be taken into account to obtain accurate inference. If we ignore the zero inflation from the full model (denoted by model II), this will yield poor estimates in variance and coefficients of both frailties ω and μ. For example, in setting I, the estimate of $σ_{ω}^{2}$ is about 2 times larger than the true value, suggesting much more heterogeneity in recurrent events if not accounting for zero inflation. Therefore, our model is recommended in practice for valid inference, and insufficient models could lead to large biased estimates. In addition, per reviewers' comments, we also investigate the robustness of our approach when the assumption of μ_i ⊥ ω_ij or the distribution assumption of normality is violated. Additional simulation studies (not provided here because of limited space) are conducted. The results show that the proposed joint modeling remains applicable even if there exist mild or moderate correlation between nested frailties μ_i and ω_ij (ie, ρ = 0.5); also, if the misspecification of frailty distribution does not affect the performance of this joint modeling in particularly with negligible influence on the parameter estimates of the covariates effects on the risk of recurrent event and death.

5 ∣. APPLICATION-STROKE STUDY

We apply the proposed model to a real data application on recurrent acute ischemic stroke. Our real data example is obtained from the MarketScan database between January 2011 and December 2014, including a matched cohort of patients with and without acute ischemic stroke at baseline, who are aged 45 to 54 years with surgical and medical admission for inpatient acute care hospitalization. Thus, I = 1061 matched pairs with cluster size J = 2 are extracted for analysis with the total sample size is N = 2122. Of note, the episodes of acute ischemic stroke are diagnosed by the International Classification of Diseases, Ninth Revision, Clinical Modification codes with 434.x and 436.x.³⁴ A recurrent stroke is defined as any recurrent stroke occurring more than 28 days after the incident stroke.³⁵ Figure 1 shows the sample data from random 6 matched pairs with and without stroke at baseline. Both cohorts are tracked for in-hospitalized mortality and pre-existing comorbidity conditions including diabetes mellitus, hypertension, and heart disease during the past 12 months, which could potentially influence outcomes.

Sample data on recurrent acute ischemic stroke

Based on preliminary check of the data, the number of recurrent stroke ranges from 0 to 43, and 71.8% of the patients are not observed with recurrent strokes. The wide range of the number of recurrent stroke suggests a large variation across subjects; 139 patients (6.55%) died in the follow-up period, and 405 (20%) withdrew from the study. The preliminary analysis potentially shows that patients with baseline stroke have a higher risk of mortality (57.55%). Thus, censoring because of death is very likely to be informative. Joint frailty models of recurrent events in the presence of death can circumvent this problem by modeling recurrent events and death with shared frailties to capture their dependency. Baseline covariates included in the analysis are stroke status: Stroke (1=yes, 2=no) and Comorbidity (1=yes, 2=no). Comorbidity is defined as one if the patient has any one of diabetes mellitus, hypertension, and heart disease during the past 12 months. Our final model is

logit P (Y_{i j} = 1 ∣ μ_{i}) = β_{0} + β_{1} {Stroke}_{i j} + μ_{i}, λ_{i j}^{R} (t ∣ μ_{i}, ω_{i j}, Y = 1) = λ_{0}^{R} (t) exp (β_{2} {Stroke}_{i j} + β_{3} {Comorbidity}_{i j} + μ_{i} + ω_{i j}), λ_{i j}^{D} (t ∣ μ_{i}, ω_{i j}) = λ_{0}^{D} (t) exp (β_{4} {Stroke}_{i j} + β_{5} {Comorbidity}_{i j} + ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) .

(11)

We summarize the results in Table 3. We can see that both the baseline stroke and comorbidity covariates are significant in the proposed model. Patients with stroke at baseline had on odds ratio of 1.493 (= exp(0.401)) to be "susceptible." Baseline stroke and comorbidity also have significant effects on both intensity of recurrent stroke among those "susceptible" and death hazard among all subjects. The intensity ratio of having recurrent stroke for patients with stroke at baseline is 2.942 (= exp(1.079)) compared with those without stroke at baseline. Compared with patients who don't have comorbidity at baseline, the intensity ratio of having recurrent events is 1.104 (= exp(0.099)) for patients who have comorbidity. Finally, the frailty variance estimate of μ is 0.669, suggesting the existence of heterogeneity in the matched stroke and nonstroke pair. The frailty variance estimate for ω is 0.907, which may due to the wide range of the number of recurrent stroke events among patients. The estimates of ϕ_μ (4.001) and ϕ_ω (0.430) are significant greater than 0, which implies that the recurrent strokes events and death rates are positively associated. Thus, higher frailty (ω) will lead to a higher risk of recurrence and a higher risk of death. Also, higher frailty (μ) will result in a higher risk of recurrence, a higher risk of death, and a higher probability to develop a new stroke event.

TABLE 3.

Analysis of recurrent ischemic stroke data^a

	Model I			Model II			Full Model
Covariate	Estimate	SE	P Value	Estimate	SE	P Value	Estimate	SE	P Value
Incidence
Intercept	−0.725	0.218	<.001				−0.300	0.114	.009
Stroke	2.055	0.244	<.001				0.401	0.134	.003
Recurrent stroke
Stroke	1.152	0.242	<.001	1.045	0.038	<.001	1.079	0.024	<.001
Comorbidity	0.284	0.127	.025	0.307	0.119	<.001	0.099	0.096	.303
Death
Stroke	1.812	0.356	<.001	1.315	0.147	<.001	0.703	0.143	<.001
Comorbidity	1.148	0.322	<.001	1.025	0.230	<.001	0.615	0.169	<.001
ϕ_μ				3.041	0.234	.042	4.001	0.263	<.001
ϕ_ω	2.634	0.216	<.001	0.275	0.191	.009	0.430	0.210	.041
$σ_{μ}^{2}$				1.551	0.091	<.001	0.669	0.074	<.001
$σ_{ω}^{2}$	2.923	0.349	<.001	2.502	0.374	<.001	0.907	0.230	<.001
AIC		14630			14966			11923
BIC		14709			15036			12003

Open in a new tab

Abbreviations: AIC, Akaike information criterion; BIC, Bayesian information criterion; SE, standard error of the parameter estimate.

Full model denotes the proposed joint model of zero-inflated recurrent events and death event. Model I denotes the model without considering the matched-pair correlation, where matched pair random effect μ is removed. Model II denotes the model without considering zero inflation for recurrent events.

We also fit 2 reduced joint frailty models for recurrent stroke, ie, without zero inflation and μ. The results from these 2 models are shown in Table 3. We notice that some parameter estimates are quite different from those in the full model, indicating the necessity to adjust for zero inflation and matched correlation, ie, without considering zero inflation ur matched-pair correlation, the effect of comorbidity at baseline on having recurrent events is enlarged (the rate of having recurrent stroke increases from 1.104 to 1.359 (= exp(0.307)) and 1.328 (= exp(0.284)) compared with patients who don't have comorbidity). We also note that the frailty variance estimates are much larger than that in the full model, ie, not accounting for the zero inflation would leads to more heterogeneity for recurrent events in the reduced model (variance estimate of μ increased from 0.669 to 1.551). Ignoring the correlation induced by the matched design result in severely overlooking the importance of the correlation in recurrent events (variance estimate of ω increased from 0.907 to 2.923). Two information criteria for model assessment, the Akaike information criterion (AIC) and the Bayesian information criterion (BIC) are also calculated, and both AIC and BIC statistics indicate that the proposed full model has the best fit with the smallest AIC = 11923 and BIC = 12003.

6 ∣. DISCUSSION

In this article, we proposed a joint frailty model of recurrent events and a terminal event adjusted for zero inflation and the matched design. Compared with the latest research on similar topics, the major advantage of our model is the ability to quantify the dependency between recurrent events and the terminal event that are correlated in a hierarchical structure (eg, matched case-control study and meta-analysis). Extensive simulations have shown the robustness of our proposal with regards to the violation of several assumptions (ie, μ_i ⊥ ω_ij, normality). Of note is that we consider the multivariate normal distribution for frailties because of the following reasons: (1) The normal distribution is one of most popularly used distributions for frailties.^8,9,18,36 (2) The (cluster-level) random effect is also included in the logistic regression for the zero inflation, and in such generalized linear regression setups, normal distribution is usually assumed for random effects.^37-40 (3) Limited work exist for nested frailties with assumed multivariate normal distributions, where the computation for parameter estimation is more challenging because of nonclosed-form expressions or analytic solutions. Our work can fill up this gap by providing strategies for parameter estimation and statistical inference with the details of theoretical derivations. In addition, we have shown by simulation, and a real data application that using a reduced model (ignoring matched-pair correlation) instead of the proposed full model when there are 2 levels of clustering can lead to biased estimates, with an overestimation of the correlation in recurrent events. This calls into question the validity of traditional statistical techniques such as the shared frailty model in studies with matched design. Another major advantage is the consideration of zero inflation of the recurrent event, which results in more accurate parameter estimation. In general, the price of omitting the feature of zero inflation of the events in the data from the models is biased estimates and overlooking the importance of certain cluster effects. Also, in medical research, investigators can evaluate the treatment effect by estimating the fraction of cured subjects, which is a substantially important question.

We also implemented our joint modeling approach in R software by considering an MCEM algorithm with a piecewise baseline intensity for the MLE of the model parameters. The R functions are available upon request, which can be flexibly modified for model extension and other research purposes. Most of the latest research on similar topics adopt the Gaussian quadrature in SAS PROC NLMIXED for estimation. Although SAS PROC NLMIXED is easy to implement for parameter estimation, there exist several limitations in practice. In our situation, we have 2 levels of nested random effects, which leads to dramatically computationally intense and convergence issues with about 3% failed optimization because of the fact that Hessian matrix is not positive definite. The number of failure optimizations may highly increase with a more sophisticated joint frailty model (eg, more levels of random effects). Because of the large volume of parameters for estimation and complexity of the posterior distribution, the adaptive MCMC algorithm in SAS also requires high computation burden because a large number of posterior samples are required. Sometimes, poor mixing or slow convergence issues emerge because the model parameters may be correlated with each other. Also, it is a little bit unstable because it yields relatively large standard errors on some parameter estimates, especially the parameters related with frailties.

There exist some limitations in our data. In the MarketScan research database, it is not possible to obtain the death information when patients are discharged from the hospital. The study cohort was tracked for in-huspitalized mortality only. Therefore, the death rate in our data underrepresents the true rate. The dependency between recurrent acute ischemic stroke and death may be underestimated. For the future work, our model can be extended in several directions. First, other functional forms of the frailties can be incorporated in the proposed joint model, such as gamma frailties, which have been applied by many researchers.^6,23 With only a minor modification, the corresponding likelihood and the estimation algorithm are readily available. Second, time-varying covariates also can be incorporated. Third, we can consider more complex joint model settings. For example, we can jointly model longitudinal biomarkers with the current model.

ACKNOWLEDGEMENTS

The authors thank Dr Lan Kong and Dr W. Brian Reeves for helpful suggestions on statistical models and study designs. We also thank Dr Douglas Leslie, Dr Guodong Liu, and MS Djibril Ba for data preparation and permission to use the data from the MarketScan research database.

The project described was supported in part by research grant U01 DK082183 from the National Institute of Digestive, Diabetes and Kidney Diseases of the National Institutes of Health, US Department of Health and Human Services and by the National Center for Advancing Translational Sciences, grant KL2 TR000126 and TR002015. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Funding information

The National Institute of Digestive, Diabetes and Kidney Diseases (NIDDK), Grant/Award Number: U01 DK082183; The National Center for Advancing Translational Sciences, Grant/Award Number: KL2 TR002015 and UL1 TR002014; The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Appendix

APPENDIX A: E-STEP OF THE MCEM ALGORITHM

A.1 ∣. Conditional expectation

The computation of the conditional expectation Equation 6 is not trivial. Denote the function of frailties as h(γ_ij) and γ_ij = (y_ij, μ_i, ω_ij)^T. Thus, the expectation of complete data likelihood is $E [h (γ_{ij} ∣ {\hat{θ}}^{(k)})]$ . The term ${\hat{θ}}^{(k)}$ denotes all current parameters estimated in the M-step. The conditional density of γ_ij given the observed data and current estimate of the parameters is

f (γ_{ij} ∣ {\hat{θ}}^{(k)}) = \frac{L ({\hat{θ}}^{(k)} ∣ γ_{ij}) f (γ_{ij})}{\int_{- \infty}^{\infty} L ({\hat{θ}}^{(k)} ∣ γ_{ij}) f (γ_{ij}) d γ_{ij}} .

(A1)

Thus, the conditional expectation for any function h of the random effects is

E [h (γ_{ij} ∣ {\hat{θ}}^{(k)})] = \int_{- \infty}^{\infty} h (γ_{ij}) f (γ_{ij} ∣ {\hat{θ}}^{(k)}) d γ_{ij} = \frac{\int_{- \infty}^{\infty} h (γ_{ij}) k (γ_{ij} ∣ {\hat{θ}}^{(k)}) d γ_{ij}}{k (γ_{ij} ∣ {\hat{θ}}^{(k)}) d γ_{ij}},

(A2)

where $k (γ_{ij} ∣ {\hat{θ}}^{(k)})$ is the kernel density after removing the parts that do not depend on γ_ij.

A.2 ∣. E-step

Expectation terms that involve y_ij in Equation 6 in E-step are: $E [y_{i j} ∣ {\hat{θ}}^{(k)}]$ , $E [y_{i j} μ_{i} ∣ {\hat{θ}}^{(k)}]$ , $E [y_{i j} exp (μ_{i} + ω_{i j}) ∣ {\hat{θ}}^{(k)}]$ . $E [y_{i j} μ_{i} ∣ {\hat{θ}}^{(k)}]$ is not involved in the M-step, and thus not considered. The expectation terms are evaluated with respect to p(y, μ, ω):

p (y, μ, ω ∣ {\hat{θ}}^{(k)}) \propto \prod_{i = 1}^{I} \prod_{j = 1}^{J} {[N_{i j}^{R} exp (μ_{i} + ω_{i j})]^{R_{i j}} exp (- y_{i j} Λ_{0}^{R} (T_{i j}) q_{2 i j}^{(k)}) \times \frac{exp (y_{i j} (β_{1}^{T (k)} {\tilde{X}}_{ij} + μ_{i}))}{1 + exp (β_{1}^{T (k)} {\tilde{X}}_{ij} + μ_{i})} (exp (ϕ_{μ}^{(k)} μ_{i} + ϕ_{ω}^{(k)} ω_{i j}))^{Δ_{i j}} exp (- Λ_{0}^{D (k)} (T_{i j}) q_{3 i j}^{(k)}) \times \frac{1}{\sqrt{2 π σ_{μ}^{2 (k)}}} exp (- \frac{μ_{i}^{2}}{2 σ_{μ}^{2 (k)}}) \frac{1}{\sqrt{2 π σ_{ω}^{2 (k)}}} exp (- \frac{ω_{i j}^{2}}{2 σ_{ω}^{2 (k)}})}

(A3)

where, $q_{1 i j}^{(k)} = exp (β_{1}^{T (k)} {\tilde{X}}_{ij} + μ_{i})$ , $q_{2 i j}^{(k)} = exp (β_{2}^{T (k)} X_{ij} + μ_{i} + ω_{i j})$ , $q_{3 i j}^{(k)} = exp (β_{3}^{T (k)} X_{ij} + ϕ_{μ}^{(k)} μ_{i} + ϕ_{ω}^{(k)} ω_{i j})$ , and $p_{i j}^{(k)} = \frac{exp (β_{1}^{T (k)} {\tilde{X}}_{ij} + μ_{i})}{1 + exp (β_{1}^{T (k)} {\tilde{X}}_{ij} + μ_{i})}$ .

We know that if R_ij = 1, then y_ij = 1. If R_ij = 0, then y_ij follows the binomial distribution below:

y_{i j} ∣ μ, ω \sim b i n {1, π = \frac{p_{i j}^{(k)} exp (- Λ_{0}^{R (k)} (T_{i j}) q_{2 i j}^{(k)})}{1 - p_{i j}^{(k)} + p_{i j}^{(k)} exp (- Λ_{0}^{R (k)} (T_{i j}) q_{2 i j}^{(k)})}} .

(A4)

Therefore, conditional on the observed data and the current parameter estimation ${\hat{θ}}^{(k)}$ , we can evaluate the following expectations:

E [y_{i j} ∣ {\hat{θ}}^{(k)}] = {\begin{matrix} E [\frac{p_{i j}^{(k)} exp (- Λ_{0}^{R (k)} (T_{i j}) q_{2 i j}^{(k)})}{1 - p_{i j}^{(k)} + p_{i j}^{(k)} exp (- Λ_{0}^{R (k)} (T_{i j}) q_{2 i j}^{(k)})} ∣ {\hat{θ}}^{(k)}] & if R_{i j} = 0 \\ 1 & otherwise, \end{matrix}

(A5)

E [y_{i j} exp (μ_{i} + ω_{i j}) ∣ {\hat{θ}}^{(k)}] = {\begin{matrix} E [\frac{exp (μ_{i} + ω_{i j}) p_{i j}^{(k)} exp (- Λ_{0}^{R (k)} (T_{i j}) q_{2 i j}^{(k)})}{1 - p_{i j}^{(k)} + p_{i j}^{(k)} exp (- Λ_{0}^{R (k)} (T_{i j}) q_{2 i j}^{(k)})} ∣ {\hat{θ}}^{(k)}] & if R_{i j} = 0 \\ E [exp (μ_{i} + ω_{i j}) ∣ {\hat{θ}}^{(k)}] & otherwise, \end{matrix}

(A6)

where the expectation are taken with respect to the $p (μ, ω ∣ {\hat{θ}}^{(k)})$ .

p (μ, ω ∣ {\hat{θ}}^{(k)}) \propto \prod_{i = 1}^{I} \prod_{j = 1}^{J} {1 - p_{i j}^{(k)} + p_{i j}^{(k)} exp (- \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{R (k)} (t) d t q_{2 i j}^{(k)}) p_{i j}^{(k)} \prod_{k} {(q_{2 i j}^{(k)})}^{δ_{i j k}} exp (- \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{R (k)} (t) d t q_{2 i j}^{(k)}) \times {(q_{3 i j}^{(k)})}^{Δ_{i j}} exp (- \int_{0}^{\infty} Ψ_{i j} (t) λ_{0}^{D (k)} d t q_{3 i j}^{(k)}) \frac{1}{\sqrt{2 π σ_{μ}^{2 (k)}}} exp (- \frac{μ_{i}^{2}}{2 σ_{μ}^{2 (k)}}) \frac{1}{\sqrt{2 π σ_{ω}^{2 (k)}}} exp (- \frac{ω_{i j}^{2}}{2 σ_{ω}^{2 (k)}})} .

(A7)

Since there is no closed-form expression of $p (μ, ω ∣ {\hat{θ}}^{(k)})$ , Monte Carlo methods in combination with the Metropolis-Hasting algorithm are used to approximate the posterior distributions of $u_{i}^{'} s$ and $ω_{i j}^{'} s$ . Given ${\hat{θ}}^{(k)}$ , N random samples are generated for $μ_{i}^{(m)}$ (m = 1, … M) and $ω_{i j}^{(m)}$ (m = 1, … M) to estimate the expectation of the sufficient statistics involving frailties. Thus, $E [f (μ_{i}) ∣ {\hat{θ}}^{(k)}] = \frac{1}{N} \sum_{n = 1}^{N} f (μ_{i})^{(n)}$ and $E [f (ω_{i j}) ∣ {\hat{θ}}^{(k)}] = \frac{1}{N} \sum_{n = 1}^{N} f (ω_{i j})^{(n)}$ , where f(·) could be any smooth and monotone function.

APPENDIX B : M-STEP OF THE MCEM ALGORITHM

In the M-step, first we have closed-form estimators for ${\tilde{λ}}_{0}^{R} (t)$ , ${\tilde{λ}}_{0}^{D} (t) σ_{μ}^{2}$ and $σ_{ω}^{2}$ . For the estimation of ${\tilde{λ}}_{0}^{R} (t)$ , first we introduce several notations. Assume there are M intervals with cutpoints 0 ≤ $c_{0}^{R}$ < $c_{1}^{R}$ < … < $c_{M}^{R}$ = ∞, which can be quantiles of recurrent event times, where $c_{0}^{R} = 0$ or the smallest event time. Denote the indicator function $I_{m}^{R} (t) = I (c_{m - 1}^{R} < t \leq c_{m}^{R})$ . For m = 1, 2, … , M,

{\tilde{λ}}_{0}^{R} (t) = \sum_{m = 1}^{M} λ_{m}^{R} I (c_{m - 1}^{R} < t \leq c_{m}^{R}) = \sum_{m = 1}^{M} λ_{m}^{R} I_{m}^{R} (t) .

(B1)

Therefore, we denote by $λ^{R} = (λ_{1}^{R}, λ_{2}^{R}, \dots, λ_{m}^{R}, \dots, λ_{M}^{R})$ the parameter we aim to estimate. Denote $E_{i j k, m}^{R} = I^{R} (T_{i j} \geq c_{m - 1}^{R}) (c_{m}^{R} \land T_{i j} - c_{m - 1}^{R})$ the total time individual ij is at risk in the mth recurrent events time interval $(c_{m - 1}^{R}, c_{m}^{R}]$ . $O_{i j, m}^{R} = \cap ε \sum_{k}^{N_{i j}^{R}} I_{m}^{R} (t_{i j k}) δ_{i j k}$ , the number of recurrent events for individual ijth in the mth subinterval. The maximum likelihood estimator has an explicit solution:

{\hat{\tilde{λ}}}_{m}^{^{^{R (k + 1)}}} = \frac{\sum_{m}^{M} [\sum_{i = 1}^{I} \sum_{j = 1}^{J} O_{i j, m}^{R}]}{\sum_{m}^{M} [\sum_{i = 1}^{I} \sum_{j = 1}^{J} E_{i j, m}^{R} exp (X_{i j} β_{2}^{k}) E (y_{i j} exp (μ_{i} + ω_{i j}) ∣ {\hat{θ}}^{(k)})]} .

(B2)

To estimate ${\tilde{λ}}_{0}^{D} (t)$ , similarly, we have the closed form:

{\hat{\tilde{λ}}}_{m}^{^{^{D (k + 1)}}} = \frac{\sum_{m}^{M} [\sum_{i = 1}^{I} \sum_{j = 1}^{J} O_{i j, m}^{D}]}{\sum_{m}^{M} [\sum_{i = 1}^{I} \sum_{j = 1}^{J} E_{i j, m}^{D} exp (X_{i j} β_{3}^{k}) E (exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})]},

(B3)

where $0 \leq c_{0}^{D} < c_{1}^{D} < \dots < c_{M}^{D} = \infty$ is the quantiles of death event times. $I_{m}^{D} (t) = I (c_{m - 1}^{D} < t \leq c_{m}^{D})$ . $E_{i j k, m}^{D} = I^{D} (T_{i j} \geq c_{m - 1}^{D}) (c_{m}^{D} \land T_{i j} - c_{m - 1}^{D})$ is the total time individual ij is at risk in the mth death events time interval $(c_{m - 1}^{D}, c_{m}^{D}]$ . $O_{i j, m}^{D} = \cap ε I_{m}^{D} (T_{i j}) Δ_{i j}$ , the number of death event for individual ijth in the mth interval.

The corresponding score functions and second partial derivatives for $σ_{μ}^{2}$ and $σ_{ω}^{2}$ are

S (σ_{μ}^{2}) = \frac{\partial Q_{4}}{\partial σ_{μ}^{2}} = \sum_{i = 1}^{I} {- \frac{1}{2 σ_{μ}^{2}} + \frac{E (μ_{i}^{2} ∣ {\hat{θ}}^{(k)})}{2 σ_{μ}^{4}}}, S (σ_{ω}^{2}) = \frac{\partial Q_{4}}{\partial σ_{ω}^{2}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- \frac{1}{2 σ_{ω}^{2}} + \frac{E (ω_{i j}^{2} ∣ {\hat{θ}}^{(k)})}{2 σ_{ω}^{4}}} .

The estimators for $σ_{μ}^{2}$ and $σ_{ω}^{2}$ are

{\hat{σ}}_{μ}^{2^{(k + 1)}} = \frac{1}{I} \sum_{i = 1}^{I} E (μ_{i}^{2} ∣ {\hat{θ}}^{(k)}),

(B4)

{\hat{σ}}_{ω}^{2^{(k + 1)}} = \frac{1}{I J} \sum_{i = 1}^{I} \sum_{j = 1}^{J} E (ω_{i j}^{2} {\hat{θ}}^{(k)}),

(B5)

of note that IJ is the total number of subjects.

For the other parameters, which do not have the closed-form estimator, the Newton-Raphson algorithm is used to solve the expected log-likelihood iteratively. Given the kth estimate ${\hat{θ}}^{(k)}$ , the (k + 1)th estimate is obtained by

{\hat{β}}_{2}^{(k + 1)} = {\hat{β}}_{2}^{(k)} + I_{β_{2}}^{- 1} ({\hat{β}}_{2}^{(k)}) S_{β_{2}} ({\hat{β}}_{2}^{(k)}) .

(B6)

Denote the parameters in $Q_{3} (β_{3}^{T}, {\tilde{λ}}_{0}^{D}, ϕ_{μ}, ϕ_{ω})$ as τ = (β₃, ϕ_μ, ϕ_ω). The gradient function $g (τ) = (\frac{\partial Q 3}{\partial β_{3}}, \frac{\partial Q 3}{\partial ϕ_{μ}}, \frac{\partial Q 3}{\partial ϕ_{ω}})$ , and the Hessian matrix can be obtained and denoted as H. Then, given current estimate ${\hat{θ}}^{(k)}$ , the (k + 1)th estimate is obtained by

{\hat{τ}}^{(k + 1)} = {\hat{τ}}^{(k)} - H_{τ}^{- 1} ({\hat{τ}}^{(k)}) g_{τ} ({\hat{τ}}^{(k)}) .

(B7)

Similarly, the MLE of β₁ can be updated by

{\hat{β}}_{1}^{(k + 1)} = {\hat{β}}_{1}^{(k)} + I_{β_{1}}^{- 1} ({\hat{β}}_{1}^{(k)}) S_{β_{1}} ({\hat{β}}_{1}^{(k)}),

(B8)

where S(·) and I(·) are the score function and information matrix, respectively. To simplify the formula, we assume here that X_ij only contains one covariate, which is denoted as X_ij. Then the corresponding score functions and information matrices of the parameters are

S (β_{1}) = \frac{\partial Q_{1}}{\partial β_{1}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {E (y_{i j} ∣ {\hat{θ}}^{(k)}) X_{i j} - exp (β_{1} X_{i j}) X_{i j} E [\frac{exp (μ_{i})}{1 + q_{1 i j}} ∣ {\hat{θ}}^{(k)}]}, I (β_{1}) = - \frac{\partial^{2} Q_{1}}{\partial β_{1}^{2}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {exp (β_{1} X_{i j}) X_{i j}^{2} E [\frac{exp (μ_{i})}{(1 + q_{1 i j})^{2}} ∣ {\hat{θ}}^{(k)}]}, S (β_{2}) = \frac{\partial Q_{2}}{\partial β_{2}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {\sum_{k}^{N_{i j}^{R}} δ_{i j k} X_{i j} - {\tilde{Λ}}_{0}^{R} (T_{i j}) X_{i j} exp (X_{i j} β_{2}) E (y_{i j} exp (μ_{i} + ω_{i j}) ∣ {\hat{θ}}^{(k)})}, I (β_{2}) = - \frac{\partial^{2} Q_{2}}{\partial β_{2}^{2}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {{\tilde{Λ}}_{0}^{R} (T_{i j}) X_{i j}^{2} exp (X_{i j} β_{2}) E (y_{i j} exp (μ_{i} + ω_{i j}) ∣ {\hat{θ}}^{(k)})}, S (β_{3}) = \frac{\partial Q_{3}}{\partial β_{3}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {Δ_{i j} X_{i j} - {\tilde{Λ}}_{0}^{D} (T_{i j}) X_{i j} exp (X_{i j} β_{3}) E (exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, S (ϕ_{μ}) = \frac{\partial Q_{3}}{\partial ϕ_{μ}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {Δ_{i j} E (μ_{i} ∣ {\hat{θ}}^{(k)}) - {\tilde{Λ}}_{0}^{D} (T_{i j}) exp (X_{i j} β_{3}) E (μ_{i} exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, S (ϕ_{ω}) = \frac{\partial Q_{3}}{\partial ϕ_{ω}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {Δ_{i j} E (ω_{i j} ∣ {\hat{θ}}^{(k)}) - {\tilde{Λ}}_{0}^{D} (T_{i j}) exp (X_{i j} β_{3}) E (ω_{i j} exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, \frac{\partial^{2} Q_{3}}{\partial β_{3}^{2}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- {\tilde{Λ}}_{0}^{D} (T_{i j}) X_{i j}^{2} exp (X_{i j} β_{3}) E (exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, \frac{\partial^{2} Q_{3}}{\partial ϕ_{μ}^{2}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- {\tilde{Λ}}_{0}^{D} (T_{i j}) exp (X_{i j} β_{3}) E (exp (μ_{i}^{2} exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, \frac{\partial^{2} Q_{3}}{\partial ϕ_{ω}^{2}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- {\tilde{Λ}}_{0}^{D} (T_{i j}) exp (X_{i j} β_{3}) E (ω_{i j}^{2} exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, \frac{\partial^{2} Q_{3}}{\partial β_{3} ϕ_{μ}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- {\tilde{Λ}}_{0}^{D} (T_{i j}) X_{i j} exp (X_{i j} β_{3}) E (μ_{i} exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, \frac{\partial^{2} Q_{3}}{\partial β_{3} ϕ_{ω}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- {\tilde{Λ}}_{0}^{D} (T_{i j}) X_{i j} exp (X_{i j} β_{3}) E (ω_{i j} exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})}, \frac{\partial^{2} Q_{3}}{\partial ϕ_{μ} ϕ_{ω}} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- {\tilde{Λ}}_{0}^{D} (T_{i j}) exp (X_{i j} β_{3}) E (μ_{i} ω_{i j} exp (ϕ_{μ} μ_{i} + ϕ_{ω} ω_{i j}) ∣ {\hat{θ}}^{(k)})} .

More components of the information matrix are given below:

\frac{\partial^{2} l (θ)}{\partial σ_{μ}^{2}} = - \frac{1}{2} {- \frac{I}{σ_{μ}^{4}} + \frac{2 \sum_{i = 1}^{I} E (μ_{i}^{2} ∣ {\hat{θ}}^{(k)})}{σ_{μ}^{6}}}, \frac{\partial^{2} l (θ)}{\partial σ_{ω}^{2}} = - \frac{1}{2} {- \frac{I J}{σ_{ω}^{4}} + \frac{2 \sum_{i = 1}^{I} \sum_{j = 1}^{J} E (ω_{i j}^{2} ∣ {\hat{θ}}^{(k)})}{σ_{ω}^{6}}} .

All other off-diagonal terms are zero. When X_ij is a covariate vector, corresponding score functions and information matrices of the parameters can be easily adapted to matrix versions.

APPENDIX C : SIMULATION ON RECURRENT EVENT TIMES

In this section, we give a brief description about the simulation procedure. Based on the definition of a Poisson process, for each 0 ≤ s ≤ t, N(t)−N(s) has a Poisson distribution with mean $m (t) - m (s) = \int_{s}^{t} λ (x) d x$ . Therefore, we need to simulate recurrent event times from a Poisson process with any intensity function λ(t). The most common parametric choices for time-to-event simulation are exponential, Weibull, and Gompertz distributions. As noted by Bender et al,⁴¹ among the commonly used distributions for survival times, only these 3 distributions share the assumption of proportional hazards, and the survival time, T, can be generated by $T = Λ_{0}^{- 1} [- \log (u) exp (- β^{T} x)]$ , where u ~ Unif[0, 1].

There are several available methods for recurrent event times simulation, and here we adopt the inversion method, which is often used. Referring to the inversion method by Pénichoux et al,⁴² for a given subject, let T_j denote the time elapsed from the origin to the jth event, T₀ = 0, and let W_j = T_j − T_j−1 denote the jth waiting time between 2 consecutive events. The W_j are referred to as gap times. The number of events observed for a nonhomogeneous Poisson process with intensity λ(·) between 2 times, s and t follows a Poisson distribution with parameter $\int_{s}^{t} λ (v) d v$ , which leads to $F_{j} (w) = 1 - exp (- \int_{T_{j - 1}}^{T_{j - 1} + w} λ (v) d v)$ . $W_{j} = F_{j}^{- 1} (u_{j})$ and $T_{j} = T_{j - 1} + W_{j}$ . For simplicity, we adopt an exponential distribution with constant baseline constant λ₀(t) = λ, the recurrent event process can be considered equivalently as an homogeneous Poisson process or a sequence of exponentially distributed gaps.⁴² The survival times are generated by $T = - \frac{\log (u)}{λ exp (- β^{T} x)}$ , $T_{j} = T_{j - 1} + W_{j} = - \frac{\log (u_{j})}{λ exp (- β^{T} x)} + T_{j - 1}$ .

REFERENCES

1.Fine JP, Jiang H, Chappell R. On semi-competing risks data. Biometrika. 2001;88(4):907–919. [Google Scholar]
2.Lancaster T, Intrator O. Panel data with survival: hospitalization of hiv-positive patients. J Am Stat Assoc. 1998;93(441):46–53. [Google Scholar]
3.Chen BE, Cook RJ. Tests for multivariate recurrent events in the presence of a terminal event. Biostatistics. 2004;5(1):129–143. [DOI] [PubMed] [Google Scholar]
4.Cook RJ, Lawless JF. Marginal analysis of recurrent events and a terminating event. Stat Med. 1997;16(8):911–924. [DOI] [PubMed] [Google Scholar]
5.Ghosh D, Lin DY. Marginal regression models for recurrent and terminal events. Stat Sin. 2002;12:663–688. [Google Scholar]
6.Liu L, Wolfe RA, Huang X. Shared frailty models for recurrent events and a terminal event. Biometrics. 2004;60(3):747–756. [DOI] [PubMed] [Google Scholar]
7.Li R, Cheng Y. Flexible association modelling and prediction with semi-competing risks data. Can J Stat. 2016;44(3):361–374. [Google Scholar]
8.Huang X, Wolfe RA. A frailty model for informative censoring. Biometrics. 2002;58(3):510–520. [DOI] [PubMed] [Google Scholar]
9.Zeng D, Lin DY. Semiparametric transformation models with random effects for joint analysis of recurrent and terminal events. Biometrics. 2009;65(3):746–752. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Mozaffarian D, Benjamin EJ, Go AS, et al. Executive summary: heart disease and stroke statistics—2016 update: a report from the American Heart Association. Circ. 2016;133(4):447. [DOI] [PubMed] [Google Scholar]
11.Azarpazhooh MR, Nicol MB, Donnan GA, et al. Patterns of stroke recurrence according to subtype of first stroke event: the North East Melbourne Stroke Incidence Study (NEMESIS). Int J Stroke. 2008;3(3):158–164. [DOI] [PubMed] [Google Scholar]
12.Feng W, Hendry RM, Adams RJ. Risk of recurrent stroke, myocardial infarction, or death in hospitalized stroke patients. Neurology. 2010;74(7):588–593. [DOI] [PubMed] [Google Scholar]
13.TruvenHealth. Marketscan research databases. 1990. https://marketscan.truvenhealth.com/marketscanportal/.
14.DeSantis SM, Lazaridis C, Ji S, Spinale FG. Analyzing propensity matched zero-inflated count outcomes in observational studies. J Appl Stat. 2014;41(1):127–141. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Liu L, Huang X, Yaroshinsky A, Cormier JN. Joint frailty models for zero-inflated recurrent events in the presence of a terminal event. Biometrics. 2016;72(1):204–214. [DOI] [PubMed] [Google Scholar]
16.Farewell VT. The use of mixture models for the analysis of survival data with long-term survivors. Biometrics. 1982;38(4):1041–1046. [PubMed] [Google Scholar]
17.Kuk AYC, Chen CH. A mixture model combining logistic regression with proportional hazards regression. Biometrika. 1992;79(3):531–541. [Google Scholar]
18.Peng Y, Taylor JMG. Mixture cure model with random effects for the analysis of a multi-center tonsil cancer study. Stat Med. 2011;30(3):211–223. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Sy JP, Taylor JMG. Estimation in a Cox proportional hazards cure model. Biometrics. 2000;56(1):227–236. [DOI] [PubMed] [Google Scholar]
20.Rondeau V, Schaffner E, Corbière F, Gonzalez JR, Mathoulin-Pélissier S. Cure frailty models for survival data: application to recurrences for breast cancer and to hospital readmissions for colorectal cancer. Stat Methods Med Res. 2013;22(3):243–260. [DOI] [PubMed] [Google Scholar]
21.Sastry N A nested frailty model for survival data, with an application to the study of child survival in northeast Brazil. J Am Stat Assoc. 1997;92(438):426–435. [DOI] [PubMed] [Google Scholar]
22.Go AS, Parikh CR, Ikizler TA, et al. The assessment, serial evaluation, and subsequent sequelae of acute kidney injury (ASSESS-AKI) study: design and methods. BMC Nephrol. 2010;11(1):1. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Rondeau V, Filleul L, Joly P. Nested frailty models using maximum penalized likelihood estimation. Stat Med. 2006;25(23):4036–4052. [DOI] [PubMed] [Google Scholar]
24.Yau KKW. Multilevel models for survival analysis with random effects. Biometrics. 2001;57(1):96–102. [DOI] [PubMed] [Google Scholar]
25.Liu L, Huang X. The use of gaussian quadrature for estimation in frailty proportional hazards models. Stat Med. 2008;27(14):2665–2683. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Ibrahim JG, Chen MH, Sinha D. Bayesian methods for jointmodeling of longitudinal and survival data with applications to cancer vaccine trials. Stat Sin. 2004;14:863–883. [Google Scholar]
27.Ouyang B, Sinha D, Slate EH, Van Bakel AB. Bayesian analysis of recurrent event with dependent termination: an application to a heart transplant study. Stat Med. 2013;32(15):2629–2642. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Wen S, Huang X, Frankowski RF, Cormier JN, Pisters P. A bayesian multivariate joint frailty model for disease recurrences and survival. Stat Med. 2016;35(26):4794–4812. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Kim S, Zeng D, Chambless L, Li Y. Joint models of longitudinal data and recurrent events with informative terminal event. Stat Biosci. 2012;4(2):262–281. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Klein JP. Semiparametric estimation of random effects using the Cox model based on the EM algorithm. Biometrics. 1992;48:795–806. [PubMed] [Google Scholar]
31.Vaida F, Xu R. Proportional hazards model with random effects. Stat Med. 2000;19(24):3309–3324. [DOI] [PubMed] [Google Scholar]
32.Król A, Ferrer L, Pignon JP, et al. Joint model for left-censored longitudinal data, recurrent events and terminal event: predictive abilities of tumor burden for cancer evolution with application to the FFCD 2000-05 trial. Biometrics. 2016;72(3):907–916. [DOI] [PubMed] [Google Scholar]
33.Louis TA. Finding the observed information matrix when using the EM algorithm. J R Stat Soc Ser B Methodol. 1982;44(2):226–233. [Google Scholar]
34.Goldstein LB. Accuracy of ICD-9-CM coding for the identification of patients with acute ischemic stroke effect of modifier codes. Stroke. 1998;29(8):1602–1604. [DOI] [PubMed] [Google Scholar]
35.Coull AJ, Rothwell PM. Underestimation of the early risk of recurrent stroke evidence of the need for a standard definition. Stroke. 2004;35(8):1925–1929. [DOI] [PubMed] [Google Scholar]
36.Zeng D, Ibrahim JG, Chen MH, Hu K, Jia C. Multivariate recurrent events in the presence of multivariate informative censoring with applications to bleeding and transfusion events in myelodysplastic syndrome. J Biopharm Stat. 2014;24(2):429–442. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Hall DB, Wang L. Two-component mixtures of generalized linear mixed effects models for cluster correlated data. Stat Model. 2005;5(1):21–37. [Google Scholar]
38.Inan G, Ilk O. A marginalized multilevel model for bivariate longitudinal binary data. Stat Pap. 2016. 10.1007/S00362-016-0840-1. [DOI] [Google Scholar]
39.Neuhaus JM, McCulloch CE. Estimation of covariate effects in generalized linear mixed models with informative cluster sizes. Biometrika. 2011;98(1):147–162. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Parzen M, Ghosh S, Lipsitz S, et al. A generalized linear mixed model for longitudinal binary data with a marginal logit link function. Ann Appl Stat. 2011;5(1):449. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Bender R, Augustin T, Blettner M. Generating survival times to simulate Cox proportional hazards models. Stat Med. 2005;24(11):1713–1723. [DOI] [PubMed] [Google Scholar]
42.Pénichoux J, Moreau T, Latouche A. Simulating recurrent events that mimic actual data: a review of the literature with emphasis on event-dependence. 2014. HAL Id: hal-01133497.

[R1] 1.Fine JP, Jiang H, Chappell R. On semi-competing risks data. Biometrika. 2001;88(4):907–919. [Google Scholar]

[R2] 2.Lancaster T, Intrator O. Panel data with survival: hospitalization of hiv-positive patients. J Am Stat Assoc. 1998;93(441):46–53. [Google Scholar]

[R3] 3.Chen BE, Cook RJ. Tests for multivariate recurrent events in the presence of a terminal event. Biostatistics. 2004;5(1):129–143. [DOI] [PubMed] [Google Scholar]

[R4] 4.Cook RJ, Lawless JF. Marginal analysis of recurrent events and a terminating event. Stat Med. 1997;16(8):911–924. [DOI] [PubMed] [Google Scholar]

[R5] 5.Ghosh D, Lin DY. Marginal regression models for recurrent and terminal events. Stat Sin. 2002;12:663–688. [Google Scholar]

[R6] 6.Liu L, Wolfe RA, Huang X. Shared frailty models for recurrent events and a terminal event. Biometrics. 2004;60(3):747–756. [DOI] [PubMed] [Google Scholar]

[R7] 7.Li R, Cheng Y. Flexible association modelling and prediction with semi-competing risks data. Can J Stat. 2016;44(3):361–374. [Google Scholar]

[R8] 8.Huang X, Wolfe RA. A frailty model for informative censoring. Biometrics. 2002;58(3):510–520. [DOI] [PubMed] [Google Scholar]

[R9] 9.Zeng D, Lin DY. Semiparametric transformation models with random effects for joint analysis of recurrent and terminal events. Biometrics. 2009;65(3):746–752. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Mozaffarian D, Benjamin EJ, Go AS, et al. Executive summary: heart disease and stroke statistics—2016 update: a report from the American Heart Association. Circ. 2016;133(4):447. [DOI] [PubMed] [Google Scholar]

[R11] 11.Azarpazhooh MR, Nicol MB, Donnan GA, et al. Patterns of stroke recurrence according to subtype of first stroke event: the North East Melbourne Stroke Incidence Study (NEMESIS). Int J Stroke. 2008;3(3):158–164. [DOI] [PubMed] [Google Scholar]

[R12] 12.Feng W, Hendry RM, Adams RJ. Risk of recurrent stroke, myocardial infarction, or death in hospitalized stroke patients. Neurology. 2010;74(7):588–593. [DOI] [PubMed] [Google Scholar]

[R13] 13.TruvenHealth. Marketscan research databases. 1990. https://marketscan.truvenhealth.com/marketscanportal/.

[R14] 14.DeSantis SM, Lazaridis C, Ji S, Spinale FG. Analyzing propensity matched zero-inflated count outcomes in observational studies. J Appl Stat. 2014;41(1):127–141. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] 15.Liu L, Huang X, Yaroshinsky A, Cormier JN. Joint frailty models for zero-inflated recurrent events in the presence of a terminal event. Biometrics. 2016;72(1):204–214. [DOI] [PubMed] [Google Scholar]

[R16] 16.Farewell VT. The use of mixture models for the analysis of survival data with long-term survivors. Biometrics. 1982;38(4):1041–1046. [PubMed] [Google Scholar]

[R17] 17.Kuk AYC, Chen CH. A mixture model combining logistic regression with proportional hazards regression. Biometrika. 1992;79(3):531–541. [Google Scholar]

[R18] 18.Peng Y, Taylor JMG. Mixture cure model with random effects for the analysis of a multi-center tonsil cancer study. Stat Med. 2011;30(3):211–223. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Sy JP, Taylor JMG. Estimation in a Cox proportional hazards cure model. Biometrics. 2000;56(1):227–236. [DOI] [PubMed] [Google Scholar]

[R20] 20.Rondeau V, Schaffner E, Corbière F, Gonzalez JR, Mathoulin-Pélissier S. Cure frailty models for survival data: application to recurrences for breast cancer and to hospital readmissions for colorectal cancer. Stat Methods Med Res. 2013;22(3):243–260. [DOI] [PubMed] [Google Scholar]

[R21] 21.Sastry N A nested frailty model for survival data, with an application to the study of child survival in northeast Brazil. J Am Stat Assoc. 1997;92(438):426–435. [DOI] [PubMed] [Google Scholar]

[R22] 22.Go AS, Parikh CR, Ikizler TA, et al. The assessment, serial evaluation, and subsequent sequelae of acute kidney injury (ASSESS-AKI) study: design and methods. BMC Nephrol. 2010;11(1):1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] 23.Rondeau V, Filleul L, Joly P. Nested frailty models using maximum penalized likelihood estimation. Stat Med. 2006;25(23):4036–4052. [DOI] [PubMed] [Google Scholar]

[R24] 24.Yau KKW. Multilevel models for survival analysis with random effects. Biometrics. 2001;57(1):96–102. [DOI] [PubMed] [Google Scholar]

[R25] 25.Liu L, Huang X. The use of gaussian quadrature for estimation in frailty proportional hazards models. Stat Med. 2008;27(14):2665–2683. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Ibrahim JG, Chen MH, Sinha D. Bayesian methods for jointmodeling of longitudinal and survival data with applications to cancer vaccine trials. Stat Sin. 2004;14:863–883. [Google Scholar]

[R27] 27.Ouyang B, Sinha D, Slate EH, Van Bakel AB. Bayesian analysis of recurrent event with dependent termination: an application to a heart transplant study. Stat Med. 2013;32(15):2629–2642. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Wen S, Huang X, Frankowski RF, Cormier JN, Pisters P. A bayesian multivariate joint frailty model for disease recurrences and survival. Stat Med. 2016;35(26):4794–4812. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Kim S, Zeng D, Chambless L, Li Y. Joint models of longitudinal data and recurrent events with informative terminal event. Stat Biosci. 2012;4(2):262–281. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Klein JP. Semiparametric estimation of random effects using the Cox model based on the EM algorithm. Biometrics. 1992;48:795–806. [PubMed] [Google Scholar]

[R31] 31.Vaida F, Xu R. Proportional hazards model with random effects. Stat Med. 2000;19(24):3309–3324. [DOI] [PubMed] [Google Scholar]

[R32] 32.Król A, Ferrer L, Pignon JP, et al. Joint model for left-censored longitudinal data, recurrent events and terminal event: predictive abilities of tumor burden for cancer evolution with application to the FFCD 2000-05 trial. Biometrics. 2016;72(3):907–916. [DOI] [PubMed] [Google Scholar]

[R33] 33.Louis TA. Finding the observed information matrix when using the EM algorithm. J R Stat Soc Ser B Methodol. 1982;44(2):226–233. [Google Scholar]

[R34] 34.Goldstein LB. Accuracy of ICD-9-CM coding for the identification of patients with acute ischemic stroke effect of modifier codes. Stroke. 1998;29(8):1602–1604. [DOI] [PubMed] [Google Scholar]

[R35] 35.Coull AJ, Rothwell PM. Underestimation of the early risk of recurrent stroke evidence of the need for a standard definition. Stroke. 2004;35(8):1925–1929. [DOI] [PubMed] [Google Scholar]

[R36] 36.Zeng D, Ibrahim JG, Chen MH, Hu K, Jia C. Multivariate recurrent events in the presence of multivariate informative censoring with applications to bleeding and transfusion events in myelodysplastic syndrome. J Biopharm Stat. 2014;24(2):429–442. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R37] 37.Hall DB, Wang L. Two-component mixtures of generalized linear mixed effects models for cluster correlated data. Stat Model. 2005;5(1):21–37. [Google Scholar]

[R38] 38.Inan G, Ilk O. A marginalized multilevel model for bivariate longitudinal binary data. Stat Pap. 2016. 10.1007/S00362-016-0840-1. [DOI] [Google Scholar]

[R39] 39.Neuhaus JM, McCulloch CE. Estimation of covariate effects in generalized linear mixed models with informative cluster sizes. Biometrika. 2011;98(1):147–162. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] 40.Parzen M, Ghosh S, Lipsitz S, et al. A generalized linear mixed model for longitudinal binary data with a marginal logit link function. Ann Appl Stat. 2011;5(1):449. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] 41.Bender R, Augustin T, Blettner M. Generating survival times to simulate Cox proportional hazards models. Stat Med. 2005;24(11):1713–1723. [DOI] [PubMed] [Google Scholar]

[R42] 42.Pénichoux J, Moreau T, Latouche A. Simulating recurrent events that mimic actual data: a review of the literature with emphasis on event-dependence. 2014. HAL Id: hal-01133497.

PERMALINK

Joint modeling of recurrent events and a terminal event adjusted for zero inflation and a matched design

Cong Xu

Vernon M Chinchilli

Ming Wang

Abstract

1 ∣. INTRODUCTION

2 ∣. JOINT MODELING

2.1 ∣. Notations

2.2 ∣. The joint frailty model of zero-inflated recurrent events and death

3 ∣. ESTIMATION

4 ∣. SIMULATION

TABLE 1.

TABLE 2.

5 ∣. APPLICATION-STROKE STUDY

Figure 1.

TABLE 3.

6 ∣. DISCUSSION

ACKNOWLEDGEMENTS

Appendix

APPENDIX A: E-STEP OF THE MCEM ALGORITHM

A.1 ∣. Conditional expectation

A.2 ∣. E-step

APPENDIX B : M-STEP OF THE MCEM ALGORITHM

APPENDIX C : SIMULATION ON RECURRENT EVENT TIMES

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Joint modeling of recurrent events and a terminal event adjusted for zero inflation and a matched design

Cong Xu

Vernon M Chinchilli

Ming Wang

Abstract

1 ∣. INTRODUCTION

2 ∣. JOINT MODELING

2.1 ∣. Notations

2.2 ∣. The joint frailty model of zero-inflated recurrent events and death

3 ∣. ESTIMATION

4 ∣. SIMULATION

TABLE 1.

TABLE 2.

5 ∣. APPLICATION-STROKE STUDY

Figure 1.

TABLE 3.

6 ∣. DISCUSSION

ACKNOWLEDGEMENTS

Appendix

APPENDIX A: E-STEP OF THE MCEM ALGORITHM

A.1 ∣. Conditional expectation

A.2 ∣. E-step

APPENDIX B : M-STEP OF THE MCEM ALGORITHM

APPENDIX C : SIMULATION ON RECURRENT EVENT TIMES

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases