Semiparametric Modelling and Estimation of Covariate-Adjusted Dependence Between Bivariate Recurrent Events

Jing Ning; Chunyan Cai; Yong Chen; Xuelin Huang; Mei-Cheng Wang

doi:10.1111/biom.13229

Author manuscript; available in PMC: 2020 Dec 15.

Published in final edited form as: Biometrics. 2020 Feb 18;76(4):1229–1239. doi: 10.1111/biom.13229

PMCID: PMC7384929 NIHMSID: NIHMS1596616 PMID: 31994170

Summary:

A time-dependent measure, termed the rate ratio, was proposed to assess the local dependence between two types of recurrent event processes in one-sample settings. However, the one-sample work does not model how the dependence varies with covariates such as subject characteristics and treatments received. The focus of this paper is to understand how, and to what extent, covariates influence the strength of dependence for bivariate recurrent events. We propose the covariate-adjusted rate ratio, a measure of covariate-adjusted dependence. We propose a semiparametric regression model for jointly modeling the frequency and dependence of bivariate recurrent events: the first level is a proportional rates model for the marginal rates and the second level is a proportional rate ratio model for the dependence structure. We develop a pseudo-partial likelihood to estimate the parameters in the proportional rate ratio model. We establish the asymptotic properties of the estimators and evaluate the finite sample performance via simulation studies. We illustrate the proposed models and methods using a soft tissue sarcoma study that examines the effects of initial treatments on the marginal frequencies of local/distant sarcoma recurrence and the dependence structure between the two types of cancer recurrence.

Keywords: Bivariate recurrent event, Covariate-adjusted rate ratio, Dependence structure, Joint model, Rate ratio

Recurrent event data are often encountered in reliability experiments (Dalal and McIntosh, 1994), insurance warranty claims (Lawless and Nadeau, 1995), and biomedical studies (Cook and Lawless, 2007). In these situations, there may exist more than one type of recurrent event of interest (Cai and Schaubel, 2004; Schaubel and Cai, 2005; Sun et al., 2009; Cook et al., 2010; Zhu et al., 2010; Zhao et al., 2012). When analyzing such data, it is important to assess the dependence structure between different types of recurrent events. Even when the dependence is not of primary scientific interest, assessing it helps verify the assumptions of a joint model and guides model selection. However, such an assessment is challenging because the dependence between recurrent events may vary not only with time, but also with patients’ characteristics and initial treatments received. Previous work has focused on how the dependence varies over time but has ignored the fact that the strength of dependence can be affected by covariates (Ventura et al., 2005; Ning et al., 2015).

This research is motivated by a soft tissue sarcoma study, in which 674 patients with stage III soft tissue sarcoma who were treated between 1984 and 1999 were identified from two comprehensive cancer centers and then were followed for cancer recurrences (Cormier et al., 2004). The study documented the timing of bivariate recurrence events: local disease recurrence, in which tumor cells that remain in the original site form another tumor over time, and distant disease recurrence, in which sarcoma spreads through the bloodstream to distant sites such as the lungs or liver. Previous research has shown that the dependence between local and distant cancer recurrences for patients with soft tissue sarcoma was not constant over time (Ning et al., 2015). A natural question is whether the dependency structure of the bivariate sarcoma recurrences depends not only on time, but also on other covariates, such as the initial treatments received by the patients with sarcoma, and if so, in what way?

A time-dependent measure, termed the rate ratio, has been used to assess the local dependence between two types of recurrent event processes (Ning et al., 2015). One advantage of using the rate ratio as the dependence measure is that it has an attractive relative probability interpretation, which is simple for practitioners to understand. There are often factors, such as patient characteristics and treatments received, that affect the strength of the dependence between different types of recurrent events. It may be important to understand how the dependence structure depends on such factors, although little has been done to develop measures and models for characterizing the dependence structure. For bivariate survival time, there is some work in the literature regarding how to model the covariate effects on the dependence structure, in which the covariate effects are often modeled through marginal distributions. For example, Fan and Prentice (2002) used a proportional hazards model on the marginal survival function to accommodate covariate effects. However, when the dependence structure itself is of major interest, modeling the covariate effects via marginal regression models does not explicitly determine how covariates change the strength of dependence nor by how much. In this paper, we propose the covariate-adjusted rate ratio for the dependence structure and consider a joint model of the bivariate recurrent events. In the joint model, we directly link the covariates and the covariate-adjusted rate ratio via a regression model, where the covariate effects are multiplicative on the baseline rate ratio function.

The rest of the paper is organized as follows. In Section 1, we introduce the notation, the covariate-adjusted dependence measure, and the semiparametric model. We then construct a two-stage estimating procedure in which a pseudo-partial likelihood is used to estimate parameters. We establish the large sample properties for the estimated parameters. We assess the empirical performance of the proposed estimators in Section 2, describe an application to the soft tissue sarcoma study in Section 3, and conclude with a discussion in Section 4.

1. METHOD

1.1. Notation and Model

Let {N₁(t), N₂(t), t ⩾ 0} represent the bivariate counting process for the number of type-specific events during the time [0, t]. Assume X_j (t) to be a p-dimensional vector of possibly time-dependent covariates for the jth type of recurrent event. We denote λ_j{t|x_j(t)} as the type-specific and covariate-adjusted rate function,

λ_{j} {t | x_{j} (t)} = \lim_{Δ \to 0 +} P {N_{j} (t + Δ) - N_{j} (t) > 0 | x_{j} (t)} / Δ .

We generalize the rate ratio to accommodate the covariate effects on the dependence measure, termed the covariate-adjusted rate ratio,

ρ {s, t | x_{1} (s), x_{2} (t)} = \frac{λ_{1 | 2} {s | t, x_{1} (s), x_{2} (t)}}{λ_{1} {s | x_{1} (s)}},

where λ_{1|2}{s | t, x₁(s), x₂(t)} is a covariate-adjusted conditional rate function defined as

\lim_{Δ \to 0^{+}} P {N_{1} (s + Δ) - N_{1} (s) > 0 | N_{2} (t + Δ) - N_{2} (t) > 0, X_{1} (s) = x_{1} (s), X_{2} (t) = x_{2} (t)} / Δ .

Under the assumption that λ_{j}{t | x_{j}(t), x_{j′}(s)} = λ_{j}{t | x_{j}(t)} for j ≠ j′ ∈ {1, 2}, it can be shown that the covariate-adjusted rate ratio is symmetric,

ρ {s, t | x_{1} (s), x_{2} (t)} = \frac{λ_{1 | 2} {s | t, x_{1} (s), x_{2} (t)}}{λ_{1} {s | x_{1} (s)}} = \frac{λ_{2 | 1} {t | s, x_{1} (s), x_{2} (t)}}{λ_{2} {t | x_{2} (t)}},

where

λ_{2 | 1} {t | s, x_{1} (s), x_{2} (t)} = \lim_{Δ \to 0^{+}} P {N_{2} (t + Δ) - N_{2} (t) > 0 | N_{1} (s + Δ) - N_{1} (s) > 0, x_{1} (s), x_{2} (t)} / Δ .

The above assumption implies that, given the covariate information up to time t of one process, the covariate information of the other process does not provide any additional information on the marginal rate function of the process. The covariate-adjusted rate ratio shares the same desirable properties as the rate ratio, including the relative probability interpretation (Ning et al., 2015). We use the covariate-adjusted rates in both the numerator and denominator to eliminate the influence of covariate effects on the frequencies of recurrent events, so that the covariate-adjusted rate ratio captures only the covariate effects on the strength of dependence between different types of recurrent events.

Note that the rate ratio can be rewritten as

ρ {s, t | x_{1} (s), x_{2} (t)} = \frac{\lim_{Δ \to 0^{+}} P {N_{2} (t + Δ) - N_{2} (t) > 0, N_{1} (s + Δ) - N_{1} (s) > 0 | x_{1} (s), x_{2} (t)} / Δ}{λ_{1} {s | x_{1} (s)} λ_{2} {t | x_{2} (t)}},

(1)

in which two marginal rate functions are involved. It implies that the covariate-adjusted rate ratio depends not only on the joint probability, but also on how the covariates affect the two marginal rate functions. This expression gives an alternative interpretation for the rate ratio. It provides a standardized co-occurrence rate of an event pair consisting of a type-1 event at s and a type-2 event at t, with the standard being the co-occurrence rate in a situation where these two types of events are independent of each other.

Without any parametric assumption, estimation of the covariate-adjusted rate ratio can be computationally prohibitive in the presence of continuous covariates. Our strategy is to impose semiparametric regression models that are flexible and easy to interpret. There are certain considerations for constructing regression models with a dependence structure for bivariate recurrent event data. First, we choose the exponential function as the link function in the regression model for the rate ratio, because the regression coefficients are then easy to interpret and the rate ratio is guaranteed to be non-negative. Second, when calculating the rate of the observed event that subject i experiences both types of events at times s and t, respectively, which plays a fundamental role in the construction of the likelihood function, we have

P (d N_{i 1} (s) = 1, d N_{i 2} (t) = 1 | x_{i 1} (s), x_{i 2} (t)) = λ_{1} {s | x_{i 1} (s)} λ_{2 | 1} {t | s, x_{i 1} (s), x_{i 2} (t)} d s d t = λ_{1} {s | x_{i 1} (s)} λ_{2} {t | x_{i 2} (t)} ρ {s, t | x_{i 1} (s), x_{i 2} (t)} d s d t .

(2)

This equation implies that the probability of observing a pair of events again depends on both the rate ratio and two marginal rates. Both equations (1) and (2) illustrate that the covariate-specific marginal rate functions need to be specified to evaluate the covariate effects on the strength of dependence. Following the above considerations, we propose the following joint model.

Level 1: Proportional rate model for N_j(.) conditioning on X_j(.) (Lin et al., 2000),

λ_{j} {s | x_{j} (s)} = λ_{j 0} (s) \exp {γ_{j}^{T} x_{j} (s)}, j = 1, 2,

(3)

where λ_{j0}(s) is an unspecified baseline rate function for the jth type of event and γ_j is a p-dimensional parameter vector that characterizes the covariate effects on the marginal rates.

Level 2: Proportional rate ratio model conditioning on (X₁(.),X₂(.)),

ρ {s, t | x_{1} (s), x_{2} (t), β, α} = ρ_{0} (s, t; β) \exp {α_{1}^{T} x_{1} (s) + α_{2}^{T} x_{2} (t)},

(4)

where $α = {(α_{1}^{T}, α_{2}^{T})}^{T}$ is a 2p-dimensional parameter vector and ρ₀(s,t;β) is a prespecified baseline rate ratio with a q-dimensional parameter β. For simplicity of notation, we use the same vector of covariates in the marginal models and the rate ratio model, but the two models are allowed to have different sets of covariates. The proportional rate ratio model directly links the covariates and the rate ratio by assuming that the covariate effect is multiplicative on the baseline rate ratio function. The parameter α describes the covariate effects on the strength of dependence of the bivariate recurrent events and has an interpretation as the log-relative rate ratio associated with the covariates. The baseline rate ratio function serves to identify the degree of dependence over time and is a constant under the commonly used joint random effect models (Sun et al., 2009; Zhu et al., 2010; Zhao et al., 2012; Ning et al., 2015). Flexible functions of time, such as polynomial functions or regression splines, could be used to specify the baseline rate ratio function.
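As a minimal sketch of how model (4) can be evaluated, the following Python snippet computes the covariate-adjusted rate ratio for given covariate values. The function name `rate_ratio`, the log-linear form of the baseline ρ₀, and all numerical values are assumptions made for illustration only; the paper requires only that ρ₀ be prespecified up to β.

```python
import numpy as np

def rate_ratio(s, t, x1, x2, beta, alpha1, alpha2):
    """Covariate-adjusted rate ratio under the proportional rate ratio model (4).

    ASSUMPTION (illustration only): log-linear baseline
    rho0(s, t; beta) = exp(beta[0] + beta[1] * s + beta[2] * t).
    """
    log_rho0 = beta[0] + beta[1] * s + beta[2] * t
    return np.exp(log_rho0 + np.dot(alpha1, x1) + np.dot(alpha2, x2))

# Illustrative values: constant baseline rate ratio of 2, one covariate per event type.
beta = np.array([np.log(2.0), 0.0, 0.0])
alpha1 = np.array([np.log(0.6)])   # effect of the type-1 covariate on dependence
alpha2 = np.array([0.0])           # no effect of the type-2 covariate

rho_ref = rate_ratio(1.0, 2.0, np.array([0.0]), np.array([0.5]), beta, alpha1, alpha2)
rho_exp = rate_ratio(1.0, 2.0, np.array([1.0]), np.array([0.5]), beta, alpha1, alpha2)

# The relative rate ratio for a one-unit change in x1 equals exp(alpha1).
print(rho_ref, rho_exp, rho_exp / rho_ref)
```

In this sketch the ratio `rho_exp / rho_ref` equals exp(α₁) regardless of s and t, which is the log-relative rate ratio interpretation of α described above.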

The probability structure on the bivariate recurrent events is determined by the bivariate intensity function, h(s, t|x₁(s),x₂(t)), which is defined as

\lim_{Δ \to 0^{+}} P {N_{1} (s + Δ) - N_{1} (s) > 0, N_{2} (t + Δ) - N_{2} (t) > 0 | H (s, t), x_{1} (s), x_{2} (t)} / Δ .

Here $H (s, t)$ is the history of the bivariate counting process up to time (s,t). The corresponding rate function represents the probability of events occurring at time (s,t) without conditioning on the event history. Although the proposed model assumes a parametric regression model on the rate ratio, it does not fully specify the probability structure of the bivariate processes (e.g., the bivariate intensity function); it is therefore semiparametric and enjoys model robustness. In contrast to shared random effects models with fully parametric assumptions, our semiparametric framework provides more flexibility in modeling and data application.

1.2. Estimation Procedure

Consider a study that involves n independent subjects, each of whom may experience two types of recurrent events in observation time period [0,τ]. For subject i, denote C_ij and Y_ij(t) = I(C_ij ⩾ t) as the censoring time of the jth type of recurrent event and its risk function, j = 1, 2. Let N_ij be the number of events of type j for subject i that occur over the interval [0, C_ij] and m_ij = N_ij(C_ij), i = 1, ⋯, n, j = 1,2. The observed event times of the ith subject are respectively t_i11 ⩽ t_i12 ⩽ ⋯ ⩽ t_{i1m_i1} and t_i21 ⩽ t_i22 ⩽ ⋯ ⩽ t_{i2m_i2} for the two types of recurrent events. Denote the observed covariate information as X_ij(.). For simplicity of notation, denote x_ijk = X_ij(t_ijk) for i = 1, ⋯, n, j = 1, 2 and k = 1, ⋯, m_ij.

The rate function gives the marginal probability, and its integral from 0 to t represents the mean number of events occurring up to time t. Therefore, the parameters under the proportional rate models can be estimated by solving the following moment-based estimating equations (Lin et al., 2000),

U_{j} (γ) = \sum_{i = 1}^{n} \int_{0}^{τ} {x_{i j} (u) - {\bar{x}}_{j} (γ_{j}; u)} d N_{i j} (u)

(5)

where a^{⊗0} = 1, a^{⊗1} = a, a^{⊗2} = aa^T, ${\bar{x}}_{j} (γ_{j}; u) = S^{(1)} (γ_{j}; u) / S^{(0)} (γ_{j}; u)$, and

S^{(k)} (γ_{j}; t) = n^{- 1} \sum_{i = 1}^{n} Y_{i j} (t) {x_{i j} (t)}^{\otimes k} \exp {γ_{j}^{T} x_{i j} (t)} .
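The estimating function in equation (5) can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' implementation: covariates are taken to be time-fixed, the function names `score_and_info` and `fit_rate_model` are hypothetical, the equation is solved by a plain Newton-Raphson iteration, and the toy data are simulated from a homogeneous Poisson process with an arbitrary baseline rate.

```python
import numpy as np

def score_and_info(gamma, x, cens, event_times):
    """Estimating function U_j(gamma) of equation (5) and minus its derivative.

    x: (n, p) covariates (ASSUMPTION: time-fixed); cens: (n,) censoring times;
    event_times: list of n arrays holding each subject's observed event times.
    """
    n, p = x.shape
    w = np.exp(x @ gamma)                      # exp(gamma^T x_i)
    U = np.zeros(p)
    info = np.zeros((p, p))
    for i in range(n):
        for u in event_times[i]:
            at_risk = cens >= u                # Y_ij(u) = I(C_ij >= u)
            xr, wr = x[at_risk], w[at_risk]
            s0 = wr.sum()                      # n * S^(0); the factor n cancels in ratios
            s1 = wr @ xr                       # n * S^(1)
            s2 = (xr * wr[:, None]).T @ xr     # n * S^(2)
            xbar = s1 / s0
            U += x[i] - xbar
            info += s2 / s0 - np.outer(xbar, xbar)
    return U, info

def fit_rate_model(x, cens, event_times, n_iter=25, tol=1e-8):
    """Solve U_j(gamma) = 0 by Newton-Raphson, starting from gamma = 0."""
    gamma = np.zeros(x.shape[1])
    for _ in range(n_iter):
        U, info = score_and_info(gamma, x, cens, event_times)
        step = np.linalg.solve(info, U)
        gamma = gamma + step
        if np.max(np.abs(step)) < tol:
            break
    return gamma

# Toy data (all values illustrative): binary covariate, uniform censoring on [3, 12].
rng = np.random.default_rng(1)
n, true_gamma = 200, np.array([0.4])
x = rng.binomial(1, 0.5, size=(n, 1)).astype(float)
cens = rng.uniform(3.0, 12.0, size=n)
rate = 0.3 * np.exp(x @ true_gamma)
event_times = [np.sort(rng.uniform(0.0, c, size=rng.poisson(r * c)))
               for r, c in zip(rate, cens)]

gamma_hat = fit_rate_model(x, cens, event_times)
U_hat, _ = score_and_info(gamma_hat, x, cens, event_times)
print(gamma_hat, U_hat)
```

Because only the ratio S^(1)/S^(0) enters the equation, the n^{-1} normalization is omitted in the code without changing the solution.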

Under the proposed model, we specify the rates and rate ratio of the recurrent event processes, but do not fully specify the probability structure of the bivariate recurrent event processes (e.g., the bivariate intensity function). Accordingly, the full likelihood method cannot be applied to estimate the unknown parameters under the proposed joint model. Unlike the proportional rate model, the proportional rate ratio model is built on the ratio of conditional probabilities; it has no direct relationship with the mean or variance of the recurrent events. Hence the moment-based approach is not well suited to estimating the unknown parameters under the proportional rate ratio model.

To solve this estimation challenge, we construct a pseudo-partial likelihood by mimicking the partial likelihood under the Cox proportional hazards model (Cox, 1975). Denote R(s, t) = {i : C_i1 ⩾ s, C_i2 ⩾ t} as the bivariate risk set for bivariate recurrent events at time (s, t), and n(s,t) as the size of the risk set R(s, t). When we apply the partial likelihood idea to the observed bivariate recurrent event data, we need to identify all risk sets at all possible paired observation times for the different types of events. Hence, unlike the construction of univariate risk sets in the standard partial likelihood, the observed recurrent event times from one subject can contribute to multiple risk sets due to the unique data structure of bivariate recurrent event data. Note that the paired time points can come from the same subject or from two different subjects, even though they are different types of events.

For each bivariate risk set at time (s, t), we consider the following event, called e_ii′(s,t):

e_{i i^{'}} (s, t) = {i, i^{'} \in R (s, t) : d N_{i 1} (s) = 1, d N_{i^{'} 2} (t) = 1} .

When i = i′, event e_ii′(s,t) occurs if subject i in the risk set R(s,t) experiences both types of events at times s and t, respectively. When i ≠ i′, event e_ii′(s,t) occurs if subject i in the risk set R(s,t) experiences the first type of event at time s and another subject i′ in the risk set R(s,t) experiences the second type of event at time t. Calculating the event probabilities conditional on the composition of the risk set, we have

\Pr {e_{i i} (s, t) | R (s, t)} = \frac{I_{i i} (s, t) λ_{1} {s | x_{i 1} (s)} λ_{2 | 1} {t | s, x_{i 1} (s), x_{i 2} (t)}}{\sum_{i = 1}^{n} I_{i i} (s, t) λ_{1} {s | x_{i 1} (s)} λ_{2 | 1} {t | s, x_{i 1} (s), x_{i 2} (t)} + \sum_{i = 1}^{n} \sum_{i^{'} \neq i} I_{i i^{'}} (s, t) λ_{1} {s | x_{i 1} (s)} λ_{2} {t | x_{i^{'} 2} (t)}}

and for i ≠ i′,

\Pr {e_{i i^{'}} (s, t) | R (s, t)} = \frac{I (i \neq i^{'}) I_{i i^{'}} (s, t) λ_{1} {s | x_{i 1} (s)} λ_{2} {t | x_{i^{'} 2} (t)}}{\sum_{i = 1}^{n} I_{i i} (s, t) λ_{1} {s | x_{i 1} (s)} λ_{2 | 1} {t | s, x_{i 1} (s), x_{i 2} (t)} + \sum_{i = 1}^{n} \sum_{i^{'} \neq i} I_{i i^{'}} (s, t) λ_{1} {s | x_{i 1} (s)} λ_{2} {t | x_{i^{'} 2} (t)}}

where I_ii(s,t) = I(C_i1 ⩾ s, C_i2 ⩾ t) and I_ii′(s,t) = I_ii(s,t)I_i′i′(s,t). Using the proportional rate model and the definition of the rate ratio, we can eliminate the unspecified baseline rate function and further simplify the above two conditional probabilities:

\Pr {e_{i i} (s, t) | R (s, t)} = I_{i i} (s, t) ρ (s, t | x_{i 1} (s), x_{i 2} (t), η) \exp {γ_{1}^{T} x_{i 1} (s) + γ_{2}^{T} x_{i 2} (t)} / P (s, t; γ, η)

\Pr {e_{i i^{'}} (s, t) | R (s, t)} = I (i \neq i^{'}) I_{i i^{'}} (s, t) \exp {γ_{1}^{T} x_{i 1} (s) + γ_{2}^{T} x_{i^{'} 2} (t)} / P (s, t; γ, η)

where γ = (γ₁,γ₂), η = (β,α) and

P (s, t; γ, η) = \sum_{i = 1}^{n} I_{i i} (s, t) ρ (s, t | x_{i 1} (s), x_{i 2} (t), η) \exp {γ_{1}^{T} x_{i 1} (s) + γ_{2}^{T} x_{i 2} (t)} + \sum_{i = 1}^{n} \sum_{i^{'} \neq i} I_{i i^{'}} (s, t) \exp {γ_{1}^{T} x_{i 1} (s) + γ_{2}^{T} x_{i^{'} 2} (t)} .
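The elimination of the baseline rate functions can be written out term by term. The display below is a sketch that uses only model (3), model (4), and the identity in equation (2); it restates the simplification step and is not an additional result.

```latex
\begin{aligned}
\lambda_{1}\{s \mid x_{i1}(s)\}\,\lambda_{2|1}\{t \mid s, x_{i1}(s), x_{i2}(t)\}
  &= \lambda_{1}\{s \mid x_{i1}(s)\}\,\lambda_{2}\{t \mid x_{i2}(t)\}\,
     \rho\{s, t \mid x_{i1}(s), x_{i2}(t)\} \\
  &= \lambda_{10}(s)\,\lambda_{20}(t)\,
     \exp\{\gamma_{1}^{T} x_{i1}(s) + \gamma_{2}^{T} x_{i2}(t)\}\,
     \rho(s, t \mid x_{i1}(s), x_{i2}(t), \eta), \\
\lambda_{1}\{s \mid x_{i1}(s)\}\,\lambda_{2}\{t \mid x_{i'2}(t)\}
  &= \lambda_{10}(s)\,\lambda_{20}(t)\,
     \exp\{\gamma_{1}^{T} x_{i1}(s) + \gamma_{2}^{T} x_{i'2}(t)\}.
\end{aligned}
```

Every term in the numerators and in the common denominator carries the factor λ_{10}(s)λ_{20}(t), which therefore cancels and leaves P(s, t; γ, η) as the normalizing sum.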

Of note, for each bivariate risk set that can be constructed from the observed event times (s, t), either e_ii(s, t) or e_ii′(s,t)(i ≠ i′) must occur. The pseudo-partial likelihood is the product of all of the conditional probabilities of e_ii or e_ii′ (i ≠ i′), given the corresponding risk sets constructed from the observed event times. Synthesizing this information, we have the log-pseudo-partial likelihood as

L_{p} (γ, η) = \sum_{i = 1}^{n} \sum_{k = 1}^{m_{i 1}} \sum_{k^{'} = 1}^{m_{i 2}} \log [\frac{ρ (t_{i 1 k}, t_{i 2 k^{'}} | x_{i 1 k}, x_{i 2 k^{'}}, η) \exp {γ_{1}^{T} x_{i 1 k} + γ_{2}^{T} x_{i 2 k^{'}}}}{P (t_{i 1 k}, t_{i 2 k^{'}}; γ, η)}] + \sum_{i = 1}^{n} \sum_{i^{'} \neq i}^{n} \sum_{k = 1}^{m_{i 1}} \sum_{k^{'} = 1}^{m_{i^{'} 2}} I_{i^{'} i} (t_{i 1 k}, t_{i^{'} 2 k^{'}}) \log [\frac{\exp {γ_{1}^{T} x_{i 1 k} + γ_{2}^{T} x_{i^{'} 2 k^{'}}}}{P (t_{i 1 k}, t_{i^{'} 2 k^{'}}; γ, η)}] .

(6)
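A direct, unoptimized evaluation of the log-pseudo-partial likelihood in equation (6) might look like the following Python sketch. It assumes time-fixed covariates; the function names, the argument layout, and the constant-baseline form of `log_rho` are hypothetical choices made for illustration. The double sum over i′ ≠ i in P(s, t; γ, η) is computed through the identity Σ_i Σ_{i′≠i} r_i r_{i′} a_i b_{i′} = (Σ r a)(Σ r b) − Σ r a b, where r is the bivariate at-risk indicator.

```python
import numpy as np

def log_rho(s, t, x1, x2, eta):
    """ASSUMPTION: constant baseline, log rho = eta[0] + alpha1^T x1 + alpha2^T x2."""
    p = x1.shape[-1]
    return eta[0] + x1 @ eta[1:1 + p] + x2 @ eta[1 + p:1 + 2 * p]

def log_pseudo_partial_lik(eta, gamma1, gamma2, x1, x2, c1, c2, t1, t2, log_rho):
    """Equation (6) for time-fixed covariates.

    x1, x2: (n, p) covariates; c1, c2: (n,) censoring times;
    t1, t2: lists of n arrays with the type-1 and type-2 event times.
    """
    lin1, lin2 = x1 @ gamma1, x2 @ gamma2
    a, b = np.exp(lin1), np.exp(lin2)
    n = len(c1)
    total = 0.0
    for i in range(n):
        for s in t1[i]:
            for ip in range(n):
                for t in t2[ip]:
                    r = (c1 >= s) & (c2 >= t)      # I_ll(s, t) for every subject l
                    if i != ip and not (r[i] and r[ip]):
                        continue                   # pair is not in the risk set R(s, t)
                    lr = log_rho(s, t, x1, x2, eta)
                    same = np.sum(r * np.exp(lr) * a * b)
                    cross = np.sum(r * a) * np.sum(r * b) - np.sum(r * a * b)
                    term = lin1[i] + lin2[ip] - np.log(same + cross)
                    if i == ip:
                        term += lr[i]              # rho enters only for same-subject pairs
                    total += term
    return total

# Tiny check: 3 subjects, no covariate effects, nobody censored before any event.
x1 = np.zeros((3, 1)); x2 = np.zeros((3, 1))
c1 = np.full(3, 5.0); c2 = np.full(3, 5.0)
t1 = [np.array([1.0]), np.array([2.0]), np.array([])]
t2 = [np.array([1.5]), np.array([]), np.array([0.5, 2.5])]
gamma0 = np.zeros(1)

value_indep = log_pseudo_partial_lik(np.zeros(3), gamma0, gamma0,
                                     x1, x2, c1, c2, t1, t2, log_rho)
value_dep = log_pseudo_partial_lik(np.array([np.log(2.0), 0.0, 0.0]), gamma0, gamma0,
                                   x1, x2, c1, c2, t1, t2, log_rho)
print(value_indep, value_dep)
```

With ρ ≡ 1 and no covariate effects, P(s, t) reduces to the squared size of the risk set, so each of the six event pairs in the toy data contributes −log 9. In principle, the negative of this function with γ fixed at its first-stage estimate could be handed to a general-purpose optimizer to obtain η̂.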

This pseudo-partial likelihood naturally extends the partial likelihood method for right-censored single time-to-event data to bivariate recurrent event data. For the estimation procedure, we can jointly solve the estimating equations (5) for the marginal rates and the score equation of the pseudo-partial likelihood. Alternatively, we can first estimate the covariate effects on the marginal rates, denoted as $\hat{γ}$, by solving estimating equation (5), since that equation does not involve the parameter η. We then substitute $\hat{γ}$ for γ in the log-pseudo-partial likelihood function, giving $L_{p} (\hat{γ}, η)$, and maximize $L_{p} (\hat{γ}, η)$ to obtain the estimator of η, denoted as $\hat{η}$. The corresponding score equation is

U (η, \hat{γ}) = \sum_{i = 1}^{n} \sum_{i^{'} = 1}^{n} \sum_{k = 1}^{m_{i 1}} \sum_{k^{'} = 1}^{m_{i^{'} 2}} [I (i = i^{'}) \nabla_{η} \log {ρ (t_{i 1 k}, t_{i 2 k^{'}} | x_{i 1 k}, x_{i 2 k^{'}}, η)} - I_{i^{'} i} (t_{i 1 k}, t_{i^{'} 2 k^{'}}) \nabla_{η} \log {P (t_{i 1 k}, t_{i^{'} 2 k^{'}}; \hat{γ}, η)}],

where ∇_η denotes the first derivative with respect to η. In theory, solving the two sets of equations simultaneously or sequentially yields identical estimators; numerically, the two estimators can differ because they are usually not exact solutions to the equations. We conducted simulation studies and confirmed that the differences between the two estimating procedures are negligible in our setting. Accordingly, we solved the two sets of equations sequentially in our simulation studies and application, because the sequential approach is computationally more efficient.

1.3. Asymptotic Behavior

The asymptotic properties of $\hat{γ}$ have been well established (Lin et al., 2000). We need to consider the asymptotic performance of the pseudo-partial likelihood and the influence of plugging $\hat{γ}$ into the pseudo-partial likelihood. Note that the proposed likelihood is not a true partial likelihood because the conditional events from which its components are constructed do not come from a nested sequence; hence, martingale theory cannot be applied to derive the asymptotic properties of the estimators obtained by maximizing the pseudo-partial likelihood. Nevertheless, each component in the pseudo-partial likelihood is a legitimate conditional density, and it can be shown that the corresponding score equation is an unbiased estimating equation for η, although the correlation between the observed events on the observed bivariate risk sets is not accounted for in the construction of the pseudo-partial likelihood. We apply empirical process theory and U-statistic theory to establish the consistency and asymptotic normality of the estimators under the regularity conditions listed in the Supporting Information. We summarize these results in the following theorem and provide the proof in the Supporting Information.

Theorem 1: Under regularity conditions [C.1-C.5] listed in the Supporting Information, the estimator $\hat{η}$ converges to the true value η₀ in probability. Moreover, $\sqrt{n} (\hat{η} - η_{0})$ converges in distribution to a normal distribution with mean 0 and the covariance matrix as defined in the Supporting Information.

2. SIMULATIONS

2.1. Design and Data Generation

We conducted simulation studies to evaluate the finite sample performance of the proposed method. We simulated bivariate recurrent event data through shared random effects models. Specifically, we generated the shared random effects from a Gamma distribution with shape parameter a and scale parameter b. We further increased the random effects by δ to avoid extremely small random effects that might lead to very rare recurrent events. We considered two covariates for both the proportional rate models and the proportional rate ratio models: X₁ ~ Bernoulli(0.5) and X₂ ~ Uniform(0,1). We generated independent censoring times from a uniform distribution on the interval [3, 12]. We used two sample sizes (n = 200 and n = 400) with 1000 replications for each scenario. We generated bivariate recurrent event data from two scenarios with different dependence structures (Level 2 model). In Scenario 1, the two recurrent events have covariate-varying dependence, and in Scenario 2, the events have time-varying dependence in addition to covariate-varying dependence. Please see the Supporting Information for more details on the data generation.

For the variance estimation, the explicit form of the asymptotic variance of the estimators is very complicated and involves unknown functions such as the baseline rate functions, so directly obtaining a consistent variance estimator is computationally difficult and unstable. Given the established root-n convergence rate, we can use resampling methods (e.g., the bootstrap or the jackknife) to estimate the variance of the estimators. We chose the jackknife method for two main reasons: 1) For time-to-event data with a small or moderate sample size, the nonparametric bootstrap, which samples randomly with replacement, can produce a data set with fewer events, and the estimation procedure may then fail to converge on the resampled data. This is not an issue with the jackknife method. 2) In our experience, the jackknife bias-correction formula can substantially reduce the biases of the estimators when the sample size is small or moderate. Although the bootstrap provides more general information (the sampling distribution of an estimator), the jackknife method is preferred for our purpose because of its explicit bias-correction formula (Shao and Wu, 1989). Denote ${\hat{η}}_{(- i)}$ as the estimator obtained with the i-th observation deleted. Then the bias-corrected jackknife estimator is ${\hat{η}}_{jack} = n \hat{η} - (n - 1) {\hat{η}}_{(\cdot)}$, where ${\hat{η}}_{(\cdot)} = \sum_{i = 1}^{n} {\hat{η}}_{(- i)} / n$ is the empirical average of the jackknife replicates. The corresponding jackknife standard error is defined as

SE (\hat{η})_{jack} = \left\{ \frac{n - 1}{n} \sum_{i = 1}^{n} ({\hat{η}}_{(- i)} - {\hat{η}}_{(\cdot)})^{2} \right\}^{1 / 2} .
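The two jackknife formulas above translate directly into code. The following Python sketch is generic: the helper name `jackknife` and the toy estimators are assumptions for illustration, whereas in the paper each leave-one-out replicate would be the estimate η̂ recomputed with one subject deleted.

```python
import numpy as np

def jackknife(estimator, data):
    """Bias-corrected jackknife estimate and jackknife standard error.

    estimator: function mapping a data array to a scalar or vector estimate;
    data: array whose first axis indexes the independent subjects.
    """
    n = len(data)
    full = np.asarray(estimator(data), dtype=float)
    # Leave-one-out replicates eta_hat_(-i)
    loo = np.array([estimator(np.delete(data, i, axis=0)) for i in range(n)], dtype=float)
    loo_mean = loo.mean(axis=0)                       # eta_hat_(.)
    bias_corrected = n * full - (n - 1) * loo_mean    # eta_hat_jack
    se = np.sqrt((n - 1) / n * np.sum((loo - loo_mean) ** 2, axis=0))
    return bias_corrected, se

data = np.array([1.0, 2.0, 4.0, 7.0, 11.0])

# For the sample mean, the jackknife SE reproduces the usual s / sqrt(n).
mean_jack, mean_se = jackknife(np.mean, data)

# For the plug-in variance (divisor n), the bias correction recovers the unbiased variance.
var_jack, _ = jackknife(np.var, data)
print(mean_se, var_jack)
```

The two toy estimators are chosen because their jackknife results are known in closed form, which makes the sketch easy to verify.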

2.2. Simulation Results

We obtained the proposed estimates by solving estimating equation (5) and maximizing the log-pseudo-partial likelihood specified in equation (6); the maximization was implemented with the optim function in R (R Development Core Team). Tables 1 and 2 summarize the simulation results under Scenarios 1 and 2, respectively, including the empirical biases of the estimators and of the bias-corrected jackknife estimators, the empirical standard deviations, the jackknife standard error estimates, and the coverage probabilities of the 95% Wald-type confidence intervals based on the original estimators.

Table 1.

Simulation results under Scenario 1

	Proportional rate model						Proportional rate ratio model
n	γ	Bias	ESD	JSE	Bias_j	CP	(β,α)	Bias	ESD	JSE	Bias_j	CP
Low event scenario: # of first type of events=1.48, # of second type of events=1.49
200	γ₁₁ = .2	.010	.168	.165	.005	94.4	β₁ = .693	−.066	.324	.309	.000	92.7
	γ₁₂ = 0	.003	.273	.275	.003	95.8	β₂ = .0	−.001	.021	.020	−.001	93.3
	γ₂₁ = .4	.010	.172	.167	.004	95.1	β₃ = .0	−.001	.021	.021	.000	95.4
	γ₂₂ = 0	.004	.267	.268	.004	95.5	α₁ = −.511	.038	.254	.244	−.004	92.6
							α₂ = .0	.001	.418	.385	.002	94.6
400	γ₁₁ = .2	.001	.121	.116	−.002	94.0	β₁ = .693	−.048	.248	.282	.015	94.8
	γ₁₂ = 0	.001	.205	.194	.001	93.2	β₂ = .0	.000	.014	.018	−.002	95.4
	γ₂₁ = .4	−.003	.120	.117	−.006	93.9	β₃ = .0	−.001	.015	.018	−.003	95.2
	γ₂₂ = 0	.000	.189	.189	.000	95.8	α₁ = −.511	.017	.189	.224	−.007	94.3
							α₂ = .0	.024	.332	.331	.022	94.9
High event scenario: # of first type of events=2.04, # of second type of events=2.07
200	γ₁₁ = .2	.010	.155	.152	.005	95.0	β₁ = .693	−.064	.295	.286	.009	91.0
	γ₁₂ = 0	.004	.255	.252	.004	94.6	β₂ = .0	−.001	.018	.018	−.001	95.1
	γ₂₁ = .4	.008	.158	.154	.003	94.5	β₃ = .0	−.001	.017	.018	−.001	94.9
	γ₂₂ = 0	.004	.246	.244	.004	95.5	α₁ = −.511	.037	.240	.231	−.006	91.4
							α₂ = .0	.007	.391	.354	.000	94.8
400	γ₁₁ = .2	.000	.111	.108	−.002	94.0	β₁ = .693	−.047	.208	.212	−.010	91.8
	γ₁₂ = 0	−.001	.183	.179	.000	94.4	β₂ = .0	.001	.012	.013	.000	95.1
	γ₂₁ = .4	−.001	.111	.107	−.004	93.6	β₃ = .0	−.001	.012	.013	−.001	95.4
	γ₂₂ = 0	.003	.171	.172	.003	95.5	α₁ = −.511	.020	.172	.173	−.005	92.5
							α₂ = .0	.018	.271	.273	.020	95.7

Bias, empirical bias; ESD, empirical standard deviation; JSE, standard error estimator by the jackknife method; Bias_j, empirical bias of bias corrected Jackknife estimator; CP, coverage probability of the 95% confidence intervals.

Table 2.

Simulation results under Scenario 2

	Proportional rate model						Proportional rate ratio model
n	γ	Bias	ESD	JSE	Bias_j	CP	(β,α)	Bias	ESD	JSE	Bias_j	CP
Low event scenario: # of first type of events=1.48, # of second type of events=1.49
200	γ₁₁ = .2	.000	.214	.212	.003	95.4	β₁ = .511	−.073	.424	.393	.013	94.9
	γ₁₂ = 0	−.017	.365	.370	−.018	95.0	β₂ = .182	.005	.225	.220	−.009	94.0
	γ₂₁ = .4	.001	.208	.208	.003	95.0	α₁ = .336	−.049	.340	.323	−.030	92.9
	γ₂₂ = 0	−.008	.371	.368	−.008	94.9	α₂ = .0	−.032	.648	.585	−.039	95.5
400	γ₁₁ = .2	.000	.147	.149	.001	94.7	β₁ = .511	−.067	.313	.316	−.004	94.8
	γ₁₂ = 0	.009	.266	.262	.009	94.4	β₂ = .182	.012	.170	.170	.003	93.7
	γ₂₁ = .4	.000	.148	.146	.002	95.3	α₁ = .336	−.019	.259	.253	−.015	92.1
	γ₂₂ = 0	.013	.269	.262	.013	94.5	α₂ = .0	.007	.505	.479	.012	94.6
High event scenario: # of first type of events=1.97, # of second type of events=2.08
200	γ₁₁ = .2	−.003	.203	.202	.001	95.4	β₁ = .511	−.070	.405	.374	.016	94.7
	γ₁₂ = 0	−.018	.348	.354	−.018	94.4	β₂ = .182	.006	.206	.203	−.007	94.4
	γ₂₁ = .4	.002	.195	.197	.005	95.4	α₁ = .336	−.048	.324	.306	−.032	92.5
	γ₂₂ = 0	−.004	.353	.351	−.004	94.8	α₂ = .0	−.030	.623	.561	−.037	95.3
400	γ₁₁ = .2	−.001	.144	.143	.001	93.9	β₁ = .511	−.062	.301	.295	−.011	94.9
	γ₁₂ = 0	.008	.254	.252	.007	93.8	β₂ = .182	.010	.161	.155	.002	93.8
	γ₂₁ = .4	.001	.141	.139	.002	94.8	α₁ = .336	−.017	.250	.239	−.004	92.7
	γ₂₂ = 0	.012	.255	.250	.012	94.5	α₂ = .0	.003	.490	.457	.009	94.1

As seen in the left panel of Table 1, the regression coefficients under the proportional rate models were well estimated with small empirical biases under Scenario 1. The associated empirical standard deviations and estimated standard errors agreed well, and the coverage probabilities of the 95% confidence intervals were close to 95%. As expected, the standard errors of the estimators decreased with increasing average numbers of recurrent events and increasing sample sizes. For the parameters under the proportional rate ratio model, the empirical biases were relatively large with a small sample (n=200), but these biases decreased with increasing sample size (n=400). All coverage probabilities were in a reasonable range, from 91.0% to 95.7%. We found that the jackknife method substantially reduced the biases, particularly for small sample sizes, and accurately captured the variability of the proposed estimating procedure. By comparing the results from the low-event and high-event scenarios, we found that increasing the frequency of the recurrent events improves the precision of the estimated regression coefficients under both the proportional rate and rate ratio models. Note that we included two time-related terms and an unrelated continuous covariate X₂ in the model, although the degree of dependence between the two recurrent events did not depend on time or on this covariate. The corresponding parameters β₂, β₃, and α₂ were well estimated with empirical biases smaller than 0.025, suggesting that the proposed method performed well even when the true values of these parameters are zero.

We also evaluated the performance of the proposed method in the scenarios with time-varying dependence (Scenario 2). We fitted the proportional rate models specified in Equations (1) and (2) of the Supporting Information and the proportional rate ratio model specified in Equation (4) of the Supporting Information. As seen in Table 2, both the estimators and the associated inferences were accurate; the empirical biases were small and the coverage probabilities were close to the nominal values. Again, the jackknife method helped reduce the estimation biases for the scenarios with a small sample size (n=200). Under this scenario, the degree of dependence changed with the time of the second event occurrence, in addition to the covariate X₁. The satisfactory performance of the estimated β₂, α₁ and α₂ confirmed that the proposed model and method can simultaneously evaluate the time trend and covariate effects on the dependence structure.

The rate ratio function under the commonly used shared random effects model for bivariate recurrent event data (Zhu et al., 2010; Ning et al., 2017) is a constant, determined by the coefficient of variation of the shared random effects (Ning et al., 2015). This implies that shared random effects models assume that the strength of dependence is constant over time and cannot be affected by covariates. Hence, shared random effects models cannot be used to characterize the dependence structure if the constant dependence assumption is violated. We conducted sensitivity studies to evaluate the robustness of the shared random effects model with respect to violations of this assumption. We fitted the semiparametric shared random effects model (Zhu et al., 2010; Ning et al., 2017) and summarized the results in Table S2 of the Supporting Information. The results confirmed that the shared random effects model cannot capture the underlying dependence structure between the bivariate recurrent events. For example, the estimated rate ratio under Scenario 1 was 1.56, although the true rate ratios were 1.2 and 2.0 for the two subgroups with X₁ = 0 and X₁ = 1, respectively. Interestingly, in both scenarios, the estimated regression coefficients under the marginal rate models performed reasonably well. This suggests that the shared random effects models had a certain robustness with respect to the parameters of the marginal models when the dependence structure between the bivariate recurrent events was misspecified.
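The constancy of the rate ratio under a shared random effect can be seen from a short calculation. The sketch below assumes a multiplicative shared random effect Z with mean μ and variance σ², conditional independence of the two processes given Z, and conditional rates Zλ₁₀(s) and Zλ₂₀(t); these symbols are introduced here for illustration and may differ from the notation of the cited papers.

```latex
\begin{aligned}
λ_{1}(s) &= E\{Z λ_{10}(s)\} = μ\, λ_{10}(s), \qquad
λ_{2}(t) = E\{Z λ_{20}(t)\} = μ\, λ_{20}(t), \\
λ_{12}(s, t) &= E\{Z^{2} λ_{10}(s) λ_{20}(t)\} = (μ^{2} + σ^{2})\, λ_{10}(s) λ_{20}(t), \\
ρ(s, t) &= \frac{λ_{12}(s, t)}{λ_{1}(s) λ_{2}(t)} = \frac{μ^{2} + σ^{2}}{μ^{2}} = 1 + \left(\frac{σ}{μ}\right)^{2}.
\end{aligned}
```

Under these assumptions the rate ratio equals one plus the squared coefficient of variation of Z, which depends on neither s nor t. Multiplicative covariate effects of the form exp(γx) on the conditional rates cancel in the ratio in the same way.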

In summary, the simulation studies suggest that the proposed method can accurately estimate the covariate effects on the frequency and dependence of recurrent events under both time-invariant and time-varying dependence, with small biases and well-estimated standard errors.
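The summary measures reported in the simulation tables (empirical bias, empirical standard deviation, average estimated standard error, and coverage probability) can be computed from the replicate-level output as sketched below. The function name `summarize_replicates`, the normal-approximation interval with z = 1.96, and the toy numbers are illustrative assumptions rather than the code used to produce Tables 1 and 2.

```python
import numpy as np

def summarize_replicates(estimates, std_errors, true_value, z=1.96):
    """Summarize simulation replicates for a single parameter.

    estimates, std_errors: one entry per simulated data set.
    Coverage uses Wald-type intervals estimate +/- z * SE (assumed form).
    """
    estimates = np.asarray(estimates, dtype=float)
    std_errors = np.asarray(std_errors, dtype=float)
    lower = estimates - z * std_errors
    upper = estimates + z * std_errors
    return {
        "bias": float(estimates.mean() - true_value),
        "empirical_sd": float(estimates.std(ddof=1)),
        "mean_se": float(std_errors.mean()),
        "coverage": float(np.mean((lower <= true_value) & (true_value <= upper))),
    }

# Toy replicates (made-up numbers, for illustration only).
summary = summarize_replicates(
    estimates=[0.9, 1.1, 1.0, 1.6],
    std_errors=[0.2, 0.2, 0.2, 0.2],
    true_value=1.0,
)
```

Agreement between `empirical_sd` and `mean_se`, together with coverage near the nominal level, is the pattern described above for the proposed estimators.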

3. APPLICATION

Understanding the frequency and dependence structure between different types of cancer recurrence and associated treatment effects may help clinicians and patients to make better treatment decisions. We applied the proposed method to a soft tissue sarcoma study (Cormier et al., 2004). A cohort of 679 patients was identified from two major cancer centers, in which patients may experience local recurrence of sarcoma (in the same or nearby part of the body where the primary cancer occurred) and distant recurrence (in a different part of the body). All patients in the cohort received definitive surgical resection of the tumor as their initial treatment. A clinical question of interest is whether other initial treatment choices, such as chemotherapy and radiation, have an impact on the frequency and the dependence between the two different types of sarcoma recurrence. The follow-up period ranged from 0.01 to 18.57 years, with a median of 4.2 years. During the follow-up, 820 cancer recurrences were observed among the 674 patients. At least one event of local cancer recurrence was experienced by 235 patients, at least one event of distant cancer recurrence was experienced by 411 patients, and multiple events of cancer recurrence were experienced by 271 patients. Of the patients who experienced multiple events, 200 experienced both types of sarcoma recurrence.

We considered three sets of models to evaluate the relationship between initial treatments and the frequency and dependence of sarcoma recurrence after adjusting for patient age at the time of the initial treatments. We incorporated our previous findings about the time-varying dependence into the baseline rate ratio by dichotomizing the time scales of the local and distant recurrences at the median follow-up time (4.2 years):

Model (1):

$$
\begin{aligned}
λ_{1}(s) &= λ_{10}^{1}(s) \exp\{γ_{11}^{1} I(\text{Chemotherapy}) + γ_{12}^{1} \text{Age}\}, \\
λ_{2}(t) &= λ_{20}^{1}(t) \exp\{γ_{21}^{1} I(\text{Chemotherapy}) + γ_{22}^{1} \text{Age}\}, \\
ρ(s, t) &= \exp\{β_{1}^{1} + β_{2}^{1} I(s < 4.2) + β_{3}^{1} I(t < 4.2) + β_{4}^{1} I(s < 4.2, t < 4.2) \\
&\qquad + α_{1}^{1} I(\text{Chemotherapy}) + α_{2}^{1} \text{Age}\},
\end{aligned}
$$

Model (2):

$$
\begin{aligned}
λ_{1}(s) &= λ_{10}^{2}(s) \exp\{γ_{11}^{2} I(\text{Radiation}) + γ_{12}^{2} \text{Age}\}, \\
λ_{2}(t) &= λ_{20}^{2}(t) \exp\{γ_{21}^{2} I(\text{Radiation}) + γ_{22}^{2} \text{Age}\}, \\
ρ(s, t) &= \exp\{β_{1}^{2} + β_{2}^{2} I(s < 4.2) + β_{3}^{2} I(t < 4.2) + β_{4}^{2} I(s < 4.2, t < 4.2) \\
&\qquad + α_{1}^{2} I(\text{Radiation}) + α_{2}^{2} \text{Age}\},
\end{aligned}
$$

Model (3):

$$
\begin{aligned}
λ_{1}(s) &= λ_{10}^{3}(s) \exp\{γ_{11}^{3} I(\text{Chemotherapy}) + γ_{12}^{3} I(\text{Radiation}) + γ_{13}^{3} \text{Age}\}, \\
λ_{2}(t) &= λ_{20}^{3}(t) \exp\{γ_{21}^{3} I(\text{Chemotherapy}) + γ_{22}^{3} I(\text{Radiation}) + γ_{23}^{3} \text{Age}\}, \\
ρ(s, t) &= \exp\{β_{1}^{3} + β_{2}^{3} I(s < 4.2) + β_{3}^{3} I(t < 4.2) + β_{4}^{3} I(s < 4.2, t < 4.2) \\
&\qquad + α_{1}^{3} I(\text{Chemotherapy}) + α_{2}^{3} I(\text{Radiation}) + α_{3}^{3} \text{Age}\},
\end{aligned}
$$

where λ₁(·) and λ₂(·) represent the marginal rates of local and distant disease recurrence, respectively. In this application, the two types of recurrent events shared the same covariate information (e.g., initial treatment and patients’ characteristics); therefore, the assumption λ_j(t|x_j(t),x_j′(s)) = λ_j(t|x_j(t)) (j ≠ j′ ∈ {1, 2}) was satisfied.

Table 3 summarizes the parameter estimates under the proportional rate models and the proportional rate ratio model, the standard error estimates obtained by the Jackknife method, and the associated p-values obtained by the Wald test. The results of Model (1) suggest that, after adjusting for age at baseline, the use of adjuvant chemotherapy with definitive surgical resection was associated with slightly lower frequencies of local and distant cancer recurrences, with estimated rate ratios of $\exp (γ_{11}^{1}) = 0.968$ and $\exp (γ_{21}^{1}) = 0.999$, respectively. Also, the use of chemotherapy was associated with a 0.118 increase in the log of the rate ratio. That is, using the interpretation based on equation (1), the standardized co-occurrence rate of local and distant recurrences in the chemotherapy group was exp(0.118) = 1.125 times that in the no-chemotherapy group. However, these effects were not statistically significant. The age effects on the frequencies and dependence were not statistically significant either, although older patients tended to have weaker dependence between the two types of cancer recurrence: the rate ratio was multiplied by $\exp (α_{2}^{1}) = 0.999$ for each one-year increase in age at the time of the initial treatments. Interestingly, the results of Model (2) indicated that the use of adjuvant radiation with definitive surgical resection had effects on the frequencies of local and distant recurrences in the opposite direction to those associated with adjuvant chemotherapy; however, neither of these effects was statistically significant, nor was the effect of adjuvant radiation on the rate ratio. After including both adjuvant radiation and chemotherapy in the marginal rate and rate ratio models, we observed results similar to those from Models (1) and (2).
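As a worked check of how the reported quantities relate, the sketch below recomputes a two-sided Wald p-value and the multiplicative effect from a coefficient estimate and its Jackknife standard error, using the chemotherapy effect on the rate ratio in Model (1) (estimate 0.118, JSE 0.115). The function name `wald_p_value` and the standard normal reference distribution are assumptions; small differences from the tabulated p-values are expected because the tabulated estimates are rounded.

```python
import math

def wald_p_value(estimate, std_error):
    """Two-sided Wald p-value, assuming a standard normal reference."""
    z = estimate / std_error
    # 2 * (1 - Phi(|z|)) written with the complementary error function.
    return math.erfc(abs(z) / math.sqrt(2.0))

# Chemotherapy effect on the rate ratio, Model (1), as reported in Table 3.
alpha_chemo, jse_chemo = 0.118, 0.115
p_value = wald_p_value(alpha_chemo, jse_chemo)
multiplicative_effect = math.exp(alpha_chemo)
```

The resulting p-value is close to the tabulated 0.305, and the multiplicative effect rounds to the 1.125 quoted in the text.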

Table 3.

Summary of data analysis for soft tissue sarcoma

Parameter	Estimate	JSE	Wald p-value

	Model (1)
Chemotherapy effect on rate of local recurrence	−0.033	0.127	0.798
Age effect on rate of local recurrence	0.004	0.004	0.270
Chemotherapy effect on rate of distant recurrence	−0.001	0.098	0.991
Age effect on rate of distant recurrence	−0.004	0.003	0.211
Intercept	1.170	0.215	<0.001
Early local recurrence (I(s<4.2))	−1.036	0.294	<0.001
Early distant recurrence (I(t<4.2))	−0.889	0.291	0.002
Early recurrences of both events (I(s<4.2, t<4.2))	1.501	0.460	0.001
Chemotherapy effect on rate ratio	0.118	0.115	0.305
Age effect on rate ratio	−0.001	0.003	0.739
	Model (2)
Radiation effect on rate of local recurrence	0.137	0.138	0.318
Age effect on rate of local recurrence	0.004	0.004	0.240
Radiation effect on rate of distant recurrence	0.006	0.127	0.961
Age effect on rate of distant recurrence	−0.004	0.003	0.207
Intercept	1.183	0.209	<0.001
Early local recurrence (I(s<4.2))	−1.044	0.291	<0.001
Early distant recurrence (I(t<4.2))	−0.903	0.291	0.002
Early recurrences of both events (I(s<4.2, t<4.2))	1.505	0.440	0.001
Radiation effect on rate ratio	0.114	0.140	0.413
Age effect on rate ratio	−0.001	0.003	0.647
	Model (3)
Chemotherapy effect on rate of local recurrence	−0.028	0.167	0.867
Radiation effect on rate of local recurrence	0.137	0.158	0.385
Age effect on rate of local recurrence	0.004	0.004	0.270
Chemotherapy effect on rate of distant recurrence	−0.002	0.107	0.985
Radiation effect on rate of distant recurrence	0.006	0.153	0.968
Age effect on rate of distant recurrence	−0.004	0.003	0.213
Intercept	1.083	0.279	<0.001
Early local recurrence (I(s<4.2))	−1.043	0.285	<0.001
Early distant recurrence (I(t<4.2))	−0.896	0.287	0.002
Early recurrences of both events (I(s<4.2, t<4.2))	1.510	0.431	<0.001
Chemotherapy effect on rate ratio	0.122	0.160	0.445
Radiation effect on rate ratio	0.117	0.196	0.549
Age effect on rate ratio	−0.001	0.003	0.764


Similar to our previous findings (Ning et al., 2015), our analysis indicated that, conditional on the initial treatments and baseline age, the dependence between the local and distant cancer recurrences was significantly positive and not constant over time. For example, conditional on the use of adjuvant chemotherapy and baseline age, the risks of early local recurrence and early distant recurrence were positively dependent, with a rate ratio of $\exp (β_{1}^{1} + β_{2}^{1} + β_{3}^{1} + β_{4}^{1}) = 2.109$ (p-value < 0.001). This indicates that the risk of early local cancer recurrence was 2.109 times higher for patients who had early distant cancer recurrence than for those who did not. Alternatively, based on the interpretation of equation (1), the standardized co-occurrence rate of early local and early distant disease recurrence was 2.109.
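The piecewise-constant rate ratio of Model (1) can be evaluated directly from the Table 3 estimates, as in the sketch below. The function name `rate_ratio` and the dictionary layout are illustrative assumptions; the call with `chemo=0` and `age=0.0` corresponds to the baseline part of the model, which is the quantity 2.109 quoted above for two early recurrences.

```python
import math

# Model (1) rate ratio estimates from Table 3.
BETA = {"intercept": 1.170, "early_local": -1.036,
        "early_distant": -0.889, "early_both": 1.501}
ALPHA = {"chemo": 0.118, "age": -0.001}
CUTOFF = 4.2  # years; median follow-up time used to dichotomize s and t

def rate_ratio(s, t, chemo=0, age=0.0):
    """Fitted rate ratio rho(s, t) under Model (1) for given covariates."""
    early_s = 1.0 if s < CUTOFF else 0.0
    early_t = 1.0 if t < CUTOFF else 0.0
    log_rho = (BETA["intercept"]
               + BETA["early_local"] * early_s
               + BETA["early_distant"] * early_t
               + BETA["early_both"] * early_s * early_t
               + ALPHA["chemo"] * chemo
               + ALPHA["age"] * age)
    return math.exp(log_rho)

both_early = rate_ratio(1.0, 1.0)           # baseline part, both events early
chemo_vs_none = rate_ratio(1.0, 1.0, chemo=1) / both_early
```

Evaluating the function in the other three time regions shows how the fitted dependence changes across the (s, t) plane, while the chemotherapy and age terms shift the rate ratio by the same multiplicative factor in every region.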

4. DISCUSSION

In this paper, we propose a semiparametric regression model for studying bivariate recurrent event data. The model consists of two levels: the first level employs proportional rate models for the marginal rates, and the second level uses a proportional rate ratio model to characterize the dependence structure between the two different types of recurrent events. In the proposed models, we specify the rates and the rate ratio (dependence structure) of the recurrent events, but not the full distribution. While the proposed pseudo-partial likelihood may be less efficient than the full likelihood, specifying the full distribution of bivariate processes is notoriously difficult. Furthermore, the assumptions on the dependence structure in the commonly used shared frailty models, in which the dependence is assumed to be constant regardless of time and covariate values, are usually considered restrictive.

In this paper, we extend the partial likelihood method for right-censored single time-to-event data to bivariate recurrent event data in order to estimate the parameters of the proportional rate ratio model. Although the proposed pseudo-partial likelihood involves conditional probabilities on the bivariate risk sets, the associated computational expense is reasonable. We used a high-performance computing system deployed at the Texas Advanced Computing Center, together with parallel computing, to conduct our simulations and application. For example, in a 200-run simulation for the data under Scenario 2 with a sample size of 200, the point estimation took 0.06 hours and the variance estimation by the Jackknife method took 11.54 hours. For the soft tissue sarcoma data, the point and variance estimation together took 0.57 hours.

Another advantage of our proposed models is that the marginal rates and the rate ratio have separate models, which are not linked by shared random effects. Although the estimated marginal rates play a role in the pseudo-partial likelihood, the rate ratio model and its estimation do not affect the marginal models or the associated estimation results. From this perspective, the proposed model framework is robust and reduces the biased inference induced by misspecification of the rate ratio model.

One limitation of the proposed inferential procedure is the assumption of non-informative censoring. In many applications, this non-informative censoring assumption may not hold. For regression analysis with single/multiple recurrent events, various approaches have been suggested to accommodate the informative censoring (Wang et al., 2001; Zhao et al., 2012). The inferential procedure for the dependency measure needs to be further generalized to relax the non-informative censoring assumption.

Another challenge of our method is the model specification for the rate ratio. In application, similar to the parametric baseline hazard function, a piece-wise constant function is generally preferred for the baseline rate ratio function, due to its easy interpretation. However, standard diagnostic tools, such as residual plots, cannot be used directly to assess the model’s adequacy. When all covariates are categorical, we can graphically evaluate the goodness-of-fit of the fitted model by the definition of rate ratio. For each subgroup determined by the categorical variables under the model, we respectively estimate the bivariate rate function using its definition and the rate ratio model. Specifically, the two estimators are

$$
\hat{λ}_{12}^{M}(s, t) = \hat{ρ}(s, t) \left\{ \sum_{i = 1}^{n} \int_{0}^{τ} K_{1}\left(\frac{s - u_{1}}{h}\right) \frac{d N_{i1}(u_{1})}{\sum_{j = 1}^{n} I(C_{j1} ⩾ u_{1})} \right\} \left\{ \sum_{i = 1}^{n} \int_{0}^{τ} K_{2}\left(\frac{t - u_{2}}{h}\right) \frac{d N_{i2}(u_{2})}{\sum_{j = 1}^{n} I(C_{j2} ⩾ u_{2})} \right\}
$$

and

$$
\hat{λ}_{12}^{J}(s, t) = \sum_{i = 1}^{n} \int_{0}^{τ} \int_{0}^{τ} K_{12}\left(\frac{s - u_{1}}{h}, \frac{t - u_{2}}{h}\right) \frac{d N_{i1}(u_{1})\, d N_{i2}(u_{2})}{\sum_{j = 1}^{n} I(C_{j1} ⩾ u_{1}, C_{j2} ⩾ u_{2})},
$$

where K_j, j = 1, 2, is a symmetric kernel function and K₁₂ is a bivariate kernel function, both with bandwidth h. Here, the two marginal rate functions and ${\hat{λ}}_{12}^{J}$ are computed by smoothing the empirical subject-specific rate estimators (Chiang et al., 2005), while ${\hat{λ}}_{12}^{M} (s, t)$ uses the rate ratio fitted by our model. If the two estimated bivariate rate functions do not show a clear discrepancy, the assumed model for the rate ratio is reasonable. An alternative way to evaluate model adequacy is to use likelihood-ratio-based inference tools. Because we use the pseudo-partial likelihood, instead of the full likelihood, for the estimation, the asymptotic behavior of the pseudo-partial likelihood ratio test for two nested models needs to be thoroughly studied. Developing rigorous statistical tools for model checking is beyond the scope of this paper and is a worthy objective for future research.
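The graphical check described above can be sketched numerically as follows. This is an illustration under stated assumptions, not the authors' code: the Epanechnikov kernel, the product form of the bivariate kernel, the 1/h and 1/h² scaling of the kernels, and all function names are choices made here, and the toy data are made up.

```python
import numpy as np

def epanechnikov(x):
    """Symmetric kernel supported on [-1, 1] (an assumed choice)."""
    x = np.asarray(x, dtype=float)
    return np.where(np.abs(x) <= 1.0, 0.75 * (1.0 - x ** 2), 0.0)

def marginal_rate(s, events, censor, h):
    """Kernel-smoothed marginal rate at time s.

    events: one sequence of event times per subject; censor: censoring times.
    """
    censor = np.asarray(censor, dtype=float)
    total = 0.0
    for times in events:
        for u in times:
            at_risk = np.sum(censor >= u)  # subjects still under observation at u
            total += float(epanechnikov((s - u) / h)) / (h * at_risk)
    return total

def joint_rate(s, t, events1, events2, censor1, censor2, h):
    """Kernel-smoothed bivariate rate at (s, t), using a product kernel."""
    censor1 = np.asarray(censor1, dtype=float)
    censor2 = np.asarray(censor2, dtype=float)
    total = 0.0
    for times1, times2 in zip(events1, events2):
        for u1 in times1:
            for u2 in times2:
                at_risk = np.sum((censor1 >= u1) & (censor2 >= u2))
                weight = float(epanechnikov((s - u1) / h) * epanechnikov((t - u2) / h))
                total += weight / (h * h * at_risk)
    return total

def model_based_rate(s, t, rho_hat, events1, events2, censor1, censor2, h):
    """Bivariate rate implied by a fitted rate ratio and the smoothed marginals."""
    return (rho_hat * marginal_rate(s, events1, censor1, h)
            * marginal_rate(t, events2, censor2, h))

# Toy data: two subjects, both followed for 3 years (made-up values).
events1 = [[1.0], []]      # type 1 event times per subject
events2 = [[1.0], [2.0]]   # type 2 event times per subject
censor = [3.0, 3.0]
empirical = joint_rate(1.0, 1.0, events1, events2, censor, censor, h=1.0)
implied = model_based_rate(1.0, 1.0, 2.0, events1, events2, censor, censor, h=1.0)
```

In practice the two surfaces would be evaluated on a grid of (s, t) values within each covariate subgroup and compared visually; the bandwidth h would need to be chosen with care.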

Supplementary Material

NIHMS1596616-supplement-Supplementary.pdf (209.9 KB, PDF)

Acknowledgements

The authors thank the editor, associate editor, and reviewers for helpful comments and suggestions, which have led to improvements of this article. This work was partially supported by grants from the National Institutes of Health (R01CA193878, P30CA016672, and UL1TR003167) and the Andrew Sabin Family Fellowship. The authors acknowledge the Texas Advanced Computing Center at the University of Texas at Austin for providing high-performance computing resources that have contributed to the research results reported within this paper.

Footnotes

Supporting Information

Regularity conditions and proofs of Theorem 1 in Section 1, additional simulation details referenced in Section 2, and computational codes with example data sets are available with this paper at the Biometrics website on Wiley Online Library.

Data Availability Statement

We are not able to share the soft tissue sarcoma data per the data sharing policy of MD Anderson Cancer Center. In the Supporting Information, we have included simulated data sets for illustration with the computational code.

References

Cai JW and Schaubel DE (2004). Marginal means/rates models for multiple type recurrent event data. Lifetime Data Analysis 10, 121–138.
Chiang C-T, Wang M-C, and Huang C-Y (2005). Kernel estimation of rate function for recurrent event data. Scandinavian Journal of Statistics 32, 77–91.
Cook RJ and Lawless JF (2007). The Statistical Analysis of Recurrent Events. Springer, New York.
Cook RJ, Lawless JF, and Lee KA (2010). A copula-based mixed Poisson model for bivariate recurrent events under event-dependent censoring. Statistics in Medicine 29, 694–707.
Cormier JN, Huang X, Xing Y, Thall PF, Wang X, Benjamin RS, Pollock RE, Antonescu CR, Maki RG, Brennan MF, and Pisters PW (2004). Cohort analysis of patients with localized, high-risk, extremity soft tissue sarcoma treated at two cancer centers: chemotherapy-associated outcomes. The Journal of Clinical Oncology 22, 4567–4574.
Cox DR (1975). Partial likelihood. Biometrika 62, 269–276.
Dalal SR and McIntosh AM (1994). When to stop testing for large software systems with changing code. IEEE Trans. Software Eng 20, 318–323.
Fan J and Prentice RL (2002). Covariate-adjusted dependence estimation on a finite bivariate failure time region. Statistica Sinica pages 689–705.
Lawless JF and Nadeau C (1995). Some simple robust methods for the analysis of recurrent events. Technometrics 37, 158–168.
Lin D, Wei L, Yang I, and Ying Z (2000). Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 62, 711–730.
Ning J, Chen Y, Cai C, Huang X, and Wang M-C (2015). On the dependence structure of bivariate recurrent event processes: inference and estimation. Biometrika pages 345–358.
Ning J, Rahbar MH, Choi S, Piao J, Hong C, del Junco DJ, Rahbar E, Fox EE, Holcomb JB, and Wang M-C (2015). Estimating the ratio of multivariate recurrent event rates with application to a blood transfusion study. Statistical Methods in Medical Research page 0962280215593974.
Ning J, Rahbar MH, Choi S, Piao J, Hong C, Del Junco DJ, Rahbar E, Fox EE, Holcomb JB, and Wang M-C (2017). Estimating the ratio of multivariate recurrent event rates with application to a blood transfusion study. Statistical Methods in Medical Research 26, 1969–1981.
Schaubel DE and Cai JW (2005). Semiparametric methods for clustered recurrent event data. Lifetime Data Analysis 11, 405–425.
Shao J and Wu CJ (1989). A general theory for jackknife variance estimation. The Annals of Statistics pages 1176–1197.
Sun L, Zhu L, and Sun J (2009). Regression analysis of multivariate recurrent event data with time-varying covariate effects. Journal of Multivariate Analysis 100, 2214–2223.
Ventura V, Cai C, and Kass RE (2005). Statistical assessment of time-varying dependency between two neurons. J Neurophysiol. 94, 2940–7.
Wang MC, Qin J, and Chiang CT (2001). Analyzing recurrent event data with informative censoring. Journal of the American Statistical Association 96, 1057–1065.
Zhao X, Liu L, and Xu W (2012). Analysis of multivariate recurrent event data with time-dependent covariates and informative censoring. Biometrical Journal 54, 585–595.
Zhu L, Sun J, Tong X, and Srivastava DK (2010). Regression analysis of multivariate recurrent event data with a dependent terminal event. Lifetime Data Analysis 16, 478–490.
