Joint Analysis of Recurrence and Termination: A Bayesian Latent Class Approach

Zhixing Xu; Debajyoti Sinha; Jonathan R Bradley

doi:10.1177/0962280220962522

. Author manuscript; available in PMC: 2022 Feb 1.

Published in final edited form as: Stat Methods Med Res. 2020 Oct 13;30(2):508–522. doi: 10.1177/0962280220962522

Joint Analysis of Recurrence and Termination: A Bayesian Latent Class Approach

Zhixing Xu ¹, Debajyoti Sinha ¹, Jonathan R Bradley ^1,^*

PMCID: PMC8009817 NIHMSID: NIHMS1644642 PMID: 33050774

Summary:

Like many other clinical and economic studies, each subject of our motivating transplant study is at risk of recurrent events of Non-Fatal Tissue Rejections (NFTR) as well as the terminating event of death due to total graft rejection. For such studies, our model and associated Bayesian analysis aim for some practical advantages over competing methods. Our semiparametric latent-class based joint model has coherent interpretation of the covariate (including race and gender) effects on all functions and model quantities that are relevant for understanding the effects of covariates on future event trajectories. Our fully Bayesian method for estimation and prediction using a complete specification of the prior process of the baseline functions. We also derive a practical and theoretically justifiable partial likelihood based semiparametric Bayesian approach to deal with analysis when there is a lack of prior information about the baseline functions. Our model and method can accommodate fixed as well as time-varying covariates. Our Markov Chain Monte Carlo tools for both Bayesian methods are implementable via publicly available software. Our Bayesian analysis of transplant study and simulation study demonstrate the practical advantages and improved performance of our approach.

Keywords: Bayesian analysis, Frailty, Joint Model, Intensity and rate, Recurrent events

1. Introduction

Data on times to recurrent events until termination are common in various studies in cancer, chronic diseases, organ-transplant, repairable systems and economics. For example, in our motivating study for evaluating the covariate effects on each patient after receiving transplant, two types of responses for each transplant patient are: (1) the recurrent events of Non-fatal Tissue Rejections (NFTR) that were treated effectively by drug therapy, and (2) the terminating event of Graft-versus-Host Disease event (GvHD event) resulting in either total graft rejection or death. Although methodologies for recurrent events data have a long history in the literature (Cook and Lawless, 2007), the topic of recurrent events data with informative termination is a relatively new research field.

Either using the naive assumption of non-informative termination (as defined in Cook and Lawless, 2007) or making inference about every recurrence while treating the termination and the remaining events as nuisances (Hougaard, 2000) often leads to seriously biased and even misleading inference (Miloslavsky et al., 2004). Other methods (see review by Miloslavsky et al., 2004) using an extension of the Coarsening-At-Random (CAR) assumption of Heitjan and Rubin (1991) preclude any inference on the termination event. Also, the CAR assumption is not verifiable from observed data and often lacks any practical meaning especially for transplant and studies with terminating event being death (Huang and Wang, 2004; Sinha et al., 2008). All of these approaches fail to coherently explain covariate effects on termination, evaluate the link between the recurrences and the risk of termination, and make prediction about future event processes. In many studies including our transplant study, evaluation of covariate effects and the predictions of future trajectories of both recurrent and terminating events are important analysis and prediction goals. For example, given some previous evidence of racial disparity on recurrent NFTR after rejection (example, Higgins and Fishman (2006)), one of the major goals of the analysis of our transplant study is a comprehensive and coherent evaluation of the race effect on joint trajectories of both NFTR and fatal GvHD events after transplant. To present a coherent and comeprehensive interpretation of the overall effect of race on both types of events, the main challenge of any useful joint model is to present clinically interpretable effects of race on following functions related to the trajectories of events after transplant: (1) the intensity function of recurrence and the hazard of termination, both conditional on recurrence history; and (2) the mean number and the rate of events, both unconditional on history. The former functions represent the dynamic effects of race and recurrence history on future events. Second set of functions express the non-dynamic (marginal) effects of race on future events. For the sake of physical interpretation, it is further desirable that a joint model should ensure similar signs and magnitudes of the race effect on all of these functions.

Since Lancaster and Intrator (1998), the joint modeling literature of such data has been dominated by models that use a patient-specific “frailty” random-effect shared by both recurrence and termination within a patient (e.g., Liu et al., 2004; Ye et al., 2007; Sinha et al., 2008; Zeng and Lin, 2009; Huang et al., 2010; Kalbfleisch et al., 2013; Xu et al., 2017). Except few (Xu et al., 2017; Paulon et al., 2018), these shared-frailty models usually require an assumption of parametric frailty distribution that can not be easily assessed from the observed data. These shared-frailty models usually lack simultaneous physical interpretations of covariate effects on all functions of interest listed above. For example, to obtain a reasonable expression of effect of race on mean and rate of recurrences over time, most of the existing shared-frailty models need the recurrent events given the frailty to be a Poisson process, an assumption considered too restrictive in practice. Whereas, other shared-frailty models with clear interpretations of covariate effects on the mean recurrence and rate (e.g. Xu et al., 2017) lack practical interpretation of dynamic effects of covariates on risks of new recurrence and termination at time t given current history of events. There are some recent interesting works using copula structure (Shih and Louis, 1995) for bivariate and even multivariate frailty random effects to model association among several types of events, recurrences and termination while preserving some desired marginal distribution of each frailty effect (e.g., Lee and Cook (2019) and Cook et al. (2010)). The goal is to use the desired marginal density of a particular frailty effect, say, a marginal Gamma frailty effect on the recurrent events, to obtain a computationally tractable likelihood. In spite of being more flexible than shared frailty models, these approaches also share some of the same difficulties in expressing simultaneous physical interpretation of covariate effects on all functions of interest. Also, models using multivariate frailty with copula are not amenable to Bayesian partial likelihood-based approach.

Recently popular Joint Latent Class Models (JLCM) for joint analysis of survival and longitudinal responses outlined in Proust-Lima et al. (2014), Barrett et al. (2015) and others avoid several several pitfalls of shared random effects models, such as increasing dimension with sample size and lack of estimation of individualized survival risk given past longitudinal outcome trajectory. Our goal is also to develop latent class models for our current problem to replicate the successes and advantages of these methods for joint analysis of longitudinal and survival outcomes.

In Section 2, we present a novel JLCM for recurrent events and termination with several practical advantages including a prediction of future profiles of recurrent and terminal events given covariates. We demonstrate the methodological advantages of the JLCM compared to existing models by showing that the JLCM has a coherent interpretation of the dynamic effects of the covariates on the risks of future events given the history, as well as the covariate effects on the rate and mean number of recurrences, unconditional on history. In Section 3, we present two semiparametric Bayesian methods of analysis using JLCM. These methods include the directions for choosing the priors, and demonstration of the ease of implementing associated Markov Chain Monte Carlo (MCMC) tools. The fully Bayesian method of Section 3 requires prior opinions on baseline functions; however, it is capable of predicting the future event trajectories. A partial likelihood based Bayesian method of Section 3 is useful when there is no available prior opinions about these unknown baseline functions of both events. Our MCMC based practical Bayesian methods are easy to implement via publicly available software such as OpenBUGS and these programs are made available by the corresponding author. In Section 4, our simulation studies show the performances of the JLCM under different priors compared to existing Bayesian methods. In Section 5, we provide an analysis of transplant data to illustrates the clinical interpretation and advantages of our models and associated methods in practice. Section 6 presents the concluding discussion including the extension of our methods and results to studies with time-varying covariates.

2. Joint Latent Class Model

Our JLCM assumes that future event trajectories of patients i = 1, ⋯, n depend on the latent class M_i of one of K + 1 latent homogeneous sub-populations G₀, G₁, ⋯, G_K. The unknown class membership variables M₁, ⋯, M_n are independent multinomial with

M_{i} ∣ (K, π) \overset{i i d}{~} Mult (π_{0}, \dots, π_{K}),

(1)

where π_j = P[M_i = j] for π = (π₀, …, π_K) is the unknown probability of patient i being from class j, and (K + 1) is the unknown number of latent classes with $\sum_{j = 0}^{K} π_{j} = 1$ . In some applications, this latent class distribution π may be a function of a set of covariates Z, however, for time being we assume that it does not depend on the observed p-dimensional fixed covariate vector x_i = (x_i1, ⋭, x_ip) that only affects the recurrent and termination events. We will later extend our model and methods to time-varying covariates.

Similar to currently popular JLCM models of longitudinal data (e.g, see Proust-Lima et al., 2014, and the references therein), we incorporate an unknown parameter η_j that models the relationship between the profile/trajectory of cumulative counts of NFTR recurrence N_i(t) and the “point-process of termination” $D_{i} (t) = 1_{[T_{i} ⩽ t]}$ of termination time T_i for all patients from latent class G_j (see (2) and (3)). We make a clear distinction between “termination” at T_i due to GvHD (either death or total graft rejection) and the non-informative “censoring” at C_i (Kalbfleisch and Prentice, 2002) due to loss of follow-up, end of study, and other factors. Additionally for our JLCM, η_j is used to accommodate the dynamic effect of the observed history $H_{i} (t -)$ on increments dN_i(t) = N_i(t+dt)−N_i(t−) and dD_i(t) = D_i(t+dt)−D_i(t−) of both recurrences and termination over time interval [t, t+dt), where $H_{i} (t -)$ up to time t− (and not including t) is defined as the σ-algebra generated by the set {N_i(u), D_i(u), A_i(u) : u < t} and $A_{i} (t) = 1_{[T_{i}, C_{i} ⩾ t]}$ is the “at observation process”. For this purpose, we assume the intensity function to be

lim_{d t \to 0} \frac{P [d N_{i} (t) > 0 ∣ H_{i} (t -), x_{i}, M_{i} = j, η_{j}]}{d t} = λ_{j} (t ∣ x_{i}, H_{i} (t -), η_{j}) = A_{i} (t) λ_{0} (t) [η_{j} k + θ_{i}],

(2)

where θ_i = exp(β′x_i) with $β^{'} x_{i} = \sum_{m = 1}^{p} x_{i m} β_{m}$ is the dynamic effect of covariate x_i, β = (β₁, ⋯, β_p) is the regression parameter, λ₀(t) is the baseline intensity function, and N_i(t−) = k is the number of past recurrences at time t (included as part of the history $H_{i} (t -)$ ). For a patient with {M_i = j}, the parameter η_j in (2), quantifies the dynamic effect of past recurrence history $H_{i} (t -)$ ) on the risk of future recurrence {dN_i(t) > 0} because every past recurrence contributes to an additional η_jλ₀(t) to the risk of dN_i(t) around time t. In particular, the first NFTR event for any latent group G_j has the common hazard function λ₀(t) exp(βx_i) with Cox’s (1972) relative risk model for the covariate effect. The class G₀ with η₀ = 0 includes patients for whom future recurrence dN_i(t) does not depend on number of past recurrences N_i(t−).

For the increment dD_i(t) in our JLCM, we assume the relative risk model (Cox, 1972)

lim_{d t \to 0} \frac{P [d D_{i} (t) ∣ H_{i} (t -); x_{i}, M_{i} = j, η_{j}]}{d t} = A_{i} (t) h_{j} (t ∣ η_{j}; x_{i}) = A_{i} (t) h_{0} (t) e^{γ x_{i} + α η_{j}},

(3)

where the unknown γ = (γ₁, ⋯, γ_p) quantifies the dynamic effect of covariate vector x_i on dD_i(t), and the scalar parameter α represents the fixed effect of the class-specific profile parameter η_j on the future risk/hazard of termination T_i when M_i = j. The practical assumption in (3) ensures that different latent classes have different risks of termination. Also, assumptions (2) and (3) together ensure that all patients within same class G_j share the same joint regression profile of recurrences and termination characterized by the unknown class-profile parameter η_j of G_j. For longitudinal data, the JLCM is a popular modeling option that allows for practical interpretation of covariate effects, heterogeneity of the population and comparison of various patients’ response profiles within and across latent classes while bypassing distributional assumption on random effects. Our novel JLCM for recurrence and termination also aims to achieve all of these above goals.

A major challenge for a joint model is to present a good physical interpretation of the covariates effects on joint process {N_i, D_i}. Existing joint models use a shared patient-specific frailty random effect W_i to accommodate the dynamic dependence between dN_i(t) and dD_i(t) given the history $H_{i} (t -)$ (Huang and Wang, 2004; Liu et al., 2004; Ouyang et al., 2013; Qu et al., 2017). These models even accommodate the effect of history $H_{i} (t -)$ on {dN_i(t), dD_i(t)} via the shared W_i. Consequently, the dynamic effects of x_i on {N_i, D_i} can only be explained conditional on random W_i that varies among patients and cannot be reliably estimated. Furthermore, any direct interpretation of the dynamic effect of x_i on the marginal intensity $λ (t ∣ H_{i} (t -), x_{i})$ and on the marginal hazard $h (t ∣ H_{i} (t -), x_{i})$ are lacking because these functions (obtained after integrating the random W_i) do not have any interpretable functional forms. Thus, the profiles of two subjects with different covariate values are difficult to compare without some additional restrictive model assumptions. Unlike them, our JLCM model presents the dynamic effects of x_i on the joint profiles of {dN_i(t), dD_i(t)} via (2) and (3) based on finite dimensional and estimable η.

The JCLM also presents a synthesized interpretation of covariate effects on multiple quantities of interest related to both N_i and D_i. This is apparent when we evaluate the covariate effects on important marginal functions such as the mean and the rate functions—both of them are unconditional on the observed history $H_{i} (t -)$ . We obtain the differential equation dμ_j(t|x_i) = E[dN_i(t)|x_i;M_i = j, η_j] = dΛ₀(t)[η_jμ_j(t|x_i) + θ_i] from (2), where μ_j(t|x_i) = E[N_i(t)|A_i(t) = 1; x_i, M_i = j, η_j] is the mean function (expected number) of recurrences given the patient in class G_j is under risk at time t. Solving this differential equation, we obtain the mean function

μ_{j} (t ∣ x_{i}) = \frac{θ_{i}}{η_{j}} [e x p {η_{j} Λ_{0} (t)} - 1],

(4)

and corresponding rate function dμ_j(t|x_i)/dt = θ_iλ₀(t)exp{η_jΛ₀(t)} given that T_i > t. The (4) implies that even the population rate function $d μ (t ∣ x_{i}) / d t = θ_{i} λ_{0} (t) \sum_{j = 0}^{K} [π_{j} exp {η_{j} Λ_{0} (t)}]$ given {T_i > t} is proportional in time with interpretable fixed effect θ_i = exp(β′x_i) of covariate x_i. This shows that unlike previous frailty models of Oakes (1992), Lawless (1995) and Lin et al. (2000) under non-informative termination, our JLCM model produces interpretable fixed effects of covariates and latent class index on the expected and rate of recurrences for a patient not terminated at time t. This property is similar to the property of the frailty model of Xu et al. (2017). However, for our transplant study as well as other practical applications, it is sensible to focus on the mean $μ_{j}^{*} (t ∣ x_{i})$ of $N_{i}^{*} (t) = N_{i} (M i n {T_{i}, t})$ , the point-process of number of recurrences only until termination time T_i. Using similar arguments as to what were used for deriving (4), we obtain

μ_{j}^{*} (t ∣ x_{i}) = E [N^{*} (t) ∣ x_{i}; M_{i} = j, η_{j}] = θ_{i} \int_{0}^{t} S_{j} (u ∣ x_{i}) λ_{0} (u) e x p {η_{j} Λ_{0} (u)} d u,

(5)

with the corresponding rate-function $r_{j}^{*} (t ∣ x_{i}) \equiv d μ_{j}^{*} (t ∣ x_{i}) / d t = θ_{i} S_{j} (t ∣ x_{i}) λ_{0} (t) e x p {η_{j} Λ_{0} (t)}$ , where $S_{j} (t ∣ x_{i}) = S_{0} {(t)}^{exp (γ x_{i} + α η_{j})}$ is the survival function of T_i with corresponding class-specific hazard function in (3). Unlike previous shared-frailty models, (4) and (5) guarantee that the covariate effect θ_i on the cumulative mean function $μ_{j}^{*} (t ∣ x)$ and the rate function $r_{j}^{*} (t ∣ x_{i})$ (both unconditional on history) is same as the dynamic effect of x_i on the risk function $λ_{j} (t ∣ x_{i}, H_{i} (t -), η_{j})$ for any subject i in G_j. Unlike expression (5) for the JLCM, the shared-frailty models lack any interpretation of the effects of x_i on the marginal mean μ*(t|x) and rate r*(t|x) (after integrating out frailty) because these models provide no simple expressions for these functions (without some strong and unrealistic additional modeling assumptions). There are also issues regarding the sensitivity of these marginal regression functions, say, r*(t|x) and $λ (t ∣ H (t -); x)$ , to the assumed parametric form of the frailty density. Recent shared-frailty models of Xu et al. (2017) focus solely on E[N_i(t)|X_i] without considering termination at T_i, and do not provide the marginal function r*(t|x_i).

3. Bayesian Analysis of Joint Model

The observed data is the set Y₀ = {x_i, y_i, δ_i, N_i(t) for 0 < t ⩽ y_i : i = 1, ⋯, n}, where y_i = min{T_i, C_i} is the last observation time and $δ_{i} = 1_{[T_{i} < C_{i}]}$ is the censoring indicator for patient i. The likelihood under the JLCM in (1)–(3) based the observed data Y₀ is a product of two following parts. Using the contributions from the observed NFTR recurrences N_i(t) in the observation interval (0, y_i], the first part based on the intensity function in (2) is:

L_{R} (β, η, Λ_{0}, M ∣ Y_{0}) = \prod_{i = 1}^{n} \prod_{q = 1}^{Q} [{d Λ_{0} (t_{q}) (N_{i q} W_{i}^{*} + θ_{i})}^{n_{i q}} exp {- A_{i q} Λ_{0 q} (N_{i q} W_{i}^{*} + θ_{i})}],

(6)

where t₁ < ⋯ < t_Q are ordered distinct NFTR recurrence and last observation times y_i from i = 1, ⋯, n subjects, $Λ_{0 q} = Λ_{0} (t_{q}) - Λ_{0} (t_{q - 1})$ is the increment in $Λ_{0} (t) = \int_{0}^{t} λ_{0} (u) d u$ in interval I_q = (t_q−1, t_q] with t₀ = 0, A_iq is the at-risk indicator A_i(t_q) of subject i at time t_q, N_iq = N_i(t_q−) is the number of past NFTR recurrences to subject i before time t_q, n_iq = N_{i, q+1} − N_iq is the number of NFTR recurrences occurring to subject i at time t_q, and $W_{i}^{*} = \sum_{j = 0}^{K} η_{j} I (M_{i} = j)$ . Under the hazard function (3), another part of the likelihood based on the observed (y_i, δ_i) is

L_{S} (γ, η, α, H_{0}, M ∣ Y_{0}) = \prod_{i = 1}^{n} exp {- H_{0} (y_{i}) exp (γ x_{i} + α W_{i}^{*})} {[d H_{0} (y_{i}) exp (γ x_{i} + α W_{i}^{*})]}^{δ_{i}},

(7)

where dH₀(t) is the increment in baseline cumulative hazard $H_{0} (t) = \int_{0}^{t} h_{0} (u) d u$ in the interval [t, t + dt). Full semiparametric Bayesian analysis (see Ibrahim et al., 2005) is based on the joint posterior distribution given by

p (β, γ, α, η, Λ_{0}, H_{0}, M ∣ Y_{0}) \propto L_{R} (β, η, Λ_{0}, M) \times L_{S} (γ, α, H_{0}, M) \times \prod_{i = 1}^{n} p_{C} (M_{i} ∣ K, π) \times p_{1} (Λ_{0}) \times p_{2} (H_{0}) \times p_{3} (η ∣ K) \times p_{4} (K, π) \times p_{5} (β, α, γ),

(8)

where p_C(M_i|K, π) is the multinomial distribution of M_i in (1). The density of p₄(K, π) is the prior of its parameter (K, π), p₁(Λ₀) and p₂(H₀) are two independent prior processes for non-parametric cumulative functions Λ₀(t) and H₀(t) respectively, p₃(η|K) is the prior distribution of η = (η₁, ···, η_K) given K, and p₅(β, α, γ) is the joint prior of the regression parameters (β, α, γ). It is reasonable and common practice to assume a priori mutual independence of the regression parameters, baseline functions, and latent class parameters (η, π, K).

There are several ways to specify a prior p₄(K, π) for unknown latent class variables (K, π). Methods using K +1 to be known, as used in popular JLCM based joint analysis of survival and longitudinal data (Huang and Wang, 2004; Han et al., 2007; Proust-Lima and Taylor, 2009), usually lead to higher than adequate number of classes in practice. The Dirichlet process mixture (DPM) model (Neal, 2000) for $W_{i}^{*}$ in (6) also leads to high computational cost and substantially higher than adequate number of classes. Provided it is supported by the observed data, it is desirable to have a small value of K to ensure that marginal mean, rate and intensity functions in (3) and (5) enable a comprehensive comparison among patients with different covariate values. A JLCM with large value of K is subject to the same criticisms leveled at shared-frailty models because shared-frailty models are in some sense JLCM with different classes for all different patients! So, we use the Mixture of Finite Mixtures (MFM) hierarchical prior (Miller and Harrison, 2016) for p₄(K, π) in (8). This is presented hierarchically as

(π_{0}, \dots, π_{K}) ∣ K ~ D i r_{K + 1} (γ, \dots, γ) and K ∣ ζ ~ Pois (ζ),

(9)

where Dir_m(a₁, …, a_m) is the Dirichlet distribution with parameter (a₁, …, a_m), and Pois(ζ) is the Poisson distribution with mean ζ. A popular choice for the prior process p₁(Λ₀) in (8) is the Gamma process (Kalbfleisch, 1978) denoted by GP(Λ*(t), b_λ), with a “prior guess” (prior mean) Λ*(t) of Λ₀(t) and precision b_λ (assumed known). For example, Λ*(t) = a_λt represents the user-specified a_λ > 0 being the prior guess for baseline intensity λ₀(t). Similarly, we use $p_{2} (H_{0})$ as $G P (H_{0}^{*} (t), b_{h})$ with prior mean $H_{0}^{*} (t) = a_{h} t$ and precision b_h for some known a_h, b_h > 0. Unless there are substantial prior information about functions (Λ₀, H₀), these two Gamma processes with small precision b_λ and b_h can be reasonably approximated by independent Gamma priors for unknown increments Λ_0q = Λ₀(t_q)−Λ₀(t_q−1) and H_0q = H₀(t_q) − H₀(t_q−1) for q = 1, ⋯, Q with prior mean (t_q − t_q−1)a_λ and variance (t_q − t_q−1)a_λ/b_λ, and prior mean (t_q − t_q−1)a_h and variance (t_q − t_q−1)a_h/b_h, respectively.

When we have useful prior information about both Λ₀(t) and H₀(t), we recommend a full semiparametric Bayesian analysis that is capable of inference as well as prediction using our JLCM in (2), (3) and (9). For such an analysis, we need MCMC samples from the posterior in (8). However, when there is a lack of credible prior information about (λ₀, h₀), we recommend following partial likelihood based semiparametric Bayesian inference.

Bayesian Analysis with Partial Likelihood:

Under the intensity function of (2) for JLCM, the partial likelihood for the recurrent events is

P L_{R} (β, η, M ∣ Y_{0}) = \prod_{i = 1}^{n} \prod_{q = 1}^{Q} {\frac{W_{i}^{*} N_{i q} + θ_{i}}{\sum_{s = 1}^{n} A_{s} (t_{j}) (W_{s}^{*} N_{s q} + θ_{s})}}^{n_{i q}},

(10)

where A_s(t_q) is the ”at risk” indicator of whether subject s is at observation at time t_q. Similarly, for observed (y_i, δ_i), the partial likelihood under the hazard in (3) is

P L_{S} (γ, α, η, M ∣ Y_{0}) = \prod_{i = 1}^{n} {\frac{exp (γ x_{i} + α W_{i}^{*})}{\sum_{s = 1}^{n} A_{s} (y_{i}) exp (γ x_{s} + α W_{s}^{*})}}^{δ_{i}} .

(11)

Following arguments of Ibrahim et al. (2005), we can prove that the joint posterior

p_{P L} (β, γ, α, η, M ∣ Y_{0}) \propto P L_{R} (β, η, M ∣ Y_{0}) \times P L_{S} (γ, α, η, M ∣ Y_{0}) \times \prod_{i = 1}^{n} p_{C} (M_{i} ∣ K, π) \times p_{3} (η ∣ K) \times p_{4} (K, π) \times p_{5} (β, α, γ)

(12)

based on the partial likelihoods of (10) and (11) is always a proper joint density as long as the priors p₃(η|K), p₄(K, π), and p₅(β, α, γ) are proper. In Appendix I, we present a proof of the posterior of (12) being an approximation of the marginal posterior obtained via integrating (Λ₀, H₀) from the full posterior of (8) under very “diffuse” Gamma processes for p₁(Λ₀) and p₂(H₀). This gives a theoretical justification to use the posterior in (12) when there is no substantial prior opinion available for (Λ₀, H₀). Unlike the full posterior of (8), the posterior of (12) does not involve (Λ₀, H₀) and needs fewer steps within the MCMC while sacrificing the ability to make useful posterior predictions and posterior estimation of number and rate of future events.

The choice of priors for regression and variance component parameters often have substantial influence on Bayesian estimates (Gelman et al., 2006, 2008). For frailty models, the sensitivity of the results of Bayesian analysis to the priors of the frailty parameter is already well documented (Ouyang et al., 2013). Following Gelman et al. (2006), we present Bayesian analysis of JLCM using the ordered uniform distribution of size K as the “non-informative” prior and the ordered half-Cauchy distributions of size K and scale 2.5 as the “weakly-informative” prior for η₁ < ⋯ < η_K. We use independent Cauchy density with center 0 and scale 2.5 as the priors for the regression parameters β and γ because these priors for regression parameters often outperform other non-informative and weakly-informative priors, including Gaussian and Laplace priors (Gelman et al., 2008). We use a Gamma(1, 1) density as the prior for the parameter α associated with the class-effects η_j on termination.

4. Simulation Study

Our first two simulation studies compare the performances of Bayesian estimates of mainly the single regression parameter obtained from 3 methods: (1) JLCM with ordered uniform in (−3, +3) priors for η₁ < ··· < η_K, (2) JLCM with ordered half-Cauchy prior on η₁ < ··· < η_K, (3) shared-frailty model of Huang and Wang (2004). We compare the performances of these 3 Bayesian methods at sample sizes n = 100 and n = 400. To compare performances of the Bayesian estimates from competing methods, these two as well as other simulation studies use 500 replicates of datasets from each simulation model and sample-size to approximate the relative bias (RB), the average posterior standard deviation (SD), and the approximate square-root of mean square error (RMSE) of the Bayesian estimates under different methods. To facilitate fair comparisons among all three models, we present results of only full Bayesian analysis (partial likelihood based Bayesian analysis is not readily available for shared-frailty model) of them. Following conventional choices (Bender et al., 2005), we use independent Cauchy priors with center 0 and scale 2.5 for all regression parameters, GP(a_λt, b_λ) and GP(a_ht, b_h) with b_λ = b_h = 0.001 and a_λ = a_h = 1 for cumulative baseline functions Λ₀ and H₀ respectively.

All simulation models use the baseline functions λ₀(t) = 1 and h₀(t) = 0.5, and fixed censoring time C_i = 2. For Simulation Study 1 and 2, we simulate from JLCM with η = (0, 0.4, 0.8) for K + 1 = 3, a positive association between recurrence and termination with α = 0.5, and independent Bernoulli covariate x_i ~ Ber(0.5). The only difference between two simulation models is that the simulation model of former has same direction of covariate effects on risks of both recurrence and termination with β = γ = 1, whereas in later simulation model these true covariate effects are in opposite directions with β = 1 and γ = −1. For, The values of RB, SD and RMSE in Table 1 (for Simulation Study 1) and Table 2 (for Simulation Study 2) indicate that JLCM based Bayesian estimates under uniform priors for η perform the best among competing methods. As expected, the RB and RMSE for smaller sample-size n = 100 are slightly larger than corresponding values obtained from larger datasets (n = 400), however, the estimates for both sample sizes have very small RB. Especially for the estimating η₁ and η₂, the JLCM performs better while using ordered uniform priors compared to using half-Cauchy priors on η, because the later method substantially underestimates η and over-estimates the number of latent groups K with a large RMSE. The RMSE values of the estimates of regression parameter from both JLCM based methods are smaller than the corresponding RMSE values from the shared-frailty model based estimates. Thus, the JLCM bases methods substantially outperform the shared-frailty method when the data is generated from a JLCM.

Table 1.

Comparison of Bayesian estimates using data simulated from a JLCM with a same covariate effects on recurrence and termination risks: RB is the relative bias, SD is the average posterior Standard-Deviation, and RMSE is the square-root of mean square error based on 500 replicates.

		n=100			n=400
Methods	Paramter	RB	SD	RMSE	RB	SD	RMSE
JLCM with uniform prior for η	α	−0.026	0.421	0.165	−0.004	0.331	0.160
	β	−0.008	0.198	0.193	−0.005	0.139	0.137
	γ	−0.094	0.220	0.214	−0.057	0.154	0.177
	η₁	0.022	0.262	0.097	0.006	0.221	0.083
	η₂	0.078	0.433	0.145	0.004	0.381	0.124
	K	0.159	0.877	0.480	0.150	0.801	0.451
JLCM with Cauchy prior for η	α	0.043	0.497	0.211	0.008	0.361	0.185
	β	−0.081	0.187	0.203	0.005	0.122	0.111
	γ	−0.126	0.218	0.250	−0.074	0.146	0.205
	η₁	−0.942	0.028	0.377	−0.918	0.014	0.367
	η₂	−0.909	0.083	0.727	−0.948	0.016	0.758
	K	0.852	0.475	1.727	0.862	0.542	1.752
Shared-frailty Model	β	−0.163	0.207	0.263	−0.147	0.152	0.212
Shared-frailty Model	γ	−0.127	0.245	0.276	−0.084	0.217	0.178

Open in a new tab

Table 2.

Summary of performances of estimates from different methods when data is simulated from a JLCM with opposite covariate effects on recurrence and termination risks: RB is the relative bias, SD is the average posterior Standard-Deviation, and RMSE is the square-root of mean square error based on 500 replicates.

		n=100			n=400
Methods	Parameter	RB	SD	RMSE	RB	SD	RMSE
JLCM with uniform prior for η	α	−0.034	0.423	0.017	−0.009	0.311	0.011
	β	0.004	0.174	0.004	0.004	0.151	0.003
	γ	0.066	0.284	0.066	0.047	0.236	0.044
	η₁	0.035	0.230	0.014	0.008	0.222	0.009
	η₂	0.033	0.373	0.026	0.005	0.364	0.015
	K	0.150	0.813	0.300	0.114	0.747	0.280
JLCM with Cauchy prior for η	α	−0.076	0.411	0.038	−0.026	0.330	0.033
	β	−0.050	0.166	0.050	−0.016	0.136	0.034
	γ	0.068	0.283	0.068	0.051	0.256	0.062
	η₁	−0.938	0.031	0.375	−0.653	0.013	0.328
	η₂	−0.895	0.096	0.716	−0.613	0.018	0.686
	K	0.923	0.333	1.846	0.935	0.313	1.548
Shared-frailty Model	β	−0.049	0.218	0.222	−0.024	0.217	0.197
Shared-frailty Model	γ	0.186	0.327	0.375	0.173	0.259	0.376

Open in a new tab

Simulation Study 3 tests the robustness of JLCM based Bayesian estimates via comparing these three estimates when the true simulation model is the shared-frailty model of Huang and Wang (2004) with conditional intensity function $λ (t ∣ x_{i}, H_{i} (t -), W_{i}) = λ_{0} (t) e x p (x_{i} β) (1 + W_{i})$ and hazard function $h (t ∣ x_{i}, H_{i} (t -), W_{i}) = h_{0} (t) e x p (x_{i} β) (1 + W_{i})$ with β = γ = 1, and the frailty density W_i ~ Gamma(1.5, 1.5). Table 4 shows that the estimated regression parameters from all three competing methods have comparable RB and RMSE when the sample size is small (n = 100). However, as the sample-size increases (n = 400), the RB values of shared-frailty based regression estimates seem to decrease faster than those from JLCM based estimates. Thus, the JLCM with uniform prior for η is preferable for Bayesian estimates unless we are assured about the validity of the shared-frailty assumption and the sample size is large.

Table 4.

Summary statistics for estimates using data simulated from simulation study 4 to 6 that introduced in Section 4.4. RB is the average relative bias, SD is the average posterior Standard-Deviation, and RMSE is the approximate square-root of mean square error.

		JLCM			Shared-frailty Model
Simulation Model	Parameter	RB	SD	RMSE	RB	SD	RMSE
Simulation Study 4	α	−0.054	0.415	0.167	-	-	-
	β₁	0.020	0.218	0.234	−0.153	0.236	0.281
	β₂	0.021	0.160	0.165	−0.068	0.173	0.182
	β₃	−0.015	0.211	0.216	−0.094	0.238	0.246
	γ₁	−0.194	0.245	0.240	−0.327	0.293	0.283
	γ₂	−0.132	0.170	0.170	−0.080	0.206	0.184
	γ₃	−0.010	0.243	0.235	−0.034	0.292	0.269
	η₁	0.036	0.262	0.091	-	-	-
	η₂	0.085	0.428	0.135	-	-	-
	K	0.107	0.866	0.464	-	-	-
Simulation Study 5	α	−0.061	0.408	0.152	-	-	-
	β₁	−0.002	0.216	0.215	−0.168	0.235	0.470
	β₂	0.027	0.150	0.151	0.026	0.163	0.166
	β₃	0.004	0.183	0.189	0.041	0.220	0.218
	γ₁	0.228	0.259	0.243	0.485	0.304	0.445
	γ₂	0.024	0.177	0.167	0.161	0.213	0.200
	γ₃	0.108	0.276	0.282	0.184	0.324	0.317
	η₁	0.042	0.235	0.090	-	-	-
	η₂	0.040	0.377	0.124	-	-	-
	K	0.107	0.820	0.453	-	-	-
Simulation Study 6	β₁	−0.016	0.157	0.150	−0.088	0.224	0.191
	β₂	−0.051	0.155	0.147	−0.169	0.223	0.181
	β₃	−0.011	0.157	0.154	−0.088	0.223	0.187
	γ₁	−0.240	0.221	0.215	−0.319	0.271	0.261
	γ₂	−0.152	0.221	0.208	−0.205	0.270	0.261
	γ₃	−0.262	0.221	0.209	−0.289	0.270	0.252

Open in a new tab

Our next three simulation studies now compare the estimates from JLCM with ordered uniform prior for η (since it performs better than Cauchy prior in previous three simulation studies) with those from the shared-frailty model when the simulated datasets have both binary and continuous covariates and the interaction among them. So, each of these simulation studies use 500 replicates of datasets of n = 100 subjects in each with two independent covariates x₁ ~ Ber(0.5) and x₂ ~ N(0.25, 1) and their interaction x₃ = x₁×x₂. In Simulation Study 4, the simulation model is JLCM with β₁ = 0.5, β₂ = 0.2, β₃ = 0.6, γ₁ = 0.3, γ₂ = 0.4 and γ₃ = 0.3 to ensure that the simulated datasets have approximately the same expected value of Xβ and the same a number of recurrent events until terminationas as in Simulation Study 1. In Simulation Study 5, we simulate from same JLCM except with γ₁ = −0.3, γ₂ = −0.4 and γ₃ = −0.3 to ensure the direction of covariate effects on recurrent events to be different from the effects on termination (unlike in Simulation Study 4).

For Simulation Study 4–5, the values of RB, SD and RMSE of the estimates from two competing methods are in Table 4. These results show that the estimates from JLCM have similar performances to the JLCM based estimates in Simulation Study 1–2 with single binary covariate. However, the estimates from the shared-frailty model are perform worse than the results for JLCM except for the γ₂ corresponding to the effect of continuous covariate on termination. These results emphasize the earlier findings that the JLCM based estimates have substantially better performance than the shared-frailty model when the underlying true model is JLCM. Again, unlike Simulation Study 4 and 5 using simulations from JLCM, the Simulation Study 6 uses simulations from the shared-frailty model to assess the robustness of the estimates from JLCOM. The results in Table 4 show JLCM based estimates have comparable and even smaller RB than the shared-frailty model for some parameters. Values of SD and RMSE from JLCM are sometime little smaller than those from the shared-frailty model to indicate better performance of JLCM here. Overall, JLCM model based estimates have better performances than estimates from the shared-frailty model when there are multiple covariates.

Overall, these simulation studies show that the JLCM with ordered uniform priors for η performs better than JLCM with Cauchy priors, especially for small sample-size. JLCM gives reasonable estimates of regression parameters even when the true model is the shared-frailty model, and the estimates from JLCM performs much better than shared-frailty when the true model is JLCM.

5. Analysis of Heart Transplant Data

We compare (1) JLCM with ordered uniform priors for η and (2) shared-frailty model with gamma frailty using Bayesian analyses of a study of n = 114 cardiac transplant patients treated between 1992–2007 under these two competing models. Each patient is at risk of recurrent Non-Fatal Tissue Rejections (NFTR), usually treated with medication, as well as death due to GvHD (considered termination event). Some patients are censored due to loss of follow-up at their last follow-up times. The maximum number of observed recurrent NFTR events amnong these patients is 7, where the median and maximum of follow-up periods are 3 and 17.8 months. There are two binary covariates: race with x₁ = 1 for African American (AA) patients and x₁ = 0 otherwise, and Gender with x₂ = 0 for male and 1 for female.

We use independent mean 0 and variance 1 Gaussian priors for the regression parameters β_k and γ_k for k = 1, 2 to accommodate effectively non-informative prior opinions about the effects of race and gender, ordered uniform priors for vector η in JLCM, and exponential prior for the variance of the Gamma frailty of the shared-frailty model. To summarize the Bayesian analysis under two competing models, Table 5 presents the posterior means as Bayesian estimates (BE), posterior standard deviation (SD) and 95% credible interval (CI) as Bayesian interval estimates of the relevant parameters of interest.

Table 5.

Results of heart transplant data based on the partial likelihood with the non-informative prior. BE is the posterior mean (Bayesian point estimate), SD is the posterior Standard-Deviation and 95% CI is the 95% credible interval of the parameter.

	JLCM			Frailty Model
Parameter	BE	SD	95% CI	BE	SD	95% CI
β₁	0.428	0.205	(0.030,0.828)	0.429	0.159	(0.109,0.732)
β₂	0.261	0.212	(−0.153,0.651)	0.177	0.172	(−0.185,0.524)
γ₁	0.063	0.460	(−0.967,0.861)	−0.055	0.977	(−1.974,1.835)
γ₂	−0.076	0.435	(−0.973,0.807)	0.031	1.012	(−1.929,1.933)
α	0.152	0.126	(0.006,0.477)	-	-	-
η₁	0.504	0.399	(0.018,1.455)	-	-	-
η₂	1.054	0.511	(0.218,2.166)	-	-	-
η₃	1.661	0.566	(0.598,2.730)	-	-	-
K	3.212	0.755	(2.000,4.000)	-	-	-

Open in a new tab

For Bayesian analysis under JLCM, the interval estimates of K, π and η in Table 5 show a strong data evidence that this study has three latent classes with no class G₀ (K = 3 and π₀ being very close to 0). This means that this patient population has no latent class for which the number of past NFTR events has no effect on the risk of GvHD event of the patient. To understand and assess the future risk of GvHD for every patient, the effect of his/her past history of NFTR events has to be considered. The Bayesian point estimates of class effects are ${\hat{η}}_{1} = 0.504$ , ${\hat{η}}_{2} = 1.054$ , and ${\hat{η}}_{3} = 1.661$ . Results show strong evidence of increased risk and rate of NFTR recurrence for any AA patient (compared to non-AA patient) with no termination at time t because the CI of exp(β₁) is (1.03, 2.28). However, there is no strong evidence of direct race-effect on the risk of termination because the CI of γ₁ is (−0.967, 0.861), containing 0. Also, the evidence of gender-effects on both recurrence and termination are weak because the CIs of both β₂ and γ₂ contain 0. These suggest that in spite of the strong data evidence of higher risk and higher rate of NFTR recurrences for an AA patient at any time t, there is no good data evidence of the AA patient being at higher risk of death from fatal GvHD after adjustment of the effects of of latent class and number of past recurrences. As a consequence of JLCM’s property in (5) and results of our Bayesian analysis imply an increased population lifetime rate $r^{*} (t ∣ x) = \sum_{k = 0}^{K} π_{k} r_{k}^{*} (t ∣ x)$ and an population lifetime mean NFTR recurrence $μ^{*} (t ∣ x) = \sum_{k = 0}^{K} π_{k} μ_{k}^{*} (t ∣ x)$ for an AA patient compared to another non-AA patient at time t because when γ = 0 (as our Bayesian analysis results suggest for this study) we have $r^{*} (t ∣ x) = exp (β x_{i}) λ_{0} (t) \sum_{k = 0}^{K} π_{k} S_{k} (t) exp {η_{k} Λ_{0} (t)}$ and similar expression for μ*(t|x).

The advantages of our JLCM based analysis is that we can compare the expected event profiles of two patients, say, an African American (AA) patient (x₁ = 1) versus a non-AA patient (x₁ = 0) of same gender within same latent class. The ratio $e^{β_{1}}$ of their NFTR recurrence rates before termination at any time t has posterior mean 1.53 and CI (1.030,2.288) if they are from the same latent class. The ratio of risks of first recurrence (dynamic comparison given past history of recurrences at time t) between these two patients is also same as the rate-ratio $e^{β_{1}}$ . However, this ratio of risks of recurrence is $(η_{j} + e^{β_{1}})$ if, say, an AA male patient is at risk for the second recurrence and the non-AA male patient is still at risk of first NFTR at that time-point. The Bayesian point estimates for this risk-ratio are 2.038, 2.588 and 3.195 when they from classes 1, 2 or 3 respectively. Because our JLCM based analysis produces moderate number of latent classes, it is possible to compare the dynamic event profiles and mean/rate of events among patients from two different latent classes and even among patients with latent classes unknown. For example, the interval estimate of ratio of mean number of recurrences is (1.5, 4.3) when the latent class is unknown and the model even incorporates covariate effects on termination. Unfortunately, for the sake of brevity, we omit detailed comparisons of future event trajectories of different patients.

JLCM based Bayesian analysis also allows the estimation of probability π_k of any patient being in a latent class G_k and also facilitates the updating the estimates given the past events history of any subject. For example, Bayesian point estimate of π₃ is 0.7 for an AA Male patient with recurrence history as a patient i = 6 and without termination compared to this Bayesian estimate being less than 0.2 for a future patient with events history similar to the patient i = 1.

In Table 5, the posterior means and CIs of β₁ and β₂ under shared-frailty model are close to the corresponding estimates from JLCM. Overall, analysis from both models have agreement about the evidences of dynamic effects of race and gender on NFTR recurrence and termination conditional on history. However, the shared-frailty model cannot effectively interpret the ratio of rates of NFTR recurrence and ratio of termination risk of two patients with different covariate values. So, the JLCM based analysis is preferable because it allows comparisons of event profiles of two future patients and accommodates a comprehensive interpretation of covariate effects on all relevant functions.

6. Conclusion and Discussion

Our novel JLCM achieves five major practical/clinical goals: (1) explaining the effect of covariates on the future event profiles within each patient; (2) evaluating the risk of events in [t, t + dt) given the history $H (t -)$ ; (3) assessing the risk of termination given $H (t -)$ ; (4) explaining the heterogeneity among patients via latent class parameters η; (5) providing predictions of future events. Unlike JLCM, existing methods often focus on single main response of interest (say, recurrence) and the corresponding regression function of interest (say, mean number of recurrence), and regression parameters of mean recurrence, in general, do not have any physical interpretation for another regression function, say, for hazard function for termination (Miloslavsky et al., 2004).

We can accommodate right-predictable time-varying covariate x_i(t) within the joint latent class model of (2–3) via re-expressing them as $λ_{j} (t ∣ H_{i} (t -); η_{j}) = λ_{0} (t) [η_{j} k + exp (β^{'} x_{i} (t))]$ and $h_{j} (t ∣ H_{i} (t -); η_{j}) = h_{0} (t) e^{γ x_{i} (t) + α η_{j}}$ , where the event history $H_{i} (t -) = {N_{i} (u), D_{i} (u), A_{i} (u), x_{i} (u) : u < t}$ now also contains the information about the sample-path $X_{i} (t) = {x_{i} (u) : u ⩽ t}$ of the predictable process {x_i(·)} up to time t. Our full Bayesian method for studies with time-varying covariates is similar to what is presented in Section 3 as long as the entire sample path of time-varying x_i(t) have been available in the interval when A_i(t) = 1. To facilitate the partial likelihoods (10) and (11) for our Bayesian method based on partial likelihoods will only require this time-varying x_i(t) to be measured/known for all subjects at risk/observation at each event time (Li et al., 2016). Instead of (2), $d μ_{j} (t ∣ X_{i} (t)) = E [d N_{i} (t) ∣ x_{i} (t); η_{j}] = d Λ_{0} (t) [η_{j} μ_{j} (t ∣ x_{i} (t)) + e^{β x_{i} (t)}$ is the new differential equation of the mean function (expected number) $μ_{j} (t ∣ X_{i}) = E [N_{i} (t) ∣ A_{i} (t) = 1; X_{i} (t), η_{j}]$ of recurrences given the patient in class G_j, with class-effect η_j, is under observation at time t. For ease of presentation, we consider the special case of piecewise constant x_i(·) with X_i(t) = x_ik and for all t ∈ I_k = (a_k−1, a_k] with the grid 0 = a₀ < a₁ < ⋯ < a_K−1 < a_K = ∞. The solution of this differential equation in this case is the recursive formula

μ_{j} (t ∣ X_{i} (t)) = μ_{j} (a_{k - 1} ∣ X_{i} (a_{k - 1})) e^{η_{j} Λ_{0} (a_{k - 1}, t)} + \frac{θ_{i k}}{η_{j}} [e^{η_{j} Λ_{0} (a_{k - 1}, t)} - 1] for t \in (a_{k - 1}, a_{k}],

(13)

where θ_ik = exp(βx_ik) and $Λ_{0} (a, b) = \int_{a}^{b} λ_{0} (t) d t$ for 0 ⩽ a < b. Unlike (4), the class-specific rate function $d μ_{j} (t ∣ X_{i} (t)) / d t = {θ_{i k} + η μ_{j} (a_{k - 1} ∣ X_{i} (a_{k - 1}))} λ_{0} (t) exp {η_{j} Λ_{0} (a_{k - 1}, t)}$ as well as the population rate function $d μ (t ∣ X_{i} (t)) / d t$ given {T_i > t} corresponding to (13) can not be expressed as a product of exp(β′x_i(t)) and a baseline function free of $X_{i} (t))$ . However, the expression in (13) shows that similar to the fixed covariate case, the effect of the sample-path $X_{i} (t)$ of time-varying x₍·) on mean function has two parts. The multiplicative effect of the current covariate value x_i(t) is accommodated in the second-term of right-hand-side of equation (13), and the first part accommodates the effects of past sample path x_i(u) for u < t. Obviously for this case, past sample-path x_i(u) for u < t may be different from the current value x_i(t) of the covariate. Using arguments similar to what were used for deriving (5), we obtain the mean $μ_{j}^{*} (t ∣ X_{i} (t)) = E [N_{i}^{*} (t) ∣ X_{i} (t); η_{j}] = \int_{0}^{t} S_{j} (u ∣ X_{i} (u)) d μ_{j} (u ∣ X_{i} (u))$ and the corresponding rate function

r_{j}^{*} (t ∣ X_{i} (t)) = {θ_{i k} + μ_{j} (a_{k - 1} ∣ X_{i} (a_{k - 1}))} S_{j} (t ∣ X_{i} (t)) λ_{0} (t) exp {η_{j} Λ_{0} (a_{k - 1}, t)}

(14)

of $N_{i}^{*} (t) = N_{i} (M i n {T_{i}, t})$ for $t \in (a_{k - 1}, a_{k}]$ , where $S_{j} (t ∣ X_{i} (t)) = exp [- \int_{0}^{t} h_{0} (u) {exp (γ x_{i} (u)) + α η_{j}} d u]$ . In (14), the first term of $r_{j}^{*} (t ∣ X_{i} (t))$ representing the effect of the current value of covariate x_i(t) is proportional to θ_ik = exp(β′x_i(t)). Computing the posterior estimates of $μ_{j}^{*} (t ∣ X (t))$ and $r_{j}^{*} (t ∣ X (t))$ of any future patient are straightforward within Bayesian analysis as long as we use full Bayesian analysis (instead of partial likelihood based Bayesian analysis) that presents a Bayesian estimate of Λ₀(t).

We present an innovative MCMC based tool that is scalable via popular Bayesian software such as JAGS (used in this paper) and WinBUGS because our computational method does not need Reversible Jump MCMC. This code is made available in Appendix II. We note that this JAGS code is not computationally feasible for massive datasets, and in this setting, we suggest optimizing the code using other software besides the standard JAGS option. Our simulation results show that JLCM produces good regression estimates even when the true model is not JLCM. Even though, we only consider non-negative η_j (appropriate for our transplant study), one can, in principle, consider even negative η_j as long as exp(βx) + η_jN_i(t−) > 0 for all observed values of N_i(t−). Irrespective of the true model, JLCM based analysis is preferable because it allows comparisons of event profiles of two future subjects (via estimating class effects) and accommodates a comprehensive interpretation of covariate effects on all relevant functions.

Table 3.

Summary statistics for estimates using data simulated from model introduced in Section 4.4 (i.e., a shared-frailty model). RB is the average relative bias, SD is the average posterior Standard-Deviation, and RMSE is the approximate square-root of mean square error.

		n=100			n=400
Methods	Parameter	RB	SD	RMSE	RB	SD	RMSE
JLCM with uniform prior for η	β	−0.031	0.200	0.202	−0.012	0.116	0.121
JLCM with uniform prior for η	γ	−0.157	0.207	0.260	−0.108	0.149	0.159
JLCM with Cauchy prior for η	β	−0.050	0.198	0.207	−0.003	0.153	0.152
JLCM with Cauchy prior for η	γ	−0.152	0.207	0.257	−0.105	0.151	0.163
Shared-frailty Model	β	−0.036	0.208	0.211	−0.006	0.144	0.143
Shared-frailty Model	γ	−0.122	0.235	0.265	−0.087	0.146	0.151

Open in a new tab

Appendix I: Partial Likelihood Based Posterior As The Marginal Posterior

We are going to show the partial likelihood based posterior in (12) is an approximation of the marginal posterior after integrating out the cumulative baseline function Λ₀(t) and H₀(t) from the joint posterior of (8). For GP(Λ*, b_λ) prior on Λ₀(t), each increment Λ_0q of the Λ₀(t) in in interval I_q = (t_q−1, t_q], has a Gamma prior Ga(a_λw_0qb_λ, b_λ), where w_q = (t_q −t_q−1) and t₁ < ⋯ < t_Q are the ordered distinct event times. Then we integrate out the increments dΛ₀(t) from the (6) as follows,

P L_{R} (β, η, M ∣ a_{λ}, b_{λ}; Y_{0}) = \int L_{R} (β, η, Λ_{0}, M ∣ Y_{0}) \times p_{1} (Λ_{0} ∣ a_{λ}, b_{λ}) d Λ_{0} = \prod_{q = 1}^{Q} {\int \prod_{i = 1}^{n} {Λ_{0 q} (N_{i q} W_{i}^{*} + θ_{i})}^{n_{i q}} \times e^{- Λ_{0 q} \sum_{i = 1}^{n} A_{i q} (N_{i q} W_{i}^{*} + θ_{i})} \times e^{- b_{λ} Λ_{0 q}} {(Λ_{0 q})}^{a_{λ} w_{q} b_{λ} - 1} d Λ_{0 q}} = \prod_{q = 1}^{Q} {\prod_{i = 1}^{n} {(N_{i q} W_{i}^{*} + θ_{i})}^{n_{i q}} \times \int e^{- Λ_{0 q} [\sum_{i = 1}^{n} A_{i q} (N_{i q} W_{i}^{*} + θ_{i}) + b_{λ}]} \times {(Λ_{0 q})}^{\sum_{i = 1}^{n} n_{i q} + a_{λ} w_{q} b_{λ} - 1} d Λ_{0 q}} = \prod_{q = 1}^{Q} {\prod_{i = 1}^{n} (N_{i q} W_{i}^{*} + θ_{i})}^{n_{i q}} \frac{Γ (\sum_{i = 1}^{n} n_{i q} + a_{λ} w_{q} b_{λ})}{[b_{λ} + \sum_{i = 1}^{n} A_{i q} (N_{i q} W_{i}^{*} + θ_{i})] \sum_{i = 1}^{n} n_{i q} + a_{λ} w_{q} b_{λ}} \propto \prod_{q = 1}^{Q} {\prod_{i = 1}^{n} (N_{i q} W_{i}^{*} + θ_{i})}^{n_{i q}} {[b_{λ} + \sum_{i = 1}^{n} A_{i q} (N_{i q} W_{i}^{*} + θ_{i})]}^{- \sum_{i = 1}^{n} n_{i q} - a_{λ} w_{q} b_{λ}} .

When we choose a very diffuse Gamma processes with b_λ and a_λ → 0, then the above marginal likelihood PL_R(β, η, M) from recurrent events is approximately (in the limit) $\prod_{q = 1}^{Q} {\prod_{i = 1}^{n} (N_{i q} W_{i}^{*} + θ_{i})}^{n_{i q}} {[\sum_{i = 1}^{n} A_{i q} (N_{i q} W_{i}^{*} + θ_{i})]}^{- \sum_{i = 1}^{n} n_{i q}}$ , same as the partial likelihood of (10) from recurrent events. Using similar steps as above, we can show that the marginal likelihood (after integrating H₀(·)) from (y_i, δ_i)

P L_{S} (γ, η, M ∣ Y_{0}) \int L_{S} (γ, η, H_{0}, M ∣ Y_{0}) \times p_{2} (H_{0} ∣ a_{h}, b_{h}) d H_{0} \propto \prod_{i = 1}^{n} e^{(γ x_{i} + α W_{i}^{*}) δ_{i}} {[b_{h} + \sum_{i = 1}^{n} A_{j} (y_{i}) (γ x_{i} + α W_{i}^{*})]}^{- δ_{i} - a_{h} w_{q} b_{h}} \to P L_{S} (γ, η, M ∣ Y_{0}),

as b_h and a_h → 0 (11).

Appendix II: Model Code in JAGS

# input data:
# x1, x2, x3: covariates
# YN[i, j]: number of events happened before time t[j] for subject i.
# Y[i, j]: indicator to show patients i is at risk or not at time t[j]
# t[j]: time point when j-th event happen among all subjects
# T: number of total different event time for all subjects
# N: total subjects number
# final[i]: location of censored time for subject i in variable t.
#Start model
model{
  # compute the log-likelihood by using the zero-trick in Poisson distribution 
  for(i in 1:N) { #Begin loop over subjects
    zeros[i]^~ dpois(zeros.mean[i])
    M[i]^~ dcat(pi[]) # the group for i-th subject
    for(j in 1:T) {#Begin loop over distinct recurrent event times
      Log.S1[i, j]=−dL0[j]*(K[i, j]*eta[M[i]]+exp(x1[i]*beta[1]+x2[i]*beta[2]+x3[i]*beta[3]))*Y[i, j]
      Log.Lambda1[i, j]=(log(dL0[j])−log(t[j+1]-t[j])+log(K[i, j]*eta[M[i]]+exp(x1[i]*beta[1]+x2[i]*beta[2]+x3[i]*beta[3])))*YN[i, j]
      dH[i, j]=dH0[j]*exp(x1[i]*gamma[1]+x2[i]*gamma[2]+x3[i]*gamma[3]+alpha*eta[M[i]])*Y[i, j]
    }
    L1[i]=sum(Log.Lambda1[i, 1:T])+sum(Log.S1[i, 1:T]) 
    log.H1[i]=−sum(dH[i, 1:T]) 
    log.H2[i]=(log(dH0[final[i]-1])−log(t[final[i]]-t[final[i]−1])+x1[i]*gamma[1]+x2[i]*gamma[2]+x3[i]*gamma[3]+alpha*eta[M[i]])*fail[i]
    L2[i]=log.H1[i]+log.H2[i] 
    LL[i]=L1[i]+L2[i]
    zeros.mean[i]=−LL[i]+C
  }
  # prior settings 
  for (j in 1:T) {#Gamma process prior
    dL0[j]^~dgamma((t[j+1]-t[j]), 0.001)
    dH0[j]^~dgamma((t[j+1]-t[j]), 0.001)
  }
  #prior for regression parameters 
  for(i in 1:3){
    beta[i]^~dnorm(0, 0.16) 
    gamma[i]^~dnorm(0, 0.16)
  }
  alpha^~dgamma(1, 1)
  # ordered prior for eta W[1]=0
  for(m in 2:num_class){
    W[m]^~dunif(0, 3)
  } 
  eta=sort(W)
  #establish a Dirichlet prior
  for(m in 1:num_class){ 
    a[m]^~dgamma(1, 1) 
    p[m]=ifelse(m<=KM, 1, 0)
    pi[m]<-a[m]*p[m]
  }
  #number of groups
  KM1^~dpois(num_class −1)T(0, num_class −1) # number of groups exclude group 0.
  KM=KM1+1
}

References

Barrett J, Diggle P, Henderson R, and Taylor-Robinson D (2015). Joint modelling of repeated measurements and time-to-event outcomes: flexible model specification and exact likelihood inference. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 77, 131–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bender R, Augustin T, and Blettner M (2005). Generating survival times to simulate cox proportional hazards models. Statistics in Medicine 24, 1713–1723. [DOI] [PubMed] [Google Scholar]
Cook RJ and Lawless J (2007). The statistical analysis of recurrent events. Springer Science & Business Media. [Google Scholar]
Cook RJ, Lawless JF, and Lee K-A (2010). A copula-based mixed poisson model for bivariate recurrent events under event-dependent censoring. Statistics in Medicine 29, 694–707. [DOI] [PubMed] [Google Scholar]
Cox D (1972). Regression analysis and life table. Journal of the RoyalStatistical Society: Series B (Statistical Methodology) 34, 187–222. [Google Scholar]
Gelman A et al. (2006). Prior distributions for variance parameters in hierarchical models (comment on article by browne and draper). Bayesian Analysis 1, 515–534. [Google Scholar]
Gelman A, Jakulin A, Pittau MG, and Su Y-S (2008). A weakly informative default prior distribution for logistic and other regression models. The Annals of Applied Statistics pages 1360–1383. [Google Scholar]
Han J, Slate EH, and Peña EA (2007). Parametric latent class joint model for a longitudinal biomarker and recurrent events. Statistics in Medicine 26, 5285–5302. [DOI] [PMC free article] [PubMed] [Google Scholar]
Heitjan DF and Rubin DB (1991). Ignorability and coarse data. The Annals of Statistics pages 2244–2253. [Google Scholar]
Higgins R and Fishman J (2006). Disparities in solid organ transplantation for ethnic minorities: Facts and solutions. American Journal of Transplantation 6, 2556–2562. [DOI] [PubMed] [Google Scholar]
Hougaard P (2000). Analysis of multivariate survival data. Springer Science & Business Media. [Google Scholar]
Huang C-Y, Qin J, and Wang M-C (2010). Semiparametric analysis for recurrent event data with time-dependent covariates and informative censoring. Biometrics 66, 39–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
Huang C-Y and Wang M-C (2004). Joint modeling and estimation for recurrent event processes and failure time data. Journal of the American Statistical Association 99, 1153–1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ibrahim JG, Chen M-H, and Sinha D (2005). Bayesian survival analysis. Wiley Online Library. [Google Scholar]
Kalbfleisch JD (1978). Non-parametric bayesian analysis of survival time data. Journal of the Royal Statistical Society. Series B (Methodological) pages 214–221. [Google Scholar]
Kalbfleisch JD and Prentice RL (2002). Relative risk (cox) regression models. The Statistical Analysis of Failure Time Data, Second Edition pages 95–147. [Google Scholar]
Kalbfleisch JD, Schaubel DE, Ye Y, and Gong Q (2013). An estimating function approach to the analysis of recurrent and terminal events. Biometrics 69, 366–374. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lancaster T and Intrator O (1998). Panel data with survival: hospitalization of hiv-positive patients. Journal of the American Statistical Association 93, 46–53. [Google Scholar]
Lawless J (1995). The analysis of recurrent events for multiple subjects. Applied Statistics pages 487–498. [Google Scholar]
Lee J and Cook RJ (2019). Dependence modeling for multi-type recurrent events via copulas. Statistics in Medicine 38, 4066–4082. [DOI] [PubMed] [Google Scholar]
Li S, Sun Y, Huang CY, Follmann DA, and Krause R (2016). Recurrent event data analysis with intermittently observed time-varying covariates. Statistics in medicine 35, 30493065. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lin D, Wei L, Yang I, and Ying Z (2000). Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 62, 711–730. [Google Scholar]
Liu L, Wolfe RA, and Huang X (2004). Shared frailty models for recurrent events and a terminal event. Biometrics 60, 747–756. [DOI] [PubMed] [Google Scholar]
Miller JW and Harrison MT (2016). Mixture models with a prior on the number of components. Journal of the American Statistical Association. [DOI] [PMC free article] [PubMed] [Google Scholar]
Miloslavsky M, Kele s, S., Laan MJ, and Butler S (2004). Recurrent events analysis in the presence of time-dependent covariates and dependent censoring. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 66, 239–257. [Google Scholar]
Neal RM (2000). Markov chain sampling methods for dirichlet process mixture models. Journal of computational and graphical statistics 9, 249–265. [Google Scholar]
Oakes D (1992). Frailty models for multiple event times. In Survival Analysis: State of the Art, pages 371–379. Springer. [Google Scholar]
Ouyang B, Sinha D, Slate EH, and Van Bakel AB (2013). Bayesian analysis of recurrent event with dependent termination: an application to a heart transplant study. Statistics in Medicine 32, 2629–2642. [DOI] [PMC free article] [PubMed] [Google Scholar]
Paulon G, Iorio M, Guglielmi A, and Ieva F (2018). Joint modeling of recurrent events and survival: a bayesian non-parametric approach. Biostatistics. [DOI] [PubMed] [Google Scholar]
Proust-Lima C, Séne M, Taylor JM, and Jacqmin-Gadda H (2014). Joint latent class models for longitudinal and time-to-event data: A review. Statistical Methods in Medical Research 23, 74–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
Proust-Lima C and Taylor JM (2009). Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment psa: a joint modeling approach. Biostatistics 10, 535–549. [DOI] [PMC free article] [PubMed] [Google Scholar]
Qu L, Sun L, and Liu L (2017). Joint modeling of recurrent and terminal events using additive models. Statistics and Its Interface 10, 699–710. [Google Scholar]
Shih JH and Louis TA (1995). Inferences on the association parameter in copula models for bivariate survival data. Biometrics pages 1384–1399. [PubMed] [Google Scholar]
Sinha D, Maiti T, Ibrahim JG, and Ouyang B (2008). Current methods for recurrent events data with dependent termination: a bayesian perspective. Journal of the American Statistical Association 103, 866–878. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xu G, Chiou SH, Huang C-Y, Wang M-C, and Yan J (2017). Joint scale-change models for recurrent events and failure time. Journal of the American Statistical Association 112, 794–805. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ye Y, Kalbfleisch JD, and Schaubel DE (2007). Semiparametric analysis of correlated recurrent and terminal events. Biometrics 63, 78–87. [DOI] [PubMed] [Google Scholar]
Zeng D and Lin D (2009). Semiparametric transformation models with random effects for joint analysis of recurrent and terminal events. Biometrics 65, 746–752. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] Barrett J, Diggle P, Henderson R, and Taylor-Robinson D (2015). Joint modelling of repeated measurements and time-to-event outcomes: flexible model specification and exact likelihood inference. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 77, 131–148. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] Bender R, Augustin T, and Blettner M (2005). Generating survival times to simulate cox proportional hazards models. Statistics in Medicine 24, 1713–1723. [DOI] [PubMed] [Google Scholar]

[R3] Cook RJ and Lawless J (2007). The statistical analysis of recurrent events. Springer Science & Business Media. [Google Scholar]

[R4] Cook RJ, Lawless JF, and Lee K-A (2010). A copula-based mixed poisson model for bivariate recurrent events under event-dependent censoring. Statistics in Medicine 29, 694–707. [DOI] [PubMed] [Google Scholar]

[R5] Cox D (1972). Regression analysis and life table. Journal of the RoyalStatistical Society: Series B (Statistical Methodology) 34, 187–222. [Google Scholar]

[R6] Gelman A et al. (2006). Prior distributions for variance parameters in hierarchical models (comment on article by browne and draper). Bayesian Analysis 1, 515–534. [Google Scholar]

[R7] Gelman A, Jakulin A, Pittau MG, and Su Y-S (2008). A weakly informative default prior distribution for logistic and other regression models. The Annals of Applied Statistics pages 1360–1383. [Google Scholar]

[R8] Han J, Slate EH, and Peña EA (2007). Parametric latent class joint model for a longitudinal biomarker and recurrent events. Statistics in Medicine 26, 5285–5302. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Heitjan DF and Rubin DB (1991). Ignorability and coarse data. The Annals of Statistics pages 2244–2253. [Google Scholar]

[R10] Higgins R and Fishman J (2006). Disparities in solid organ transplantation for ethnic minorities: Facts and solutions. American Journal of Transplantation 6, 2556–2562. [DOI] [PubMed] [Google Scholar]

[R11] Hougaard P (2000). Analysis of multivariate survival data. Springer Science & Business Media. [Google Scholar]

[R12] Huang C-Y, Qin J, and Wang M-C (2010). Semiparametric analysis for recurrent event data with time-dependent covariates and informative censoring. Biometrics 66, 39–49. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] Huang C-Y and Wang M-C (2004). Joint modeling and estimation for recurrent event processes and failure time data. Journal of the American Statistical Association 99, 1153–1165. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] Ibrahim JG, Chen M-H, and Sinha D (2005). Bayesian survival analysis. Wiley Online Library. [Google Scholar]

[R15] Kalbfleisch JD (1978). Non-parametric bayesian analysis of survival time data. Journal of the Royal Statistical Society. Series B (Methodological) pages 214–221. [Google Scholar]

[R16] Kalbfleisch JD and Prentice RL (2002). Relative risk (cox) regression models. The Statistical Analysis of Failure Time Data, Second Edition pages 95–147. [Google Scholar]

[R17] Kalbfleisch JD, Schaubel DE, Ye Y, and Gong Q (2013). An estimating function approach to the analysis of recurrent and terminal events. Biometrics 69, 366–374. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] Lancaster T and Intrator O (1998). Panel data with survival: hospitalization of hiv-positive patients. Journal of the American Statistical Association 93, 46–53. [Google Scholar]

[R19] Lawless J (1995). The analysis of recurrent events for multiple subjects. Applied Statistics pages 487–498. [Google Scholar]

[R20] Lee J and Cook RJ (2019). Dependence modeling for multi-type recurrent events via copulas. Statistics in Medicine 38, 4066–4082. [DOI] [PubMed] [Google Scholar]

[R21] Li S, Sun Y, Huang CY, Follmann DA, and Krause R (2016). Recurrent event data analysis with intermittently observed time-varying covariates. Statistics in medicine 35, 30493065. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] Lin D, Wei L, Yang I, and Ying Z (2000). Semiparametric regression for the mean and rate functions of recurrent events. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 62, 711–730. [Google Scholar]

[R23] Liu L, Wolfe RA, and Huang X (2004). Shared frailty models for recurrent events and a terminal event. Biometrics 60, 747–756. [DOI] [PubMed] [Google Scholar]

[R24] Miller JW and Harrison MT (2016). Mixture models with a prior on the number of components. Journal of the American Statistical Association. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] Miloslavsky M, Kele s, S., Laan MJ, and Butler S (2004). Recurrent events analysis in the presence of time-dependent covariates and dependent censoring. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 66, 239–257. [Google Scholar]

[R26] Neal RM (2000). Markov chain sampling methods for dirichlet process mixture models. Journal of computational and graphical statistics 9, 249–265. [Google Scholar]

[R27] Oakes D (1992). Frailty models for multiple event times. In Survival Analysis: State of the Art, pages 371–379. Springer. [Google Scholar]

[R28] Ouyang B, Sinha D, Slate EH, and Van Bakel AB (2013). Bayesian analysis of recurrent event with dependent termination: an application to a heart transplant study. Statistics in Medicine 32, 2629–2642. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] Paulon G, Iorio M, Guglielmi A, and Ieva F (2018). Joint modeling of recurrent events and survival: a bayesian non-parametric approach. Biostatistics. [DOI] [PubMed] [Google Scholar]

[R30] Proust-Lima C, Séne M, Taylor JM, and Jacqmin-Gadda H (2014). Joint latent class models for longitudinal and time-to-event data: A review. Statistical Methods in Medical Research 23, 74–90. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] Proust-Lima C and Taylor JM (2009). Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment psa: a joint modeling approach. Biostatistics 10, 535–549. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] Qu L, Sun L, and Liu L (2017). Joint modeling of recurrent and terminal events using additive models. Statistics and Its Interface 10, 699–710. [Google Scholar]

[R33] Shih JH and Louis TA (1995). Inferences on the association parameter in copula models for bivariate survival data. Biometrics pages 1384–1399. [PubMed] [Google Scholar]

[R34] Sinha D, Maiti T, Ibrahim JG, and Ouyang B (2008). Current methods for recurrent events data with dependent termination: a bayesian perspective. Journal of the American Statistical Association 103, 866–878. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] Xu G, Chiou SH, Huang C-Y, Wang M-C, and Yan J (2017). Joint scale-change models for recurrent events and failure time. Journal of the American Statistical Association 112, 794–805. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] Ye Y, Kalbfleisch JD, and Schaubel DE (2007). Semiparametric analysis of correlated recurrent and terminal events. Biometrics 63, 78–87. [DOI] [PubMed] [Google Scholar]

[R37] Zeng D and Lin D (2009). Semiparametric transformation models with random effects for joint analysis of recurrent and terminal events. Biometrics 65, 746–752. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Joint Analysis of Recurrence and Termination: A Bayesian Latent Class Approach

Zhixing Xu

Debajyoti Sinha

Jonathan R Bradley

Summary:

1. Introduction

2. Joint Latent Class Model

3. Bayesian Analysis of Joint Model

Bayesian Analysis with Partial Likelihood:

4. Simulation Study

Table 1.

Table 2.

Table 4.

5. Analysis of Heart Transplant Data

Table 5.

6. Conclusion and Discussion

Table 3.

Appendix I: Partial Likelihood Based Posterior As The Marginal Posterior

Appendix II: Model Code in JAGS

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Joint Analysis of Recurrence and Termination: A Bayesian Latent Class Approach

Zhixing Xu

Debajyoti Sinha

Jonathan R Bradley

Summary:

1. Introduction

2. Joint Latent Class Model

3. Bayesian Analysis of Joint Model

Bayesian Analysis with Partial Likelihood:

4. Simulation Study

Table 1.

Table 2.

Table 4.

5. Analysis of Heart Transplant Data

Table 5.

6. Conclusion and Discussion

Table 3.

Appendix I: Partial Likelihood Based Posterior As The Marginal Posterior

Appendix II: Model Code in JAGS

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases