Bayesian Model Assessment in Joint Modeling of Longitudinal and Survival Data with Applications to Cancer Clinical Trials

Danjie Zhang; Ming-Hui Chen; Joseph G Ibrahim; Mark E Boye; Wei Shen

doi:10.1080/10618600.2015.1117472

. Author manuscript; available in PMC: 2017 Feb 23.

Published in final edited form as: J Comput Graph Stat. 2017 Feb 16;26(1):121–133. doi: 10.1080/10618600.2015.1117472

Bayesian Model Assessment in Joint Modeling of Longitudinal and Survival Data with Applications to Cancer Clinical Trials

Danjie Zhang ^*, Ming-Hui Chen ^†, Joseph G Ibrahim ^‡, Mark E Boye ^§, Wei Shen ^§

PMCID: PMC5321618 NIHMSID: NIHMS740627 PMID: 28239247

Summary

Joint models for longitudinal and survival data are routinely used in clinical trials or other studies to assess a treatment effect while accounting for longitudinal measures such as patient-reported outcomes (PROs). In the Bayesian framework, the deviance information criterion (DIC) and the logarithm of the pseudo marginal likelihood (LPML) are two well-known Bayesian criteria for comparing joint models. However, these criteria do not provide separate assessments of each component of the joint model. In this paper, we develop a novel decomposition of DIC and LPML to assess the fit of the longitudinal and survival components of the joint model, separately. Based on this decomposition, we then propose new Bayesian model assessment criteria, namely, ΔDIC and ΔLPML, to determine the importance and contribution of the longitudinal (survival) data to the model fit of the survival (longitudinal) data. Moreover, we develop an efficient Monte Carlo method for computing the Conditional Predictive Ordinate (CPO) statistics in the joint modeling setting. A simulation study is conducted to examine the empirical performance of the proposed criteria and the proposed methodology is further applied to a case study in mesothelioma.

Keywords: CPO, DIC, LPML, Monte Carlo method, Patient-reported outcome (PRO)

1 Introduction

Recently, joint modeling of longitudinal and time-to-event outcomes has become more popular in the analysis of patient-reported outcomes (PROs) for the purpose of evaluating the efficacy and tolerability of cancer treatment. In oncology applications, information from the patients’ perspectives can be useful in evaluating actual patients’ experiences on dimensions known to be important to them and also associated with treatment outcomes. The field of PROs has evolved and reached a common understanding about good clinical practices for the use of PROs (Rothman et al., 2009). In addition, the U.S. and European regulators have published guidance on the use of these measures to support PRO-based claims in pharmaceutical product labeling (European Medicines Agency, 2005; US Food and Drug Administration Guidance for Industry, 2009) (DeMuro et al., 2013). Siddiqui et al. (2014) reviewed and addressed issues regarding the “why, how, and what” of PROs as well as cancer survivorship because it closely relates to PROs. Building on previous joint modeling work in a highly symptomatic and particularly fatal cancer (Wang et al., 2012; Hatfield et al., 2011, 2012; and Zhang et al., 2014, 2015a), we develop new Bayesian methodology on how to evaluate the distinct effects of longitudinal and time-to-event outcomes on the fit of a joint model.

A popular approach in joint modeling of longitudinal and survival data is based on shared random effects, where the longitudinal component and the survival component of the joint model share common random effects and these random effects then induce correlation between the longitudinal and survival data. There are two basic formulations of the joint model. The first is the “trajectory model” (TM), where one substitutes the time trajectory function from the longitudinal component into the hazard function of the survival component, and in this case, the trajectory function acts like a time-varying covariate in the survival component. The second formulation is the shared parameter model (SPM), which directly includes the random effects as covariates in the survival component. One of the main advantages of the TM is that it leads to a straightforward interpretation of the association between the longitudinal measure and survival time through the direct inclusion of the trajectory function in the hazard. For the SPM, the characterization of the association is much more complex and can only be analytically determined once the random effects have been integrated out, as the two components of the model are independent conditional on these random effects. However, the TM is computationally more expensive compared to the SPM. In addition, the TM requires extrapolation beyond the last time at which the longitudinal measure is observed in the survival component. The SPM typically fits the survival component of the joint model better as it directly includes the random effects as covariates in the survival component. There is a very rich literature concerning these two basic approaches. The TM has been considered in Schluchter (1992), Hogan and Laird (1997), Law et al. (2002), Brown and Ibrahim (2003), Chen et al. (2004) Ibrahim et al. (2004), Brown et al. (2005), Chi and Ibrahim (2006), Chi and Ibrahim (2007), and Ibrahim et al. (2010) for joint modeling with biomedical applications. There has also been much work on using the SPM, including Pawitan and Self (1993), DeGruttola and Tu (1994), Lavalley and DeGruttola (1996), Henderson et al. (2000), Xu and Zeger (2001a, 2001b), and Song et al. (2002) for univariate or multivariate longitudinal data. An excellent review on joint modeling of longitudinal and survival data is given in Tsiatis and Davidian (2004) and an overview of joint models for longitudinal and time-to-event data can be found in Ibrahim, Chen, and Sinha (2001, Chapter 7) and Rizopoulos (2012a). There are several R packages available in fitting joint models, including JM (Rizopoulos, 2012b), JMbayes (Rizopoulos, 2014), and joineR (Philipson et al., 2012). There is also a Stata module stjm (Crowther, 2012; Crowther et al., 2013), which fits shared random effects models. In addition, another R package, lcmm (Proust-Lima et al., 2014), fits joint models based on shared latent classes.

One important issue in the joint modeling of longitudinal and survival data concerns the separate contribution of the model components to the overall goodness-of-fit of the joint model. Zhang et al. (2014) developed a decomposition of AIC and BIC to assess the fit of each component of the joint model. A SAS macro, called JMFit, (Zhang et al., 2015b) implements a variety of popular joint models and provides several model assessment measures including the decomposition of AIC and BIC as well as ΔAIC and ΔBIC. Within the Bayesian framework, Hanson et al. (2011) proposed to use LPML to predict survival times conditional on the longitudinal component of the model. In this paper, we derive a novel decomposition of the DIC and LPML criteria into additive components that will allow us to assess the goodness of fit for each component of the joint model. Such a development is extremely important since it not only allows us to quantify the contribution of the longitudinal data to the fit of the survival data or the contribution of the survival data to the fit of the longitudinal data, but it also allows us to identify which PROs are most highly associated with survival outcomes, a finding with significant clinical implications. In addition, we also develop a new Monte Carlo (MC) method for computing the CPO statistics which may involve intractable high-dimensional integrals. The proposed MC approach for computing the CPO has a potential to lead to a gain in computing time compared to a numerical approximation approach, particularly in the joint modeling setting. To illustrate our proposed method, we only consider (i) polynomial trajectories and independent and identically distributed Gaussian noise for longitudinal measures and (ii) the Cox model with a piecewise constant baseline hazard function for survival data in our simulation study and real data analysis. However, the proposed method can be applied to other types of longitudinal trajectories and other types of survival models such as those considered in Hanson et al. (2011).

The rest of the paper is organized as follows. Section 2 presents the joint models and the likelihood and posterior. The first decomposition (Decomposition I) of DIC and LPML (i.e., DIC= DIC_Long + DIC_Surv|Long and LPML= LPML_Long+LPML_Surv|Long), the corresponding two new criteria (i.e., ΔDIC_Surv and ΔLPML_Surv), as well as a new Monte Carlo method for computing CPO are also developed in Section 2. A simulation study is conducted in Section 3, and a comprehensive analysis of the longitudinal and survival data from a cancer clinical trial is carried out in Section 4. We conclude the paper with a brief discussion in Section 5. Prior specification and posterior computation are discussed in Appendix A of the supplementary material. In addition, we develop the second decomposition (Decomposition II) of DIC and LPML (i.e., DIC= DIC_Surv + DIC_Long|Surv and LPML= LPML_Surv + LPML_Long|Surv) and the corresponding ΔDIC_Long and ΔLPML_Long criteria to assess the fit of the longitudinal data using the information from the survival data in Appendix B of the supplementary material.

2 Bayesian Assessment of Model Fit in the Joint Model

2.1 The Joint Models

Suppose that there are n subjects. For the i^th subject, let y_i(t) denote the longitudinal measure, which is observed at time t ∈ {a_i₁, a_i₂, . . . , a_{im_i}}, where 0 ≤ a_i₁ < a_i₂ < · · · < a_{im_i} and m_i > 1. Note that y_i(0) corresponds to the baseline value. Also let x_i and z_i denote two vectors of covariates, which may include the treatment indicator. We assume a mixed effects regression model for y_i(t) given by

y_{i} (a_{i j}) = g {(a_{i j})}^{'} θ_{i} + x_{i}^{'} γ + ϵ_{i} (a_{i j}),

(2.1)

where g(a_ij) denotes a (q+1)-dimensional vector of functions of a_ij for j = 1, . . . , m_i, θ_i denotes a (q+1)-dimensional vector of random effects, and γ denotes a vector of regression coefficients. In (2.1), we further assume θ_i ~ N(θ, Ω), where θ is a (q+1)-dimensional vector of overall effects, Ω is a (q+1) × (q+1) positive definite covariance matrix, ε_i(a_ij) is the measurement error term, which is assumed to follow a N(0, σ²) distribution and is independent of θ_i. We note that in (2.1), if q = 1, g(a_ij) = (1, a_ij)′ and (g(a_ij))′ θ_i represents a linear trajectory; if q = 2, $g (a_{i j}) = {(1, a_{i j}, a_{i j}^{2})}^{'}$ and (g(a_ij))′ θ_i leads to a quadratic trajectory; and if g(a_ij) = (1, B₁(a_ij), . . . , B_q(a_ij))′, where {B_k(·), k = 1, 2, . . . , q} is a q-dimensional basis for spline functions over a finite interval, (g(a_ij))′θ_i represents a spline trajectory considered in Brown et al. (2005).

Let t_i and δ_i denote the failure time and the censoring indicator, respectively, where δ_i = 1 if t_i is a failure time and 0 if t_i is right-censored for the i^th subject. The hazard function for failure time t_i is assumed to have the form

λ (t ∣ λ_{0}, α, β, θ_{i}, g (t), z_{i}) = λ_{0} (t) \exp {h (α, θ_{i}, g (t)) + z_{i}^{'} β},

(2.2)

where λ₀(t) is the baseline hazard function, h(·) is a linear function of g(t) and θ_i with α being a vector of regression coefficients. Note that λ₀, α, and β are the fixed effects parameters pertaining to the survival component of the joint model. When h(α, θ_i, g(t)) = {g(t)′θ_i}α, where α is a scalar, the hazard function (2.2) leads to the TM. When h does not depend on g(t), that is, $h (α, θ_{i}, g (t)) = θ_{i}^{'} α$ , where α is a (q + 1)-dimensional vector, the hazard function specified by (2.2) defines the SPM. Under the TM, g(t)′θ_i acts a time-varying covariate in the survival component while under the SPM, the random effects θ_i are included as q + 1 covariates in the survival component.

2.2 The Likelihood and Posterior

We first introduce some notation. We rewrite (2.1) as

y_{i} = X_{i} {(θ_{i}^{'}, γ^{'})}^{'} + ϵ_{i},

where y_i = (y_i(a_i₁), . . . , y_i(a_{im_i}))′, $X_{i} = {({(g {(a_{i j})}^{'}, x_{i}^{'})}^{'}, j = 1, \dots, m_{i})}^{'}$ , and ε_i = (ε_i(a_i₁), . . . , ε_i(a_{im_i}))′ ~ N(0, σ²I_{m_i}). Then, the probability density function (pdf) of y_i conditional on θ_i is given by

f (y_{i} ∣ γ, σ^{2}, θ_{i}, x_{i}) = \frac{1}{{(2 π σ^{2})}^{\frac{m_{i}}{2}}} \exp {- \frac{1}{2 σ^{2}} {(y_{i} - X_{i} {(θ_{i}^{'}, γ^{'})}^{'})}^{'} (y_{i} - X_{i} {(θ_{i}^{'}, γ^{'})}^{'})},

and the pdf of θ_i is given by

f (θ_{i} ∣ θ, Ω) = \frac{{∣ Ω ∣}^{- \frac{1}{2}}}{{(2 π)}^{\frac{q + 1}{2}}} \exp {- \frac{1}{2} {(θ_{i} - θ)}^{'} Ω^{- 1} (θ_{i} - θ)},

for i = 1, . . . , n. Letting λ be a vector of parameters for the baseline hazard function λ₀(t), we write

f (t_{i} ∣ λ, α, β, θ_{i}, δ_{i}, z_{i}) = {[λ (t_{i} ∣ λ_{0}, α, β, θ_{i}, g (t_{i}), z_{i})]}^{δ_{i}} \exp {- \int_{0}^{t_{i}} λ (u ∣ λ_{0}, α, β, θ_{i}, g (u), z_{i}) d u},

where λ(t|λ₀, α,β,θ_i, g(t), z_i) is defined in (2.2). We note that when δ_i = 1, f(t_i|λ, α, β, θ_i, δ_i = 1, z_i) reduces to the density of t_i, and when δ_i = 0, f(t_i|λ, α, β, θ_i,δ_i = 0, z_i) is the survival function evaluated at t_i.

Let φ = (θ, γ, σ², Ω, λ, α, β). The joint distribution of (y_i, t_i, θ_i) is written as

f (y_{i}, t_{i}, θ_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) = f (t_{i} ∣ λ, α, β, θ_{i}, δ_{i}, z_{i}) f (y_{i} ∣ γ, σ^{2}, θ_{i}, x_{i}) f (θ_{i} ∣ θ, Ω),

(2.3)

and the marginal joint distribution of (y_i, t_i) is given by

f (y_{i}, t_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) = \int f (y_{i}, t_{i}, θ_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) d θ_{i},

(2.4)

for i = 1, . . . , n. Letting D_obs = {(y_i, t_i, θ_i, x_i, z_i), i = 1, . . . , n} denote the observed data, the observed-data likelihood is given by

L (φ ∣ D_{obs}) = \prod_{i - 1}^{n} f (y_{i}, t_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) .

(2.5)

Using (2.5), the joint posterior of φ takes the form

π (φ ∣ D_{obs}) = \frac{L (φ ∣ D_{obs}) π (φ)}{c (D_{obs})},

(2.6)

where π(φ) is the joint prior, which is specified in Appendix A, and the normalizing constant is given by

c (D_{obs}) = \int \prod_{i = 1}^{n} f (y_{i}, t_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) π (φ) d φ .

(2.7)

We write $θ^{R} = {(θ_{1}^{'}, \dots, θ_{n}^{'})}^{'}$ , which is the vector of all the random effects. Then, the augmented posterior distribution of (φ, θ^R) is given by

π (φ, θ^{R} ∣ D_{obs}) = \frac{\prod_{i = 1}^{n} f (y_{i}, t_{i}, θ_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) π (φ)}{c (D_{obs})},

(2.8)

where f(y_i, t_i, θ_i|φ, δ_i, x_i, z_i) is defined in (2.3). It is easy to see that ∫ π(φ, θ^R|D_obs)dθ^R = π(φ|D_obs). The implementation details of the Gibbs sampling algorithm to sample (φ, θ^R) from (2.8) are given in Appendix A.

2.3 Deviance Information Criterion

The Deviance Information Criterion (DIC) (Spiegelhalter et al., 2002) for the joint model is defined as

DIC = Dev (\overset{‒}{φ}) + 2 p_{D},

(2.9)

where Dev(φ) is the deviance function, $p_{D} = \bar{Dev} (φ) - Dev (\overset{‒}{φ})$ is the effective number of model parameters, and $\overset{‒}{φ}$ and $\bar{Dev} (φ)$ are the posterior means of φ and Dev(φ), respectively, with respect to the posterior distribution in (2.6). To assess the overall fit of the joint model, we specify the deviance function as

Dev (φ) = - 2 \log L (φ ∣ D_{obs}),

where L(φ|D_obs) is given by (2.5). From (2.5), we see that Dev(φ) involves the computation of n integrals as shown in (2.4).

The integration over the random effects specified in (2.4) always poses a major challenge in computing the observed-data likelihood of the joint model. One possible approach is to use a Monte Carlo (MC) approach, but this may be computationally intensive. Adaptive Gaussian quadrature (AGQ) (Pinheiro and Bates, 1995) is another approach to approximate (2.4), and is implemented here to calculate DIC when the dimension of θ_i is low.

2.3.1 DIC Decomposition

To assess the contribution of the longitudinal data to the fit of the survival data, we develop a novel decomposition of DIC in (2.9). Specifically, we decompose DIC into two parts: one part for the longitudinal data and the other part for the survival data conditional on the longitudinal data. Write φ₁ = (θ, γ, σ²,Ω) and φ₂ = (λ, α, β). Let f(θ_i|φ₁, y_i, x_i) be the conditional density of the random effects θ_i given y_i, and also let f(y_i|φ₁, x_i) = ∫ f(y_i| γ, σ², θ_i, x_i)f(θ_i|θ, Ω)dθ_i, which is the marginal density of y_i. Let ${\overset{‒}{φ}}_{1}$ and ${\overset{‒}{φ}}_{2}$ denote the posterior means of φ₁ and φ₂. Define ${Dev}_{Long} (\overset{‒}{φ}) = - 2 \sum_{i = 1}^{n} \log f (y_{i} ∣ {\overset{‒}{φ}}_{1} . x_{i})$ , $p_{D [Long]} = E [- 2 \sum_{i = 1}^{n} \log f (y_{i} ∣ φ_{1}, x_{i}) ∣ D_{obs}] + 2 \sum_{i = 1}^{n} \log f (y_{i} ∣ {\overset{‒}{φ}}_{i}, x_{i})$ , ${Dev}_{Surv ∣ Long} (\overset{‒}{φ}) = - 2 \sum_{i = 1}^{n} \log \int f (t_{i} ∣ {\overset{‒}{φ}}_{2}, θ_{i}, δ_{i}, z_{i}) f (θ_{i} ∣ {\overset{‒}{φ}}_{1}, y_{i}, x_{i}) d θ_{i}$ , and $p_{D [Surv ∣ Long]} = E [- 2 \sum_{i = 1}^{n} \log \int f (t_{i} ∣ φ_{2}, θ_{i}, δ_{i}, z_{i}) f (θ_{i} ∣ φ_{1}, y_{i}, x_{i}) d θ_{i} ∣ D_{obs}] + 2 \sum_{i = 1}^{n} \log \int f (t_{i} ∣ {\overset{‒}{φ}}_{2}, θ_{i}, δ_{i}, z_{i}) f (θ_{i} ∣ {\overset{‒}{φ}}_{1}, y_{i}, x_{i}) d θ_{i}$ . We are led to the following result.

Result 1

DIC and p_D in (2.9) have the following decomposition:

\begin{matrix} DIC = {DIC}_{Long} + {DIC}_{Surv ∣ Long}, \\ p_{D} = p_{D [Long]} + p_{D [Surv ∣ Long]}, \end{matrix}

(2.10)

where ${DIC}_{Long} = {Dev}_{Long} (\overset{‒}{φ}) + 2 p_{D [Long]}$ , and ${DIC}_{Surv ∣ Long} = {Dev}_{Surv ∣ Long} (\overset{‒}{φ}) + 2 p_{D [Surv ∣ Long]}$ .

In (2.10), DIC_Long measures the contribution of the longitudinal data to the total DIC while DIC_Surv|Long quantifies the contribution to the total DIC due to the survival data given the additional information from the longitudinal data.

The marginal distribution of y_i follows

y_{i} ∣ φ_{1}, x_{i} \sim N (X_{i} (\begin{matrix} θ \\ γ \end{matrix}), (σ^{2} I_{m_{i}} + X_{i} (\begin{matrix} Ω & 0 \\ 0 & 0 \end{matrix}) X_{i}^{'}))

and the conditional distribution of the random effects θ_i given the longitudinal data takes the form

θ_{i} ∣ φ_{1}, y_{i}, x_{i} \sim N (Ω_{θ_{i}} [\frac{1}{σ^{2}} (I_{q + 1} 0) X_{i}^{'} (y_{i} - X_{i} (\begin{matrix} 0 \\ I_{p} \end{matrix}) γ) + Ω^{- 1} θ], Ω_{θ_{i}}),

where $Ω_{θ_{i}} = {(Ω^{- 1} + \frac{1}{σ^{2}} (I_{q + 1} 0) X_{i}^{'} X_{i} (\begin{matrix} I_{q + 1} \\ 0 \end{matrix}))}^{- 1}$ . These are the quantities needed to apply Result 1.

2.3.2 ΔDIC_Surv

When we fit the survival data alone, i.e., α = 0 in (2.2), the hazard function reduces to $λ (t ∣ λ_{0}, α = 0, β, θ_{i}, z_{i}) = λ_{0} (t) \exp (z_{i}^{'} β)$ and the density for t_i becomes $f_{0} (t_{i} ∣ λ, β, δ_{i}, z_{i}) = {λ_{0} (t_{i}) \exp (z_{i}^{'} β)}^{δ_{i}} \exp [- \exp (z_{i}^{'} β) {\int_{0}^{t_{i}} λ_{0} (u) d u}]$ Write D_{Surv, obs} = {(t_i, δ_i, z_i), i = 1,...,n} and let

{DIC}_{Surv, 0} = {Dev}_{Surv, 0} (\overset{‒}{λ}, \overset{‒}{β}) + 2 p_{D [Surv, 0]},

where ${Dev}_{Surv, 0} (\overset{‒}{λ}, \overset{‒}{β}) = - 2 \sum_{i = 1}^{n} \log f_{0} (t_{i} ∣ \overset{‒}{λ}, \overset{‒}{β}, δ_{i}, z_{i})$ , and $p_{D [Surv, 0]} = E [- 2 \sum_{i = 1}^{n} \log f_{0} (t_{i} ∣ λ, β, δ_{i}, z_{i}) ∣ D_{Surv, obs}] + 2 \sum_{i = 1}^{n} \log f_{0} (t_{i} ∣ \overset{‒}{λ}, \overset{‒}{β}, δ_{i}, z_{i})$ . We now propose the following model assessment criterion:

Δ {DIC}_{Surv} = {DIC}_{Surv, 0} - {DIC}_{Surv ∣ Long} .

(2.11)

In (2.11), ΔDIC_Surv measures the gain of the fit in the survival component due to the longitudinal data with a penalty for the additional parameters in the survival component of the joint model. A model with a large value of ΔDIC_Surv is more preferred. When $2 (p_{D [Surv ∣ Long]} - p_{D [Surv, 0]}) > {Dev}_{Surv, 0} (\overset{‒}{λ}, \overset{‒}{β}) - {Dev}_{Surv ∣ Long} (\overset{‒}{φ})$ , then ΔDIC_Surv < 0. That is, when the penalty for the additional parameters in the survival component outweighs the gain of the fit in the survival component, ΔDIC_Surv can be negative.

2.4 Conditional Predictive Ordinate

2.4.1 CPO Computation

Let $D_{obs}^{(- i)} = {(y_{j}, t_{j}, δ_{j}, x_{j}, z_{i}), j = 1, \dots, i - 1, i + 1, \dots, n}$ denote the observed data with the i^th subject deleted. The Conditional Predictive Ordinate (CPO) (e.g., Geisser and Eddy, 1979; Gelfand et al., 1992; and Gelfand and Dey, 1994) for the i^th subject is defined as

{CPO}_{i} = \int f (y_{i}, t_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) π (φ ∣ D_{obs}^{(- i)}) d φ,

(2.12)

where

π (φ ∣ D_{obs}^{(- i)}) = \frac{\prod_{j \neq i} f (y_{j}, t_{j} ∣ φ, δ_{j}, x_{j}, z_{j}) π (φ)}{c (D_{obs}^{(- i)})},

(2.13)

and $c (D_{obs}^{(- i)})$ is the normalizing constant, i.e., $c (D_{obs}^{(- i)}) = \int \prod_{j \neq i} f (y_{j}, t_{j} ∣ φ, δ_{j}, x_{j}, z_{j}) π (φ) d φ$ . Following Chen et al. (2000), we obtain the first CPO identity.

CPO Identity I

CPO_i in (2.12) can be rewritten as

{CPO}_{i} = \frac{1}{\int \frac{1}{f (y_{i}, t_{i} ∣ φ, δ_{i}, x_{i}, z_{i})} π (φ ∣ D_{obs}) d φ} .

(2.14)

The proof of this identity directly follows from Chapter 10 of Chen et al. (2000). CPO Identity I leads to the development of a popular Monte Carlo estimate of CPO using Gibbs samples from the posterior distribution given D_obs instead of $D_{obs}^{(- i)}$ . Letting {φ_b, b = 1, . . . , B} denote a Gibbs sample of φ from π(φ|D_obs) and using (2.14), a Monte Carlo estimate of ${CPO}_{i}^{- 1}$ is given by

{\hat{CPO}}_{i}^{- 1} = \frac{1}{B} \sum_{b = 1}^{B} \frac{1}{f (y_{i}, t_{i} ∣ φ_{b}, δ_{i}, x_{i}, z_{i})} .

(2.15)

The numerical approximation of ${CPO}_{i}^{- 1}$ in (2.15) involves the integral over the random effects and can be calculated using AGQ to approximate (2.4). However, this method would likely be computationally intensive when the dimension of the random effects is high. To circumvent this numerical integration issue in (2.15), we develop a second CPO identity and then propose a new efficient MC method which directly uses the Gibbs samples generated from the augmented posterior distribution π(φ, θ^R|D_obs) in (2.8) to calculate ${CPO}_{i}^{- 1}$ .

CPO Identity II

Let w_i(θ_i) be a normalized weight function such that ∫ w_i(θ_i)dθ_i = 1. Then, CPO_i in (2.12) can be expressed as

{CPO}_{i} = \frac{1}{\int \frac{w_{i} (θ_{i})}{f (y_{i}, t_{i}, θ_{i} ∣ φ, δ_{i}, x_{i}, z_{i})} π (φ, θ^{R} ∣ D_{obs}) d θ^{R} d φ} .

(2.16)

Now, let ${(φ_{b}, θ_{b}^{R}), b = 1, \dots, B}$ denote a Gibbs sample of (φ, θ^R) from π(φ, θ^R|D_obs). Using the CPO Identity II in (2.16), a Monte Carlo estimate of ${CPO}_{i}^{- 1}$ is given by

{\hat{CPO}}_{i}^{- 1} = \frac{1}{B} \sum_{b = 1}^{B} \frac{w_{i} (θ_{i b})}{f (y_{i}, t_{i}, θ_{i b} ∣ φ_{b}, δ_{i}, x_{i}, z_{i})} .

Under certain ergodic conditions, ${\hat{CPO}}_{i}^{- 1}$ is unbiased and consistent for any normalized weight function w_i. However, the Monte Carlo error of ${\hat{CPO}}_{i}^{- 1}$ depends on the choice of w_i. The following theorem characterizes the optimal choice of w_i in minimizing the variance of the Monte Carlo estimator ${\hat{CPO}}_{i}^{- 1}$ when ${(φ_{b}, θ_{b}^{R}), b = 1, \dots, B}$ is a sample from π(φ, θ^R|D_obs).

Theorem 1

Let

w_{i, opt} (θ_{i}) = \frac{f (y_{i}, t_{i}, θ_{i} ∣ φ, δ_{i}, x_{i}, z_{i})}{f (y_{i}, t_{i} ∣ φ, δ_{i}, x_{i}, z_{i})} .

Then, for any normalized weight function w_i, we have

Var (\frac{w_{i, opt} (θ_{i})}{f (y_{i}, t_{i}, θ_{i} ∣ φ, δ_{i}, x_{i}, z_{i})} ∣ D_{obs}) \leq Var (\frac{w_{i} (θ_{i})}{f (y_{i}, t_{i}, θ_{i} ∣ φ, δ_{i}, x_{i}, z_{i})} ∣ D_{obs}),

where the variance is taken with respect to the posterior distribution π(φ, θ^R|D_obs).

Remark 1

The result established in Theorem 1 provides the best choice of w_i. However, this optimal weight function is expensive to compute. Since the optimal weight function w_i,_opt is analogous to the optimal weight function in the importance-weighted marginal density estimation (IWMDE) of the marginal posterior density proposed by Chen (1994), we may follow the guidelines discussed in Geweke (1989) and Chen (1994) to construct a good weight function w_i which is similar to w_i,_opt. One possible choice of w_i is a multivariate normal density, which is constructed via the Laplace approximation to the joint density f(y_i, t_i, θ_i|φ, δ_i, x_i, z_i) in (2.3). Another possible choice of w_i is w_i,_cond(θ_i) = f(θ_i|φ₁, y_i, x_i), which is the conditional density of the random effects θ_i given y_i. Note that when y_i and t_i are independent, w_i,_cond(θ_i) = w_i,_opt. Therefore, w_i,_cond(θ_i) may be a reasonable choice for computing the CPO_i.

2.4.2 CPO Decomposition

In this subsection, we first establish the third CPO identity which will lead to the decomposition of CPO.

CPO Identity III

The CPO in (2.12) can also be expressed as

{CPO}_{i} = \frac{c (D_{obs})}{c (D_{obs}^{(- i)})} = \frac{f (y_{i}, t_{i} ∣ φ, δ_{i}, x_{i}, z_{i}) π (φ ∣ D_{obs}^{(- i)})}{π (φ ∣ D_{obs})},

(2.17)

which is true for all φ.

Since plugging in any numerical value for φ in (2.17) results in the CPO, we have

{CPO}_{i} = \frac{f (y_{i}, t_{i} ∣ φ^{*}, δ_{i}, x_{i}, z_{i}) π (φ^{*} ∣ D_{obs}^{(- i)})}{π (φ^{*} ∣ D_{obs})},

(2.18)

where φ* is a fixed value of φ, which may be chosen as the posterior mean. We note that (2.18) is similar to the identity of Chib (1995). Let $φ_{1}^{*}$ and $φ_{2}^{*}$ denote the posterior means of φ₁ and φ₂. From (2.4) and (C.1), we have

f (y_{i}, t_{i} ∣ φ^{*}, δ_{i}, x_{i}, z_{i}) = f (y_{i} ∣ φ_{1}^{*}, x_{i}) f (t_{i} ∣ φ_{2}^{*}, φ_{1}^{*}, δ_{i}, y_{i}, x_{i}, z_{i}),

where $f (t_{i} ∣ φ_{2}^{*}, φ_{1}^{*}, δ_{i}, y_{i}, x_{i}, z_{i}) = \int f (t_{i} ∣ φ_{2}^{*}, θ_{i}, δ_{i}, z_{i}) f (θ_{i} ∣ φ_{1}^{*}, y_{i}, x_{i}) d θ_{i}$ . We also observe that

\begin{matrix} π (φ^{*} ∣ D_{obs}^{(- i)}) = π (φ_{i}^{*} ∣ D_{obs}^{(- i)}) π (φ_{2}^{*} ∣ φ_{1}^{*}, D_{obs}^{(- i)}), \\ π (φ^{*} ∣ D_{obs}) = π (φ_{1}^{*} ∣ D_{obs}) π (φ_{2}^{*} ∣ φ_{1}^{*}, D_{obs}) . \end{matrix}

(2.19)

Using (2.18) and the facts of the joint densities stated above, we propose the CPO decomposition:

{CPO}_{i} = {CPO}_{i, Long} \cdot {CPO}_{i, Surv ∣ Long},

(2.20)

where

{CPO}_{i, Long} = \frac{f (y_{i} ∣ φ_{i}^{*}, x_{i}) π (φ_{1}^{*} ∣ D_{obs}^{(- i)})}{π (φ_{1}^{*} ∣ D_{obs})},

(2.21)

and

{CPO}_{i, Surv ∣ Long} = \frac{f (t_{i} ∣ φ_{2}^{*}, φ_{1}^{*}, δ_{i}, y_{i}, z_{i}) π (φ_{2}^{*} ∣ φ_{1}^{*}, D_{obs}^{(- i)})}{π (φ_{2}^{*} ∣ φ_{1}^{*}, D_{obs})} .

(2.22)

Remark 2

Let D_Long,obs = {(y_i, x_i), i = 1, . . . , n} denote the observed longitudinal data and D_Surv,obs = {(t_i, δ_i, z_i), i = 1, . . . , n} denote the survival data, respectively. Also let $D_{Long, obs}^{(- i)} = {(y_{i}, x_{i}), j = 1, \dots, i - 1, i + 1, \dots, n}$ and $D_{Surv, obs}^{(- i)} = {(t_{i}, δ_{i}, z_{i}), j = 1, \dots, i - 1, i + 1, \dots, n}$ denote the observed longitudinal and survival data with the i^th subject deleted, respectively. Assume that D_Long,obs and D_Surv,obs are independent and π(φ₁, φ₂) = π(φ₁)π(φ₂). Under these assumptions, we have

{CPO}_{i, Long} = {CPO}_{i, Long alone} = \int f (y_{i} ∣ φ_{i}, x_{i}) π (φ_{i} ∣ D_{Long, obs}^{(- i)}) d φ_{1},

(2.23)

{CPO}_{i, Surv ∣ Long} = {CPO}_{i, Surv 0} = \int f_{0} (t_{i} ∣ φ_{2}, δ_{i}, z_{i}) π (φ_{2} ∣ D_{Surv, obs}^{(- i)}) d φ_{2},

(2.24)

and

{CPO}_{i} = \int f (y_{i} ∣ φ_{1}, z_{i}) π (φ_{1} ∣ D_{Long, obs}^{(- i)}) d φ_{1} \times \int f_{0} (t_{i} ∣ φ_{2}, δ_{i}, z_{i}) π (φ_{2} ∣ D_{Surv, obs}^{(- i)}) d φ_{2},

where

\begin{matrix} π (φ_{1} ∣ D_{Long, obs}^{(- i)}) = \frac{\prod_{j \neq i} f (y_{j} ∣ φ_{1}, x_{j}) π (φ_{1})}{\int \prod_{j \neq i} f (y_{j} ∣ φ_{1}, x_{j}) π (φ_{1}) d φ_{1}}, \\ π (φ_{2} ∣ D_{Surv, obs}^{(- i)}) = \frac{\prod_{j \neq i} f_{0} (t_{j} ∣ φ_{2}, δ_{j}, z_{j}) π (φ_{2})}{\int \prod_{j \neq i} f_{0} (t_{j} ∣ φ_{2}, δ_{j}, z_{j}) π (φ_{2}) d φ_{2}}, \end{matrix}

and f₀(t_j|φ₂, δ_j, z_j) is defined in Section 2.3.2 with φ₂ = (λ α = 0, β). Therefore, CPO_i,_Long and CPO_i,_Surv|Long reduce to the usual CPOs for the longitudinal data and the survival data separately, and the CPO decomposition (2.20) holds under the usual definition of CPO.

Next, we develop useful in the following theorem for $\frac{π (φ_{1}^{*} ∣ D_{obs}^{(- i)})}{\partial (φ_{1}^{*} ∣ D_{obs})}$ , CPO_i,_Long, and CPO_i,_Surv|Long, which facilitate the computation and further understanding of these quantities.

Theorem 2

For CPO_i, CPO_i,_Long, and CPO_i,_Surv|Long, we have the following identities:

\frac{π (φ_{1}^{*} ∣ D_{obs}^{(- i)})}{π (φ_{1}^{*} ∣ D_{obs})} = {CPO}_{i} \int \frac{1}{f (y_{i}, t_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})} π (φ_{2} ∣ φ_{1}^{*}, D_{obs}) d φ_{2},

{CPO}_{i, Long} = {CPO}_{i} \int \frac{1}{f (t_{i} ∣ φ_{2}, φ_{1}^{*}, δ_{i}, y_{i}, x_{i}, z_{i})} π (φ_{2} ∣ φ_{1}^{*}, D_{obs}) d φ_{2},

(2.25)

and

{CPO}_{i, Surv ∣ Long} = \frac{1}{\int \frac{1}{f (t_{i} ∣ φ_{2}, φ_{1}^{*}, δ_{i}, y_{i}, x_{i}, z_{i})} π (φ_{2} ∣ φ_{1}^{*}, D_{obs}) d φ_{2}} .

(2.26)

Remark 3

The identity in (2.26) is quite attractive as it has a similar form as the usual CPO_i in (2.14). We also see from (2.26) that CPO_i,_Surv|Long is free of $φ_{2}^{*}$ . In addition, CPO_i,_Surv|Long can be directly calculated from (2.26). Thus, if only CPO_i,_Surv|Long is of interest, it is not necessary to compute the overall CPO_i. However, it does not appear possible that CPO_i,_Long can be computed directly without knowing CPO_i and CPO_i,_Surv|Long.

Remark 4

To avoid the calculation of $f (y_{i}, t_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})$ , we use the same idea as in (2.16) and obtain

\frac{π (φ_{1}^{*} ∣ D_{obs}^{(- i)})}{π (φ_{1}^{*} ∣ D_{obs})} = {CPO}_{i} \int \frac{w_{i} (θ_{i})}{f (y_{i}, t_{i}, θ_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})} π (φ_{2}, θ^{R} ∣ φ_{1}^{*}, D_{obs}) d θ^{R} d φ_{2},

where the optimal choice of w_i(θ_i) is $\frac{f (y_{i}, t_{i}, θ_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})}{f (y_{i}, t_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})}$ . Similarly,

{CPO}_{i, Surv ∣ Long} = \frac{1}{f (y_{i} ∣ φ_{1}^{*}, x_{i}) \int \frac{w_{i} (θ_{i})}{f (y_{i}, t_{i}, θ_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})} π (φ_{2}, θ^{R} ∣ φ_{1}^{*}, D_{obs}) d θ^{R} d φ_{2}},

where the optimal choice of w_i(θ_i) is $\frac{f (y_{i}, t_{i}, θ_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})}{f (y_{i}, t_{i} ∣ φ_{1}^{*}, φ_{2}, δ_{i}, x_{i}, z_{i})}$ .

2.4.3 LPML and LPML Decomposition

The logarithm of the Pseudo marginal likelihood (LPML) (Ibrahim et al., 2001) is defined as

LPML = \sum_{i = 1}^{n} \log ({CPO}_{i}) .

We note that there is a relationship between the DIC and the LPML in large samples (see Draper and Krnjajić (2005, Section 4)). Using the decomposition of CPO in (2.20), we are led to the following result.

Result 2

LPML can be decomposed as

LPML = {LPML}_{Long} + {LPML}_{Surv ∣ Long},

where ${LPML}_{Long} = \sum_{i = 1}^{n} \log {CPO}_{i, Long}$ , ${LPML}_{Surv ∣ Long} = \sum_{i = 1}^{n} \log {CPO}_{i, Surv ∣ Long}$ and CPO_i,_Long and CPO_i,_Surv|Long are given by (2.21) and (2.22), respectively.

2.4.4 ΔLPML_Surv

Define ${LPML}_{Surv, 0} = \sum_{i = 1}^{n} \log ({CPO}_{i, Surv 0})$ , where CPO_i,_Surv0 is given by (2.24). We propose the model assessment criterion

Δ {LPML}_{Surv} = {LPML}_{Surv ∣ Long} - {LPML}_{Surv, 0} .

ΔLPML_Surv quantifies the gain of the fit in the survival component due to the longitudinal data with a penalty for the additional parameters in the survival component of the joint model. A model with a large value of ΔLPML_Surv is more preferred. From Remark 3, it is easy to see that if our interest is on ΔLPML_Surv only, we do not need to compute the overall LPML for the joint model. Similar to ΔDIC_Surv, it is not guaranteed that ΔLPML_Surv is non-negative.

3 A Simulation Study

We conduct a simulation study to evaluate the empirical performance of ΔDIC_Surv and ΔLPML_Surv in selecting the true model or identifying the true longitudinal data. We generate longitudinal and survival data under the SPM with the simple exponential baseline. Specifically, we first simulate θ_i = (θ₀_i, θ₁_i)′ N(θ, Ω), where θ = (θ₀, θ₁)′ = (0.1, 0.5)′ and $Ω = (\begin{matrix} Ω_{00} & Ω_{01} \\ Ω_{10} & Ω_{11} \end{matrix}) = (\begin{matrix} 0.7 & - 0.1 \\ - 0.1 & 0.06 \end{matrix})$ . We then simulate the longitudinal data from a N(μ_i(a_ij), σ²) distribution with a linear trajectory μ_i(a_ij) = θ₀_i + a_ij θ₁_i + x_iγ. For the survival data, we set z_i = x_i and generate t* from an exponential regression model, i.e., $t_{i}^{*} {[- λ \exp {θ_{0 i} α_{1} + θ_{1 i} α_{2} + z_{i} β}]}^{- 1} \log (1 - U)$ , where U ~ U(0, 1), and draw the censoring times C_i from an exponential distribution with mean 10. Then, we compute $t_{i} = \min {t_{i}^{*}, C_{i}}$ and δ_i = 1 if $t_{i}^{*} \leq C_{i}$ and 0 otherwise. The treatment indicator x_i is generated from a Bernoulli(0.5) distribution. For each subject, 6 or 7 time points (a_ij, j = 1, . . . , 6 or 7) for the longitudinal measures are chosen to be (0 + ζ_i₁, 21 + ζ_i₂, 42 + ζ_i₃, 63 + ζ_i₄, 84 + ζ_i₅, 105 + ζ_i₆)/30.4375 if ζ_i₇ > 0 and (0 + ζ_i₁, 21 + ζ_i₂, 42 + ζ_i₃, 63 + ζ_i₄, 84 + ζ_i₅, 105 + ζ_i₆, 126 + ζ_i₇)/30.4375 if ζ_i₇ ≤ 0, where ζ_ij ~ U(−3, 3) for j = 1, . . . , 7, and 30.4375 = 365.25/12. The design values of the parameters are given as Ω₀₀ = 0.7, Ω₁₀ = Ω₀₁ = −0.1, Ω₁₁ = 0.06, δ² = 0.3, θ₀ = 0.1, θ₁ = 0.5, γ = −0.2, α₁ = 0.3, α₂ = 1.6, β= −0.4, and λ = 0.08. 500 datasets are simulated independently with n = 400 subjects in each simulated dataset. The resulting censoring percentage is about 40%.

Let D_T denote the dataset generated from the true SPM model. One additional set of longitudinal data is generated by adding noise to the true longitudinal measures. More specifically, it is simulated from a N(μ_ℓ_i(a_ij), δ²) distribution with linear trajectories μ_ℓ_i(a_ij) = (θ₀_i + τ_ℓ₀_i) + a_ij(θ₁_i + τ_ℓ₁_i) + x_iγ, where (τ_ℓ0_i, τ_ℓ1_i)′ ~ N(0, 0.2²I₂), and the values of the other parameters remain the same as before. By combining this longitudinal dataset with the same survival data in D_T, we obtain the additional dataset and denote it as D_W.

We consider the following scenarios to fit different joint models to the datasets D_T and D_W:

TRUE: Fit the true joint model to D_T . In the true joint model, (2.1) becomes
$y_{i} (a_{i j}) = θ_{0 i} + a_{i j} θ_{1 i} + x_{i} γ + ϵ_{i} (a_{i j}),$ (3.1)
and (2.2) becomes
$λ \exp {θ_{0 i} α_{1} + θ_{1 i} α_{2} + z_{i} β} .$ (3.2)
Long: Fit the joint model with (3.1) and (3.2) to D_W. In this case, D_W is fit by the joint model with misspecified longitudinal submodel.
SurvI: Fit the joint model with (3.1) and misspecified survival submodel to D_T. In this joint model, (3.2) reduces to λexp{θ₀_i α₁ + z_i β}.
SurvII: Fit the joint model with (3.1) and misspecified survival submodel to D_T. In this joint model, (3.2) reduces to λexp{θ₁_i α₂ + z_i β}.
TM: Fit the joint model with (3.1) and misspecified survival submodel to D_T. In this joint model, (3.2) becomes λexp{(θ₀_i + θ₁_it)α + z_i β}.
Long&Surv: Fit the joint model with misspecified longitudinal and survival submodels to D_T. In this joint model, (3.1) becomes y_i(a_ij) = θ₀_i + x_iγ + ε_i(a_ij), and (3.2) reduces to γexp{θ₀_i α₁ + z_i β}.

In all the six scenarios, the exponential regression model, namely, λexp{z_i β}, fits the true survival data D_T in computing DIC_{Surv, 0} and LPML_{Surv, 0}. Thus, the values of DIC_{Surv, 0} and LPML_{Surv, 0} are the same for all of the six scenarios. Since ΔDIC_Surv = DIC_{Surv, 0} − DIC_Surv|Long and ΔLPML_Surv = LPML_Surv|Long − LPML_{Surv, 0}, ΔDIC_Surv and ΔLPML_Surv can be used to assess the fit of the survival component of the joint model for all of the six scenarios. We also note that in scenario (ii), both components of the joint model are correctly specified but fit to the longitudinal data, which are less correlated to the survival data; in scenarios (iii), (iv), and (v), the longitudinal component is correctly specified, the survival component is misspecified, and both components fit the true longitudinal and survival data; and in scenario (vi), both components of the joint model are misspecified but fit the true data.

For each simulated dataset, we take 5000 Gibbs samples with 100 burn-in iterations. The means of ΔDIC_Surv and ΔLPML_Surv as well as the frequencies of ranking each model as best based on ΔDIC_Surv and ΔLPML_Surv are reported in Table 1. From this table, we see that True has the largest means of ΔDIC_Surv and ΔLPML_Surv, which are 18.72 and 9.37, and gets ranked as the best with 423 times out of 500 by both criteria, while SurvI has the smallest means of ΔDIC_Surv and ΔLPML_Surv and never gets ranked as the best by these two criteria in these 500 simulated datasets. These results show that both ΔDIC_Surv and ΔLPML_Surv can correctly identify the true model or the true data.

Table 1.

Means of ΔDIC_Surv and ΔLPML_Surv and frequencies of ranking each model as best based on ΔDIC_Surv and ΔLPML_Surv

	ΔDIC_Surv		ΔLPML_Surv
Data	Mean	Frequency	Mean	Frequency
True	18.72	423	9.37	423
Long	10.67	27	5.35	28
SurvI	1.31	0	0.63	0
SurvII	10.54	19	5.23	22
TM	4.09	8	1.92	6
Long&Surv	10.59	23	5.30	21

Open in a new tab

Figure 1 shows boxplots of the ΔDIC_Surv and ΔLPML_Surv differences between True and each of Long, SurvI, SurvII, TM, and Long&Surv. We see from this figure that boxplots for ΔDIC_Surv and ΔLPML_Surv differences are almost above zero, indicating that the true model does fit the true data much better than other models based on either ΔDIC_Surv or ΔLPML_Surv. These results are consistent with those based on the means of ΔDIC_Surv and ΔLPML_Surv and the frequencies of ranking each model as best as shown in Table 1.

Boxplots of the ΔDIC_Surv differences and the ΔLPML_Surv differences between True and each of Long, SurvI, SurvII, TM, and Long&Surv.

4 Analysis of the EMPHACIS Data

We consider a dataset from a multicenter, randomized, single-blind, EMPHACIS lung cancer clinical trial (Evaluation of MTA in Mesothelioma in a Phase III Study with Cisplatin) (Vogelzang et al., 2003). The study drug was multi-targeted antifolate (MTA) pemetrexed given in combination with cisplatin (the PEM/Cis arm), and the active-treatment comparator was cisplatin alone (the Cis arm). The treatment for both arms was structured as six 21-day cycles of therapy; patients receiving treatment benefit could receive additional cycles based on investigator discretion. Malignant pleural mesothelioma is characterized by rapid disease progression, high symptom burden, and a relatively short median survival of 12 months after diagnosis (Thompson et al., 2014). Accordingly, patient-reported assessments are important for evaluation of disease progression and patients’ response to therapy. In oncology, the patients’ importance ratings on the magnitude of progression-free survival improvement has been shown to depend on the severity of disease-related symptoms (Bridges et al., 2012). We analyzed the disease-specific patient-reported Lung Cancer Symptom Scales (LCSS) (Patricia et al., 2006) to evaluate the patient-level association of five of the six instrument items (i.e., anorexia, cough, dyspnea, fatigue, and pain) with progression-free survival using the EMPHACIS trial data. Progression free survival time (PFS) is defined as the time from randomization to the time until documented progression or death from any cause. We are interested in the association between post-baseline LCSS measurements and PFS. The main goal of applying joint models in this study is to assess the association of each longitudinal LCSS symptom with PFS and the treatment effects on each LCSS item and PFS simultaneously. More importantly, with the decomposition of DIC and LPML, the longitudinal LCSS symptoms can be compared in terms of their contribution to the fit of the PFS data as well as the gain in the fit of the longitudinal data for each LCSS symptom using the information from the PFS data can be determined.

Our study cohort consists of 425 patients with at least one post-baseline value of each longitudinal measure and seven binary covariates, including race (x_i₁ = 1 if white), gender (x_i₂ = 1 if male), age (x_i₃ = 1 if age ≥ 65), Karnofsky status (x_i₄ = 1 if Karnofsky status is high), baseline stage of disease (x_i₅ = 1 if stage I/II), vitamin supplementation (x_i₆ = 1 if full vitamin supplementation), and treatment assignment (x_i₇ = 1 if the i^th patient is in the pemetrexed/cisplatin arm). Among the 425 patients, 394 patients experienced disease progression. Among these 394 patients, there were only 129 distinct disease progression times. In all the computations, we used z_i = x_i and standardized these five LCSS measures, where the means and standard deviations were 30.79 and 27.19, 11.48 and 17.93, 31.41 and 26.33, 39.38 and 27.06, and 24.64 and 24.90 for anorexia, cough, dyspnea, fatigue, and pain, respectively. The total numbers of longitudinal measures (i.e., $\sum_{i = 1}^{n} m_{i}$ ) including the baseline measures were 5504, 5544, 5553, 5530, and 5546 for anorexia, cough, dyspnea, fatigue, and pain.

In (2.2), we assume a piecewise constant hazard function for λ₀(t) defined as

λ_{0} (t) = λ_{k}, t \in (s_{k - 1}, s_{k}] for k = 1, \dots, K,

(4.1)

where 0 = s₀ < s₁ < s₂ < . . . < s_K−1 < s_K = ∞ is a finite partition of the time axis. The s_k's in (4.1) were constructed based on the percentiles such as the first (Q₁), second (Q₂), and third (Q₃) quartiles of the PFS times. Let D_anorexia, D_cough, D_dyspnea, D_fatigue, and D_pain denote the five observed LCSS longitudinal datasets and also let D_Surv denote the observed PFS data. We fit the shared parameter model and the trajectory model with a linear trajectory, denoted by SPML and TML, respectively, to each pair of the PFS data and one of the five LCSS longitudinal outcomes corresponding to anorexia, cough, dyspnea, fatigue, and pain, namely, D_anorexia + D_Surv, D_cough + D_Surv, D_dyspnea + D_Surv, D_fatigue + D_Surv, and D_pain + D_Surv. The prior π(g=f) in (2.6) is specified in Appendix A of the supplementary material. For TML, we specify a N(0, 10000) prior distribution for α.

To construct the partition {s_k, k = 0, 1, . . . , K}, we adopt the left bi-sectional quantile partition (LBSQP) method proposed in Zhang et al. (2015b). We use DIC_{Surv, 0} and LPML_{Surv, 0} to determine the number of intervals (K) in (4.1). We start with a large value of K, which is close to the number of distinct PFS times, and work down to a smaller value of K. For the EMPHACIS data, K = 100 should be sufficiently large given that there were only 129 distinct PFS times. We determine an “optimal” value of K according to DIC_{Surv, 0} and LPML_{Surv, 0} by fitting the PFS data alone. Table S1 of the supplementary material shows the results for various values of K. From Table S1, we see that the respective values of DIC_{Surv, 0} and LPML_{Surv, 0} were 2070.61 and −1070.94 for K = 100; 2022.56 and −1012.62 for K = 35; 2018.49 and −1010.07 for K = 30; 2026.85 and −1014.27 for K = 25; and 2206.05 and −1103.10 for K = 2. Thus, the piecewise constant baseline hazard function with K = 30 fit the PFS data alone best according to both DIC_{Surv, 0} and LPML_{Surv, 0}. We then fit each of the LCSS longitudinal and PFS data, D_anorexia + D_Surv, D_cough + D_Surv, D_dyspnea + D_Surv, D_fatigue + D_Surv, and D_pain + D_Surv, with the “best” value of K = 30 in fitting the PFS data alone along with K = 25 and K = 35. We used the Laplace approximation to construct a multivariate normal density for w_i in computing LPML (MC), LPML_Surv|Long (MC), and ΔLPML_Surv (MC). Table 2 shows DIC, DIC_Surv|Long, ΔDIC_Surv, LPML, LPML_Surv|Long, and ΔLPML_Surv using the proposed MC method for each of the five PROs for K = 25, 30, and 35 under SPML and TML, respectively. The values of p_D and p_D_[Surv|Long] associated with DIC and DIC_Surv|Long are given in Table S2 of the supplemental material. Table S3 of the supplemental material shows LPML, LPML_Surv|Long, and ΔLPML_Surv using the AGQ approach. We see from Table 2 and Table S3 that LPML (MC), LPML_Surv|Long (MC), and ΔLPML_Surv (MC) are very close to LPML (GQ), LPML_Surv|Long (GQ), and ΔLPML_Surv (GQ). We also see from Table 2 that (a) according to DIC_Surv|Long and LPML_Surv|Long, the joint model with K = 30 fit the longitudinal and survival data better than those models with K = 25 and K = 35 under both SPML and TML; and (b) according to DIC and LPML, SPML fit D_anorexia + D_Surv, D_dyspnea + D_Surv, D_fatigue + D_Surv, and D_pain + D_Surv better than TML except for D_cough + D_Surv. Among the five PROs, pain had the largest values of ΔDIC_Surv and ΔLPML_Surv while cough had the smallest values of ΔDIC_Surv and ΔLPML_Surv under both SPML and TML. These results indicate that pain led to the most gain in fitting the PFS data while cough had the least contribution to the fit of the PFS data. We mention here that the overall DIC and LPML were not able to determine the contribution of the longitudinal data in fitting the survival data for these five LCSS longitudinal measures under the joint modeling framework. From Table 2, we observe that the smallest DIC_Long (or largest LPML_Long) value was the main reason for dyspnea having the smallest DIC (largest LPML) value, which had no implication on the contribution of the LCSS data to the fit of the PFS data. In addition, DIC and LPML were not directly comparable among these five PROs since the total numbers of longitudinal measures were different.

Table 2.

The Decompositions of DIC and LPML for five PROs under SPML and TML with different K

K	Model		Anorexia	Cough	Dyspnea	Fatigue	Pain
25	SPML	DIC	14022.37	14271.39	11920.73	13001.30	12843.70
		DIC_Surv\|Long	2004.52	2022.77	2007.75	1995.93	1975.15
		ΔDIC_Surv	22.33	4.08	19.10	30.91	51.70

		LPML	−7015.02	−7145.62	−5965.58	−6504.33	−6428.93
		LPML_Surv\|Long	−1003.19	−1012.15	−1004.82	−998.63	−988.28
		ΔLPML_Surv	11.08	2.12	9.45	15.63	25.99

	TML	DIC	14024.40	14269.30	11927.25	13007.29	12858.02
		DIC_Surv\|Long	2006.64	2020.66	2015.58	2001.91	1990.13
		ΔDIC_Surv	20.20	6.19	11.27	24.93	36.71

		LPML	−7016.13	−7144.96	−5968.82	−6507.33	−6436.08
		LPML_Surv\|Long	−1004.31	−1011.09	−1008.75	−1001.89	−995.99
		ΔLPML_Surv	9.96	3.17	5.51	12.38	18.27

30	SPML	DIC	14014.67	14262.97	11911.96	12993.00	12835.64
		DIC_Surv\|Long	1996.62	2014.65	1999.37	1987.86	1967.00
		ΔDIC_Surv	21.86	3.84	19.11	30.62	51.49

		LPML	−7011.19	−7141.36	−5960.94	−6500.14	−6424.85
		LPML_Surv\|Long	−999.00	−1008.05	−1000.58	−994.65	−984.21
		ΔLPML_Surv	11.07	2.02	9.48	15.41	25.86

	TML	DIC	14016.65	14260.43	11919.21	12999.26	12849.74
		DIC_Surv\|Long	1998.91	2012.10	2007.31	1994.24	1982.24
		ΔDIC_Surv	19.58	6.39	11.18	24.25	36.24

		LPML	−7012.14	−7140.22	−5964.61	−6503.19	−6431.90
		LPML_Surv\|Long	−1000.25	−1006.79	−1004.63	−997.96	−991.98
		ΔLPML_Surv	9.82	3.28	5.44	12.11	18.09

35	SPML	DIC	14018.84	14267.09	11914.40	12997.21	12839.63
		DIC_Surv\|Long	2000.43	2018.43	2002.76	1991.57	1970.54
		ΔDIC_Surv	22.13	4.13	19.81	31.00	52.02

		LPML	−7013.76	−7143.93	−5962.76	−6502.81	−6427.37
		LPML_Surv\|Long	−1001.53	−1010.42	−1002.91	−997.12	−986.54
		ΔLPML_Surv	11.09	2.20	9.71	15.50	26.07

	TML	DIC	14019.89	14264.71	11923.25	13002.92	12853.22
		DIC_Surv\|Long	2002.30	2015.98	2011.28	1998.05	1986.04
		ΔDIC_Surv	20.27	6.58	11.28	24.52	36.52

		LPML	−7014.31	−7143.10	−5967.13	−6505.60	−6434.11
		LPML_Surv\|Long	−1002.67	−1009.37	−1007.04	−1000.34	−994.39
		ΔLPML_Surv	9.95	3.25	5.58	12.28	18.23

Open in a new tab

Tables 3 and 4 show the posterior estimates and 95% highest posterior density (HPD) intervals of the hazard ratio (HR) of the overall treatment effect on PFS (β₁) and the estimates (Est) of the regression coefficients α associated with the random effects under SPML and TML with K = 30, respectively. We observe that except for dyspnea under SPML, the HRs under the joint model (ranging from 0.614 to 0.634 under SPML and ranging from 0.608 to 0.636 under TML) were smaller than or close to the HR of 0.638 when fitting the PFS data alone.

Table 3.

Parameter estimates under SPML with K = 30

	β ₁		α ₁		α ₂
PRO	HR	95% HPD Int.	Est	95% HPD Int.	Est	95% HPD Int.
Anorexia	0.614	(0.495, 0.756)	0.365	(0.202, 0.530)	1.178	(0.449, 1.893)
Cough	0.634	(0.516, 0.777)	0.200	(0.060, 0.343)	0.608	(−0.060, 1.230)
Dyspnea	0.641	(0.522, 0.790)	0.203	(0.068, 0.343)	1.412	(0.770, 2.069)
Fatigue	0.620	(0.498, 0.765)	0.367	(0.205, 0.534)	1.437	(0.706, 2.176)
Pain	0.622	(0.502, 0.776)	0.349	(0.206, 0.489)	1.938	(1.354, 2.537)

Open in a new tab

When fitting the PFS data alone, the estimate and 95% HPD interval of exp(β₁) are 0.638 and (0.526, 0.785).

Table 4.

Parameter estimates under TML with K = 30

	β ₁		α
PRO	HR	95% HPD Int.	Est	95% HPD Int.
Anorexia	0.620	(0.501, 0.760)	0.320	(0.186, 0.455)
Cough	0.636	(0.520, 0.782)	0.192	(0.064, 0.318)
Dyspnea	0.631	(0.518, 0.776)	0.223	(0.098, 0.340)
Fatigue	0.620	(0.501, 0.759)	0.343	(0.215, 0.478)
Pain	0.608	(0.491, 0.751)	0.391	(0.273, 0.515)

Open in a new tab

We used the overlapping batch statistics approach with a batch size of 2000 (Meketon and Schmeiser, 1984; and Chen et al., 2000, Section 3.3) to compute the Monte Carlo (MC) standard errors of DIC_Surv|Long, ΔDIC_Surv, LPML_Surv|Long, and ΔLPML_Surv under SPML and TML. The results are reported in Table 5. From this table, we see that (i) the MC standard errors ranged from 0.074 to 0.620 for all of DIC_Surv|Long, ΔDIC_Surv, LPML_Surv|Long, and ΔLPML_Surv, which were reasonably small compared to the magnitudes of their estimated values; and (ii) the MC standard errors of LPML_Surv|Long (GQ) and LPML_Surv|Long (MC), and ΔLPML_Surv (GQ) and ΔLPML_Surv (MC) were very close, which empirically confirmed that the proposed MC approach for estimating LPML_Surv|Long and ΔLPML_Surv were as accurate as the numerical approximation approach for computing these quantities. Table 6 shows the running times in minutes on an Intel i686 processor machine with 16 GB of RAM memory using a GNU/Linux operating system for computing ΔDIC_Surv, ΔLPML_Surv (GQ), and ΔLPML_Surv (MC) under SPML and TML with K = 30 based on an Markov chain Monte Carlo (MCMC) sample size of 20,000. From Table 6, we see that (i) the running times for computing ΔLPML_Surv (MC) were similar to those for computing ΔDIC_Surv under SPML though ΔLPML_Surv (MC) required two MCMC samples; (ii) SPML required much less running time than TML; and (iii) ΔLPML_Surv (GQ) required the most running time.

Table 5.

MC Standard Errors of DIC_Surv|Long, ADIC_Surv, LPML_Surv|Long, and ΔLPML_Surv under SPML and TML with K = 30 based on an MC sample size of 20,000

Model		Anorexia	Cough	Dyspnea	Fatigue	Pain
SPML	DIC_Surv\|Long	0.391	0.263	0.397	0.396	0.458
	ΔDIC_Surv	0.530	0.444	0.535	0.534	0.581
	LPML_Surv\|Long (GQ)	0.248	0.246	0.200	0.196	0.225
	LPML_Surv\|Long (MC)	0.248	0.246	0.201	0.197	0.222
	ΔLPML_Surv (GQ)	0.314	0.313	0.278	0.275	0.297
	ΔLPML_Surv (MC)	0.314	0.313	0.279	0.275	0.294

TML	DIC_Surv\|Long	0.288	0.346	0.451	0.512	0.370
	ΔDIC_Surv	0.460	0.498	0.576	0.624	0.515
	LPML_Surv\|Long (GQ)	0.104	0.125	0.114	0.073	0.074
	LPML_Surv\|Long (MC)	0.104	0.125	0.114	0.073	0.074
	ΔLPML_Surv (GQ)	0.219	0.230	0.224	0.206	0.207
	ΔLPML_Surv (MC)	0.219	0.230	0.224	0.206	0.207

Open in a new tab

Table 6.

Running Times in Minutes for Computing ΔDIC_Surv, ΔLPML_Surv (GQ), and ΔLPML_Surv (MC) under SPML and TML with K = 30 based on an MC sample size of 20,000

Model		Anorexia	Cough	Dyspnea	Fatigue	Pain
SPML	MCMC Sampling	6.0	5.9	5.8	6.3	5.9
	ΔDIC_Surv	16.0	14.9	18.3	17.7	16.9
	ΔLPML_Surv (GQ)	18.2	17.6	21.0	20.3	21.4
	ΔLPML_Surv (MC)	17.6	16.0	18.5	17.9	18.5

TML	MCMC Sampling	523.9	526.2	525.7	521.9	516.6
	ΔDIC_Surv	1142.7	1070.9	1195.1	1153.4	1245.6
	ΔLPML_Surv (GQ)	1681.2	1711.8	1704.2	1718.3	1665.4
	ΔLPML_Surv (MC)	1488.4	1494.2	1397.0	1486.3	1373.8

Open in a new tab

Finally, we computed relevant quantities under the second decomposition of DIC and LPML given in Appendix B of the supplementary material to quantify the contribution of the PFS data to the fit of the longitudinal data. The results are shown in Table 7. As mentioned earlier, the total numbers of observations for these five symptoms were different, implying that ΔDIC_Long and ΔLPML_Long were not directly comparable for the EMPHACIS data. Therefore, we consider the relative ΔDIC_Long and ΔLPML_Long defined by

R Δ {DIC}_{Long} = \frac{Δ {DIC}_{Long}}{{DIC}_{Long, alone}} \times 1000

and

R Δ {LPML}_{Long} = \frac{Δ {LPML}_{Long}}{∣ {LPML}_{Long, alone} ∣} \times 1000 .

From Table 7, we see that pain had the largest relative improvement in terms of both RΔDIC_Long and RΔLPML_Long (MC), which were 5.00 and 6.18 under SPML and 3.52 and 4.83 under TML, and cough had the smallest relative improvement with RΔDIC_Long = 0.35 and RΔLPML_Long = 1.46 (MC) under SPML and RΔDIC_Long = 0.57 and RΔLPML_Long = 1.90 (MC) under TML. The values of RΔDIC_Long and RΔLPML_Long (MC) for anorexia, dyspnea, and fatigue, were 1.89 and 2.77, 1.89 and 3.04, and 2.91 and 3.89, respectively, under SPML; and 1.68 and 2.76, 1.09 and 2.33, and 2.26 and 3.43, respectively, under TML.

Table 7.

Decomposition II of DICs and LPMLs under SPML and TML with K = 30

Model		Anorexia	Cough	Dyspnea	Fatigue	Pain
Long alone	DIC_Long,alone	12017.52	12248.60	9911.66	11005.02	10867.34

	LPML_Long,alone	−6011.72	−6133.68	−4959.88	−5505.25	−5439.62

SPML	DIC_Surv	2019.87	2018.65	2019.26	2019.99	2022.59
	(p_D[Surv])	37.28	37.32	36.94	37.06	36.42
	DIC_Long\|Surv	11994.80	12244.31	9892.94	10973.01	10813.05
	(p_D[Long\|Surv])	15.17	14.95	15.70	15.10	15.94
	ΔDIC_Long	22.72	4.28	18.73	32.00	54.29
	RΔDIC_Long	1.89	0.35	1.89	2.91	5.00

	LPML_Long\|Surv (GQ)	−5995.08	−6124.71	−4944.80	−5483.84	−5405.97
	LPML_Long\|Surv (MC)	−5995.08	−6124.70	−4944.78	−5483.83	−5406.03
	ΔLPML_Long (GQ)	16.64	8.97	15.08	21.41	33.65
	ΔLPML_Long (MC)	16.64	8.98	15.09	21.42	33.59
	RΔLPML_Long (GQ)	2.77	1.46	3.04	3.89	6.19
	RΔLPML_Long (MC)	2.77	1.46	3.04	3.89	6.18

TML	DIC_Surv	2019.31	2018.87	2018.34	2019.08	2020.69
	(p_D[Surv])	37.29	37.20	37.28	37.26	36.89
	DIC_Long\|Surv	11997.34	12241.56	9900.87	10980.18	10829.05
	(p_D[Long\|Surv])	14.15	14.12	14.03	14.12	14.53
	ΔDIC_Long	20.18	7.04	10.79	24.84	38.29
	RΔDIC_Long	1.68	0.57	1.09	2.26	3.52

	LPML_Long\|Surv (GQ)	−5995.20	−6122.07	−4948.57	−5486.61	−5413.59
	LPML_Long\|Surv (MC)	−5995.11	−6122.04	−4948.30	−5486.38	−5413.33
	ΔLPML_Long (GQ)	16.51	11.61	11.30	18.64	26.03
	ΔLPML_Long (MC)	16.60	11.64	11.58	18.87	26.29
	RΔLPML_Long (GQ)	2.75	1.89	2.28	3.39	4.78
	RΔLPML_Long (MC)	2.76	1.90	2.33	3.43	4.83

Open in a new tab

5 Discussion

In this paper, we have developed two versions of the DIC and CPO decomposition as well as two sets of new criteria in Section 2 (ΔDIC_Surv, ΔLPML_Surv) and in Appendix B (ΔDIC_Long, ΔLPML_Long). The decompositions, DIC = DIC_Long + DIC_Surv|Long and LPML = LPML_Long + LPML_Surv|Long (Decomposition I), are most useful when our primary goal is to make inferences about the parameters in the survival component of the joint model while using the information from longitudinal data through the joint model. In practice, DIC_Surv|Long and LPML_Surv|Long can be used to select the survival component of the joint model and the main utility of ΔDIC_Surv and ΔLPML_Surv is to determine which longitudinal marker leads to the most gain in the fit of the survival data or which longitudinal marker is most highly associated with the survival outcome. The simulation study in Section 3 and the real data analysis in Section 4 empirically demonstrated that DIC_Surv|Long, LPML_Surv|Long, ΔDIC_Surv, and ΔLPML_Surv are quite effective and promising in selecting the survival component of the joint model and identifying the importance of longitudinal biomarkers in fitting the survival data. Decomposition II and the corresponding RΔDIC_Long and RΔLPML_Long criteria are useful when the main focus of a clinical trial is on the longitudinal markers and the primary goal is to make inferences about the parameters in the longitudinal component of the joint model while using the information from the survival data through the joint model. Similar to Decomposition I, DIC_Long|Surv and LPML_Long|Surv can be used to choose the longitudinal component of the joint model and RΔDIC_Long and RΔLPML_Long are useful to determine the gain in the fit of the longitudinal data while using the information from the survival data through the joint model.

In the AIC decomposition developed in Zhang et al. (2014), dim(φ₁) and dim(φ₂) were manually allocated to AIC_Long and AIC_Surv|Long, respectively, as the dimensions of the parameters. However, the parameters φ₁ are also involved in computing AIC_Surv|Long. Thus, the appropriateness of these dimension allocations needs to be further validated. The DIC decomposition developed in this paper automatically calculates the dimensions of the parameters, p_D_[Long] and p_D_[Surv|Long], in DIC_Long and DIC_Surv|Long. The real data analysis in Section 4 and the results shown in Table S2 of the supplementary material empirically demonstrated that p_D_[Long] ≈ dim(φ₁) and p_D_[Surv|Long] ≈ dim(φ₂). Since the DIC approximates the AIC as discussed in Spiegelhalter et al. (2002) for Gaussian posteriors (or very large samples), our empirical results based on the DIC decomposition confirm that the dimension allocations of the model parameters in the AIC decomposition are quite appropriate. Both the AIC decomposition and the DIC decomposition require the numerical approximation of an intractable integral ∫ f(y_i, t_i, θ_i|φ, δ_i, x_i, z_i)dθ_i in (2.4) for computing the joint distribution of y_i and t_i. The proposed LPML decomposition avoids the calculation of this integral. As demonstrated in both the simulation study and the real data analysis, LPML_Surv|Long and ΔLPML_Surv performed equally well as DIC_Surv|Long and ΔDIC_Surv in selecting the survival model and identifying the important longitudinal markers. In addition, as shown in Table 6, LPML_Surv|Long (MC) and ΔLPML_Surv (MC) require less computing time than LPML_Surv|Long (GQ) and ΔLPML_Surv (GQ). Thus, the LPML decomposition may be potentially more useful in practice.

In Section 2, we proposed two approaches (AGQ and MC) for computing CPO related criteria. As shown in Section 4, both approaches yielded almost identical results. However, the proposed MC method requires less computing time and is more applicable to models involving high-dimensional random effects than the AGQ approach. In Section 4, the LPML_Surv|Long's were calculated based on the CPO decomposition in (2.18) by taking φ* as the posterior mean of φ. We also calculated the LPML_Surv|Long's by taking φ* as the posterior median of φ. For the EMPHACIS Data, under SPML with K = 30, the LPML_Surv|Long's calculated based on the posterior medians were −998.95, −1008.13, −1000.64, −994.54, and −984.13 for anorexia, cough, dyspnea, fatigue, pain, respectively. These values are very close to those given in Table 2. Thus, LPML_Surv|Long is relatively robust to the choice of φ*.

Hanson et al. (2011) introduced the conditional CPO and LPML. Using our notation, the conditional CPO is defined as

{CPO}_{i, Surv}^{c} = \int f (t_{i} ∣ φ_{2}, θ_{i}, δ_{i}, z_{i}) π (φ, θ^{R} ∣ D_{Long, obs}, D_{Surv, obs}^{(- i)}) d θ^{R} d φ,

where $π (φ, θ^{R} ∣ D_{Long, obs}, D_{Surv, obs}^{(- i)})$ is the joint posterior of (φ, θ^R) given D_Long,oba and $D_{Surv, obs}^{(- i)}$ with the survival data deleted for the i^th subject. The conditional LPML in Hanson et al. (2011) is thus defined by

{LPML}_{Surv}^{c} = \sum_{i = 1}^{n} \log ({CPO}_{i, Surv}^{c}) .

For the purpose of assessing the fit of the survival data, ${CPO}_{i, Surv}^{c}$ and ${LPML}_{Surv}^{c}$ do correspond to CPO_i,_Surv|Long and LPML_Surv|Long. However, they are not the same unless the longitudinal data are independent of the survival data. Although ${CPO}_{i, Surv}^{c}$ and ${LPML}_{Surv}^{c}$ cannot be used to assess the overall fit of the joint model or to determine the gain in the fit of the longitudinal data while using the information from the survival data through the joint model, they are quite attractive due to computational simplicity if the primary goal is to make inferences about the parameters in the survival component. We defer to a future project for further investigation of theoretical and empirical comparisons between ${LPML}_{Surv}^{c}$ and LPML_Surv|Long.

Although the proposed Bayesian criteria are developed under the joint model in Section 2, they can be easily extended to models for other types of data such as longitudinal binary/ordinal response or count data as well as other types of survival models such as cure rate models, nonproportional hazards models, and competing risks models discussed in Klein et al. (2013). Furthermore, the proposed MC method for computing CPO is applicable for a variety of Bayesian models involving random effects or latent variables. The potential applications of the proposed methodology to other types of longitudinal data such as multi-dimensional longitudinal data and more complex survival data, such as survival data in the presence of competing risks and/or semi-competing risks, are currently under investigation.

In Sections 3 and 4, we carried out all computations using the FORTRAN 95 software with double precision and IMSL subroutines. The FORTRAN 95 code is available upon request. We are currently working on a user-friendly R interface of the FORTRAN code that has been developed for this paper so that it would be available to practitioners.

Supplementary Material

Supp

NIHMS740627-supplement-Supp.pdf^{(251.1KB, pdf)}

Acknowledgments

We would like to thank the Editor, the Associate Editor, and the two anonymous reviewers for their very helpful comments and suggestions, which have led to a much improved version of the paper. Dr. M.-H. Chen and Dr. J. G. Ibrahim's research was partially supported by NIH grants #GM70335 and #P01CA142538.

Footnotes

Supplementary Materials

In the supplementary material, we provide the details of prior specification and posterior computation (Appendix A); the development of the second decomposition (Decomposition II) of DIC and LPML (Appendix B); the proofs of identities, results, and theorems (Appendix C); and additional tables (Appendix D) for DIC, p_D, and LPML for fitting survival alone with different K, p_D's and p_D_[Surv|Long]'s for five PROs under SPML and TML with different K associated with Table 2, and the decomposition of LPML for five PROs under SPML and TML with different K using Gaussian quadrature associated with Table 2 for the EMPHACIS data in Section 4.

References

Bridges JFP, Mohamed AF, Finnern HW, Woehl A, Hauber AB. Patients’ preferences for treatment outcomes for advanced non-small cell lung cancer: A conjoint analysis. Lung Cancer. 2012;77:224–231. doi: 10.1016/j.lungcan.2012.01.016. [DOI] [PubMed] [Google Scholar]
Brown ER, Ibrahim JG, DeGruttola V. A flexible B-spline model for multiple longitudinal biomarkers and survival. Biometrics. 2005;61:64–73. doi: 10.1111/j.0006-341X.2005.030929.x. [DOI] [PubMed] [Google Scholar]
Brown ER, Ibrahim JG. Bayesian approaches to joint cure rate and longitudinal models with applications to cancer vaccine trials. Biometrics. 2003;59:686–693. doi: 10.1111/1541-0420.00079. [DOI] [PubMed] [Google Scholar]
Chen M-H. Importance-weighted marginal Bayesian posterior density estimation. Journal of the American Statistical Association. 1994;89:818–824. [Google Scholar]
Chen M-H, Ibrahim JG, Sinha D. A new joint model for longitudinal and survival data with a cure fraction. Journal of Multivariate Analysis. 2004;91:18–34. [Google Scholar]
Chen M-H, Shao Q-M, Ibrahim JG. Monte Carlo Methods in Bayesian Computation. Springer-Verlag; New York: 2000. [Google Scholar]
Chi Y-Y, Ibrahim JG. Bayesian approaches to joint longitudinal and survival models accommodating both zero and nonzero cure fractions. Statistica Sinica. 2007;17:445–462. [Google Scholar]
Chi Y-Y, Ibrahim JG. Joint models for multivariate longitudinal and multivariate survival data. Biometrics. 2006;62:432–445. doi: 10.1111/j.1541-0420.2005.00448.x. [DOI] [PubMed] [Google Scholar]
Chib S. Marginal likelihood from the Gibbs output. Journal of the American Statistical Association. 1995;90:1313–1321. [Google Scholar]
Crowther MJ. STJM: Stata module to fit shared parameter joint models of longitudinal and survival data. 2012 http://econpapers.repec.org/software/bocbocode/s457502.htm/
Crowther MJ, Abrams KR, Lambert PC. Joint modeling of longitudinal and survival data. The Stata Journal. 2013;13:165–184. [Google Scholar]
Draper D, Krnjajić M. Technical Report. Department of Applied Mathematics and Statistics, University of California; Santa Cruz: 2005. Bayesian model specification. [Google Scholar]
DeGruttola V, Tu XM. Modeling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics. 1994;50:1003–1014. [PubMed] [Google Scholar]
DeMuro C, Clark M, Doward L, Mordin M, Gnanasakthy A. Assessment of PRO label claims granted by the FDA as compared to the EMA (2006-2010) Value in Health. 2013;16:1150–1155. doi: 10.1016/j.jval.2013.08.2293. [DOI] [PubMed] [Google Scholar]
Geisser S, Eddy WF. A predictive approach to model selection. Journal of the American Statistical Association. 1979;74:153–160. [Google Scholar]
Gelfand AE, Dey DK. Bayesian model choice: Asymptotics and exact calculations. Journal of the Royal Statistical Society, Series B. 1994;56:501–514. [Google Scholar]
Gelfand AE, Dey DK, Chang H. Model Determinating Using Predictive Distributions with Implementation via Sampling-based Methods (with Discussion) In: Bernado JM, Berger JO, Dawid AP, Smith AFM, editors. Bayesian Statistics. Vol. 4. Oxford University Press; Oxford: 1992. pp. 147–167. [Google Scholar]
Geweke J. Bayesian inference in econometrics models using Monte Carlo integration. Econometrica. 1989;57:1317–1340. [Google Scholar]
Hanson TE, Branscum AJ, Johnson WO. Predictive comparison of joint longitudinal-survival modeling: a case study illustrating competing approaches (with Discussion) Lifetime Data Analysis. 2011;17:3–42. doi: 10.1007/s10985-010-9162-0. [DOI] [PubMed] [Google Scholar]
Hatfield LA, Boye ME, Carlin BP. Joint modeling of multiple longitudinal patient-reported outcomes and survival. Journal of Biopharmaceutical Statistics. 2011;21:971–991. doi: 10.1080/10543406.2011.590922. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hatfield LA, Boye ME, Hackshaw MD, Carlin BP. Multilevel bayesian models for survival times and longitudinal patient-reported outcomes with many zeros. Journal of the American Statistical Association. 2012;107:875–885. [Google Scholar]
Henderson R, Diggle PJ, Dobson A. Joint modelling of longitudinal measurements and event time data. Biostatistics. 2000;1:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]
Hogan JW, Laird NM. Mixture models for the joint distribution or repeated measures and event times. Statistics in Medicine. 1997;16:239–257. doi: 10.1002/(sici)1097-0258(19970215)16:3<239::aid-sim483>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]
Ibrahim JG, Chen M-H, Sinha D. Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine studies. Statistica Sinica. 2004;14:863–883. [Google Scholar]
Ibrahim JG, Chen M-H, Sinha D. Bayesian Survival Analysis. Springer-Verlag; New York: 2001. [Google Scholar]
Ibrahim JG, Chu H, Chen LM. Basic concepts and methods for joint models of longitudinal and survival data. Journal of Clinical Oncology. 2010;28:2796–2801. doi: 10.1200/JCO.2009.25.0654. [DOI] [PMC free article] [PubMed] [Google Scholar]
Klein JP, van Houwelingen HC, Ibrahim JG, Scheike TH, editors. Handbook of Survival Analysis. Chapman & Hall; Boca Raton, FL: 2013. [Google Scholar]
Lavalley MP, DeGruttola V. Model for empirical Bayes estimators of longitudinal CD4 counts. Statistics in Medicine. 1996;15:2289–2305. doi: 10.1002/(SICI)1097-0258(19961115)15:21<2289::AID-SIM449>3.0.CO;2-I. [DOI] [PubMed] [Google Scholar]
Law NJ, Taylor JMG, Sandler H. The joint modeling of a longitudinal disease progression marker and the failure time process in the presence of cure. Biostatistics. 2002;3:547–563. doi: 10.1093/biostatistics/3.4.547. [DOI] [PubMed] [Google Scholar]
Meketon MS, Schmeiser BW. Overlapping batch means: Something for nothing? Proceedings of the Winter Simulation Conference. 1984:227–230. [Google Scholar]
Patricia HJ, Gralla RJ, Liepa AM, Symanowski JT, Rusthoven JJ. Measuring quality of life in patients with pleural mesothelioma using a modified version of the Lung Cancer Symptom Scale (LCSS): psychometric properties of the LCSS-Meso. Supportive Care in Cancer. 2006;14:11–21. doi: 10.1007/s00520-005-0837-0. [DOI] [PubMed] [Google Scholar]
Pawitan Y, Self S. Modeling disease marker processes in AIDS. Journal of the American Statistical Association. 1993;88:719–726. [Google Scholar]
Philipson P, Sousa I, Diggle P, Williamson P, Kolamunnage-Dona R, Henderson R. joineR: Joint modelling of repeated measurements and time-to-event data. R package version 1.0-3. 2012 http://cran.r-project.org/web/packages/joineR/index.html.
Pinheiro JC, Bates DM. Approximations to the log-likelihood function in the nonlinear mixed-effects model. Journal of Computational and Graphical Statistics. 1995;4:12–35. [Google Scholar]
Proust-Lima C, Philipps V, Diakite A, Liquet B. lcmm: Estimation of extended mixed models using latent classes and latent processes. R package version 1.6-4. 2014 http://cran.r-project.org/web/packages/lcmm/index.html.
Rizopoulos D. Joint Models for Longitudinal and Time-to-Event Data: With Applications in R. CRC Press/Chapman & Hall; Boca Raton, FL: 2012a. [Google Scholar]
Rizopoulos D. JM: Joint modeling of longitudinal and survival data. R package version 1.1-0. 2012b http://rwiki.sciviews.org/doku.php?id=packages:cran:jm.
Rizopoulos D. JMbayes: Joint modeling of longitudinal and time-to-event data under a Bayesian approach. R package version 0.5-3. 2014 http://cran.r-project.org/web/packages/JMbayes/index.html.
Rothman M, Burke L, Erickson P, Leidy NK, Patrick DL, Petrie CD. Use of existing patient-reported outcome (PRO) instruments and their modification: the ISPOR good research practices for evaluating and documenting content validity for the use of existing instruments and their modification PRO task force report. Value in Health. 2009;12:1075–1083. doi: 10.1111/j.1524-4733.2009.00603.x. [DOI] [PubMed] [Google Scholar]
Schluchter MD. Methods for the analysis of informatively censored longitudinal data. Statistics in Medicine. 1992;11:1861–1870. doi: 10.1002/sim.4780111408. [DOI] [PubMed] [Google Scholar]
Siddiqui F, Liu AK, Watkins-Bruner D, Movsas B. Patient-reported outcomes and survivorship in radiation oncology: overcoming the cons. Journal of Clinical Oncology. 2014;32:2920–2927. doi: 10.1200/JCO.2014.55.0707. [DOI] [PMC free article] [PubMed] [Google Scholar]
Song X, Davidian M, Tsiatis AA. An estimator for the proportional hazards model with multiple longitudinal covariates measured with error. Biostatistics. 2002;3:511–528. doi: 10.1093/biostatistics/3.4.511. [DOI] [PubMed] [Google Scholar]
Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B. 2002;64:583–639. [Google Scholar]
Thompson JK, Westbom CM, Shukla A. Malignant mesothelioma: development to therapy. Journal of Cellular Biochemistry. 2014;115:1–7. doi: 10.1002/jcb.24642. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: An overview. Statistica Sinica. 2004;14:809–834. [Google Scholar]
Vogelzang NJ, Rusthoven JJ, Symanowski J, Denham C, Kaukel E, Ruffie P, Gatzemeier U, Boyer M, Emri S, Manegold C, Niyikiza C, Paoletti P. Phase III study of pemetrexed in combination with cisplatin versus cisplatin alone in patients with malignant pleural mesothelioma. Journal of Clinical Oncology. 2003;21:2636–2644. doi: 10.1200/JCO.2003.11.136. [DOI] [PubMed] [Google Scholar]
Wang P, Shen W, Boye ME. Joint modeling of longitudinal outcomes and survival using latent growth modeling approach in a mesothelioma trial. Health Services and Outcomes Research Methodology. 2012;12:182–199. doi: 10.1007/s10742-012-0092-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xu J, Zeger SL. Joint analysis of longitudinal data comprising repeated measures and times to events. Applied Statistics. 2001a;50:375–387. [Google Scholar]
Xu J, Zeger SL. The evaluation of multiple surrogate endpoints. Biometrics. 2001b;57:81–87. doi: 10.1111/j.0006-341x.2001.00081.x. [DOI] [PubMed] [Google Scholar]
Zhang D, Chen M-H, Ibrahim JG, Boye ME, Wang P, Shen W. Assessing model fit in joint models of longitudinal and survival data with applications to cancer clinical trials. Statistics in Medicine. 2014;33:4715–4733. doi: 10.1002/sim.6269. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang D, Chen M-H, Ibrahim JG, Boye ME, Shen W. Assessment of fit in longitudinal data for joint models with applications to cancer clinical trials. In: Chen Z, Liu A, Qu Y, Tang L, Ting N, Tsong Y, editors. Applied Statistics in Biomedicine and Clinical Trials Design - Selected Papers from 2013 ICSA/ISBS Joint Statistical Meetings. Springer; New York: 2015a. pp. 347–365. New York: Springer. In press. [Google Scholar]
Zhang D, Chen M-H, Ibrahim JG, Boye ME, Shen W. JMFit: A SAS Macro for Joint Models of Longitudinal and Survival Data. Journal of Statistical Software. 2015b doi: 10.18637/jss.v071.i03. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp

NIHMS740627-supplement-Supp.pdf^{(251.1KB, pdf)}

[R1] Bridges JFP, Mohamed AF, Finnern HW, Woehl A, Hauber AB. Patients’ preferences for treatment outcomes for advanced non-small cell lung cancer: A conjoint analysis. Lung Cancer. 2012;77:224–231. doi: 10.1016/j.lungcan.2012.01.016. [DOI] [PubMed] [Google Scholar]

[R2] Brown ER, Ibrahim JG, DeGruttola V. A flexible B-spline model for multiple longitudinal biomarkers and survival. Biometrics. 2005;61:64–73. doi: 10.1111/j.0006-341X.2005.030929.x. [DOI] [PubMed] [Google Scholar]

[R3] Brown ER, Ibrahim JG. Bayesian approaches to joint cure rate and longitudinal models with applications to cancer vaccine trials. Biometrics. 2003;59:686–693. doi: 10.1111/1541-0420.00079. [DOI] [PubMed] [Google Scholar]

[R4] Chen M-H. Importance-weighted marginal Bayesian posterior density estimation. Journal of the American Statistical Association. 1994;89:818–824. [Google Scholar]

[R5] Chen M-H, Ibrahim JG, Sinha D. A new joint model for longitudinal and survival data with a cure fraction. Journal of Multivariate Analysis. 2004;91:18–34. [Google Scholar]

[R6] Chen M-H, Shao Q-M, Ibrahim JG. Monte Carlo Methods in Bayesian Computation. Springer-Verlag; New York: 2000. [Google Scholar]

[R7] Chi Y-Y, Ibrahim JG. Bayesian approaches to joint longitudinal and survival models accommodating both zero and nonzero cure fractions. Statistica Sinica. 2007;17:445–462. [Google Scholar]

[R8] Chi Y-Y, Ibrahim JG. Joint models for multivariate longitudinal and multivariate survival data. Biometrics. 2006;62:432–445. doi: 10.1111/j.1541-0420.2005.00448.x. [DOI] [PubMed] [Google Scholar]

[R9] Chib S. Marginal likelihood from the Gibbs output. Journal of the American Statistical Association. 1995;90:1313–1321. [Google Scholar]

[R10] Crowther MJ. STJM: Stata module to fit shared parameter joint models of longitudinal and survival data. 2012 http://econpapers.repec.org/software/bocbocode/s457502.htm/

[R11] Crowther MJ, Abrams KR, Lambert PC. Joint modeling of longitudinal and survival data. The Stata Journal. 2013;13:165–184. [Google Scholar]

[R12] Draper D, Krnjajić M. Technical Report. Department of Applied Mathematics and Statistics, University of California; Santa Cruz: 2005. Bayesian model specification. [Google Scholar]

[R13] DeGruttola V, Tu XM. Modeling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics. 1994;50:1003–1014. [PubMed] [Google Scholar]

[R14] DeMuro C, Clark M, Doward L, Mordin M, Gnanasakthy A. Assessment of PRO label claims granted by the FDA as compared to the EMA (2006-2010) Value in Health. 2013;16:1150–1155. doi: 10.1016/j.jval.2013.08.2293. [DOI] [PubMed] [Google Scholar]

[R15] Geisser S, Eddy WF. A predictive approach to model selection. Journal of the American Statistical Association. 1979;74:153–160. [Google Scholar]

[R16] Gelfand AE, Dey DK. Bayesian model choice: Asymptotics and exact calculations. Journal of the Royal Statistical Society, Series B. 1994;56:501–514. [Google Scholar]

[R17] Gelfand AE, Dey DK, Chang H. Model Determinating Using Predictive Distributions with Implementation via Sampling-based Methods (with Discussion) In: Bernado JM, Berger JO, Dawid AP, Smith AFM, editors. Bayesian Statistics. Vol. 4. Oxford University Press; Oxford: 1992. pp. 147–167. [Google Scholar]

[R18] Geweke J. Bayesian inference in econometrics models using Monte Carlo integration. Econometrica. 1989;57:1317–1340. [Google Scholar]

[R19] Hanson TE, Branscum AJ, Johnson WO. Predictive comparison of joint longitudinal-survival modeling: a case study illustrating competing approaches (with Discussion) Lifetime Data Analysis. 2011;17:3–42. doi: 10.1007/s10985-010-9162-0. [DOI] [PubMed] [Google Scholar]

[R20] Hatfield LA, Boye ME, Carlin BP. Joint modeling of multiple longitudinal patient-reported outcomes and survival. Journal of Biopharmaceutical Statistics. 2011;21:971–991. doi: 10.1080/10543406.2011.590922. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] Hatfield LA, Boye ME, Hackshaw MD, Carlin BP. Multilevel bayesian models for survival times and longitudinal patient-reported outcomes with many zeros. Journal of the American Statistical Association. 2012;107:875–885. [Google Scholar]

[R22] Henderson R, Diggle PJ, Dobson A. Joint modelling of longitudinal measurements and event time data. Biostatistics. 2000;1:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]

[R23] Hogan JW, Laird NM. Mixture models for the joint distribution or repeated measures and event times. Statistics in Medicine. 1997;16:239–257. doi: 10.1002/(sici)1097-0258(19970215)16:3<239::aid-sim483>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]

[R24] Ibrahim JG, Chen M-H, Sinha D. Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine studies. Statistica Sinica. 2004;14:863–883. [Google Scholar]

[R25] Ibrahim JG, Chen M-H, Sinha D. Bayesian Survival Analysis. Springer-Verlag; New York: 2001. [Google Scholar]

[R26] Ibrahim JG, Chu H, Chen LM. Basic concepts and methods for joint models of longitudinal and survival data. Journal of Clinical Oncology. 2010;28:2796–2801. doi: 10.1200/JCO.2009.25.0654. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] Klein JP, van Houwelingen HC, Ibrahim JG, Scheike TH, editors. Handbook of Survival Analysis. Chapman & Hall; Boca Raton, FL: 2013. [Google Scholar]

[R28] Lavalley MP, DeGruttola V. Model for empirical Bayes estimators of longitudinal CD4 counts. Statistics in Medicine. 1996;15:2289–2305. doi: 10.1002/(SICI)1097-0258(19961115)15:21<2289::AID-SIM449>3.0.CO;2-I. [DOI] [PubMed] [Google Scholar]

[R29] Law NJ, Taylor JMG, Sandler H. The joint modeling of a longitudinal disease progression marker and the failure time process in the presence of cure. Biostatistics. 2002;3:547–563. doi: 10.1093/biostatistics/3.4.547. [DOI] [PubMed] [Google Scholar]

[R30] Meketon MS, Schmeiser BW. Overlapping batch means: Something for nothing? Proceedings of the Winter Simulation Conference. 1984:227–230. [Google Scholar]

[R31] Patricia HJ, Gralla RJ, Liepa AM, Symanowski JT, Rusthoven JJ. Measuring quality of life in patients with pleural mesothelioma using a modified version of the Lung Cancer Symptom Scale (LCSS): psychometric properties of the LCSS-Meso. Supportive Care in Cancer. 2006;14:11–21. doi: 10.1007/s00520-005-0837-0. [DOI] [PubMed] [Google Scholar]

[R32] Pawitan Y, Self S. Modeling disease marker processes in AIDS. Journal of the American Statistical Association. 1993;88:719–726. [Google Scholar]

[R33] Philipson P, Sousa I, Diggle P, Williamson P, Kolamunnage-Dona R, Henderson R. joineR: Joint modelling of repeated measurements and time-to-event data. R package version 1.0-3. 2012 http://cran.r-project.org/web/packages/joineR/index.html.

[R34] Pinheiro JC, Bates DM. Approximations to the log-likelihood function in the nonlinear mixed-effects model. Journal of Computational and Graphical Statistics. 1995;4:12–35. [Google Scholar]

[R35] Proust-Lima C, Philipps V, Diakite A, Liquet B. lcmm: Estimation of extended mixed models using latent classes and latent processes. R package version 1.6-4. 2014 http://cran.r-project.org/web/packages/lcmm/index.html.

[R36] Rizopoulos D. Joint Models for Longitudinal and Time-to-Event Data: With Applications in R. CRC Press/Chapman & Hall; Boca Raton, FL: 2012a. [Google Scholar]

[R37] Rizopoulos D. JM: Joint modeling of longitudinal and survival data. R package version 1.1-0. 2012b http://rwiki.sciviews.org/doku.php?id=packages:cran:jm.

[R38] Rizopoulos D. JMbayes: Joint modeling of longitudinal and time-to-event data under a Bayesian approach. R package version 0.5-3. 2014 http://cran.r-project.org/web/packages/JMbayes/index.html.

[R39] Rothman M, Burke L, Erickson P, Leidy NK, Patrick DL, Petrie CD. Use of existing patient-reported outcome (PRO) instruments and their modification: the ISPOR good research practices for evaluating and documenting content validity for the use of existing instruments and their modification PRO task force report. Value in Health. 2009;12:1075–1083. doi: 10.1111/j.1524-4733.2009.00603.x. [DOI] [PubMed] [Google Scholar]

[R40] Schluchter MD. Methods for the analysis of informatively censored longitudinal data. Statistics in Medicine. 1992;11:1861–1870. doi: 10.1002/sim.4780111408. [DOI] [PubMed] [Google Scholar]

[R41] Siddiqui F, Liu AK, Watkins-Bruner D, Movsas B. Patient-reported outcomes and survivorship in radiation oncology: overcoming the cons. Journal of Clinical Oncology. 2014;32:2920–2927. doi: 10.1200/JCO.2014.55.0707. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R42] Song X, Davidian M, Tsiatis AA. An estimator for the proportional hazards model with multiple longitudinal covariates measured with error. Biostatistics. 2002;3:511–528. doi: 10.1093/biostatistics/3.4.511. [DOI] [PubMed] [Google Scholar]

[R43] Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B. 2002;64:583–639. [Google Scholar]

[R44] Thompson JK, Westbom CM, Shukla A. Malignant mesothelioma: development to therapy. Journal of Cellular Biochemistry. 2014;115:1–7. doi: 10.1002/jcb.24642. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: An overview. Statistica Sinica. 2004;14:809–834. [Google Scholar]

[R46] Vogelzang NJ, Rusthoven JJ, Symanowski J, Denham C, Kaukel E, Ruffie P, Gatzemeier U, Boyer M, Emri S, Manegold C, Niyikiza C, Paoletti P. Phase III study of pemetrexed in combination with cisplatin versus cisplatin alone in patients with malignant pleural mesothelioma. Journal of Clinical Oncology. 2003;21:2636–2644. doi: 10.1200/JCO.2003.11.136. [DOI] [PubMed] [Google Scholar]

[R47] Wang P, Shen W, Boye ME. Joint modeling of longitudinal outcomes and survival using latent growth modeling approach in a mesothelioma trial. Health Services and Outcomes Research Methodology. 2012;12:182–199. doi: 10.1007/s10742-012-0092-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R48] Xu J, Zeger SL. Joint analysis of longitudinal data comprising repeated measures and times to events. Applied Statistics. 2001a;50:375–387. [Google Scholar]

[R49] Xu J, Zeger SL. The evaluation of multiple surrogate endpoints. Biometrics. 2001b;57:81–87. doi: 10.1111/j.0006-341x.2001.00081.x. [DOI] [PubMed] [Google Scholar]

[R50] Zhang D, Chen M-H, Ibrahim JG, Boye ME, Wang P, Shen W. Assessing model fit in joint models of longitudinal and survival data with applications to cancer clinical trials. Statistics in Medicine. 2014;33:4715–4733. doi: 10.1002/sim.6269. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] Zhang D, Chen M-H, Ibrahim JG, Boye ME, Shen W. Assessment of fit in longitudinal data for joint models with applications to cancer clinical trials. In: Chen Z, Liu A, Qu Y, Tang L, Ting N, Tsong Y, editors. Applied Statistics in Biomedicine and Clinical Trials Design - Selected Papers from 2013 ICSA/ISBS Joint Statistical Meetings. Springer; New York: 2015a. pp. 347–365. New York: Springer. In press. [Google Scholar]

[R52] Zhang D, Chen M-H, Ibrahim JG, Boye ME, Shen W. JMFit: A SAS Macro for Joint Models of Longitudinal and Survival Data. Journal of Statistical Software. 2015b doi: 10.18637/jss.v071.i03. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Bayesian Model Assessment in Joint Modeling of Longitudinal and Survival Data with Applications to Cancer Clinical Trials

Danjie Zhang

Ming-Hui Chen

Joseph G Ibrahim

Mark E Boye

Wei Shen

Summary

1 Introduction

2 Bayesian Assessment of Model Fit in the Joint Model

2.1 The Joint Models

2.2 The Likelihood and Posterior

2.3 Deviance Information Criterion

2.3.1 DIC Decomposition

Result 1

2.3.2 ΔDICSurv

2.4 Conditional Predictive Ordinate

2.4.1 CPO Computation

CPO Identity I

CPO Identity II

Theorem 1

Remark 1

2.4.2 CPO Decomposition

CPO Identity III

Remark 2

Theorem 2

Remark 3

Remark 4

2.4.3 LPML and LPML Decomposition

Result 2

2.4.4 ΔLPMLSurv

3 A Simulation Study

Table 1.

Figure 1.

4 Analysis of the EMPHACIS Data

Table 2.

Table 3.

Table 4.

Table 5.

Table 6.

Table 7.

5 Discussion

Supplementary Material

Acknowledgments

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

2.3.2 ΔDIC_Surv

2.4.4 ΔLPML_Surv