Assessing model fit in joint models of longitudinal and survival data with applications to cancer clinical trials

Danjie Zhang; Ming-Hui Chen; Joseph G Ibrahim; Mark E Boye; Ping Wang; Wei Shen

doi:10.1002/sim.6269

. Author manuscript; available in PMC: 2015 Nov 30.

Published in final edited form as: Stat Med. 2014 Jul 20;33(27):4715–4733. doi: 10.1002/sim.6269

Assessing model fit in joint models of longitudinal and survival data with applications to cancer clinical trials

Danjie Zhang ^a, Ming-Hui Chen ^a, Joseph G Ibrahim ^b,^*, Mark E Boye ^c, Ping Wang ^c, Wei Shen ^c

PMCID: PMC4221436 NIHMSID: NIHMS611409 PMID: 25044061

Abstract

Joint models for longitudinaland survival data now have along history of being used in clinical trials or other studies in which the goal is to assess a treatment effect while accounting for longitudinal assessments such as patient-reported outcomes or tumor response. Compared to using survival data alone,the joint modeling of survival and longitudinal data allows for estimation of direct and indirect treatment effects, thereby resulting in improved efficacy assessment. Although global fit indices such as AIC or BIC can be used to rank joint models, these measures do not provide separate assessments of each component of the joint model. In this paper, we develop a novel decomposition of AIC and BIC (i.e., AIC = AIC_Long + AIC_Surv|Long and BIC = BIC_Long + BIC_Surv|Long) that allows us to assess the fit of each component of the joint model, and in particular to assess the fit of the longitudinal component of the model and the survival component separately. Based on this decomposition, we then propose ΔAIC_Surv and ΔBIC_Surv to determine the importance and contribution of the longitudinal data to the model fit of the survival data. Moreover, this decomposition, along with ΔAIC_Surv and ΔBIC_Surv, is also quite useful in comparing, for example, trajectory-based joint models and shared parameter joint models and deciding which type of model best fits the survival data. We examine a detailed case study in mesothelioma to apply our proposed methodology along with an extensive set of simulation studies.

Keywords: AIC, BIC, Patient-reported outcome (PRO), Shared parameter model, Time-varying covariates model, Trajectory model

1. Introduction

Through the joint modeling of longitudinal and survival data, researchers may reduce bias in the estimates of the treatment effect and also increase the power to compare the efcacy of a new oncology treatment with the current standard of care [1, 2]. Although the joint analysis of longitudinal and time-to-event outcomes has been widely published in statistical journals, it has not yet been routinely applied to the analysis of patient-reported outcomes (PROs) for the purpose of evaluating the efcacy and tolerability of cancer treatment. One barrier to the implementation of these methods has been the lack of usable sofitware to guide the programming and evaluation of these joint models. Building on previous joint modeling work in a highly symptomatic and particularly fatal condition [malignant pleural mesothelioma (MPM)] [3, 4], we develop methods to evaluate model fit in order to identify proper model specification.

This work was motivated by the need to adequately assess the differential benefits of alternative medical treatments, particularly in oncology applications where the survival advantage between competing medications may be modest. In this setting, information from the patients’ perspectives can be useful in evaluating actual patients’ experiences on dimensions known to be important to them and also associated with treatment outcomes. Accordingly, the field of PROs has evolved and has reached a common understanding about good clinical practices for the use of PROs [5]. Additionally, the U.S. and European regulators have published guidance on the use of these measures to support PRO-based claims in pharmaceutical product labeling (European Medicines Agency, 2005; US Food and Drug Administration Guidance for Industry, 2009)[6]. Unfortunately, relatively little attention has been paid to similarly advancing the analysis of trial-based PRO data; the inclusion of PRO assessments is seldom done with the rigor used to specify and analyze traditional endpoints of survival and tumor response [7]. Hence, the benefits of good PRO practice standards and insightful regulatory guidance have not generally led to informative conclusions. Published results concerning the use of joint survival/PRO models should help inform decision makers about the impact of anticancer treatment on both survival and patient well-being [3, 4, 8]. Joint modeling of these endpoints can provide a comparative assessment of patient-reported changes in specifc symptoms or global measures (e.g., quality of life or functioning) that correspond to treatment-related changes in survival. Therefore, it could be shown that increased survival was accompanied by relatively better PRO scores or alternatively that extended progression-free or overall survival was experienced at the expense of well-being. To support this joint modeling, we show how to evaluate the distinct effects of longitudinal and time-to-event outcomes on the fit of the joint model, and we develop the necessary SAS code to facilitate use of these methods.

The literature on joint modeling of longitudinal and survival data has burgeoned to the point that it is impractical to make broad general conclusions based on a systematic review of the literature. It is, however, practical and useful to describe the two basic fundamental approaches in joint modeling of longitudinal and survival to achieve this goal. The first is the "trajectory model" (TM) approach, where the trajectory function (mean response) from the longitudinal model is substituted into the hazard function of the survival model, thereby serving as a time-varying covariate in the survival model. The second basic approach is the "shared parameter model" (SPM), where the longitudinal model and survival model share common random effects which then induces correlation between the longitudinal and survival components. Both modeling schemes have advantages and disadvantages. The TM advantage, compared to the SPM, is that it leads to a straight forward interpretation of the association between the longitudinal marker and survival time through the direct inclusion of the trajectory function in the hazard. For the SPM, the characterization of the association is much more complex and can only be analytically determined once the random effects have been integrated out, since the two components of the model are independent conditional on these random effects. Typically, this integration cannot be carried out in closed form, and even if it were, the resulting dependence structure would be very complicated involving lots of parameters and resulting in difcult interpretations.

There have been many papers in the statistical literature concerning these two basic approaches. The TM in joint modeling for cancer vaccine trials in malignant melanoma has been considered in [9, 10, 11, 12]. The TM models have been also used in quality-of-life studies [13, 14, 15, 16], and in AIDS studies [17, 18, 19, 20, 21, 22, 23]. The SPM models have been used in other types of biomedical applications [24, 25]. There has been much work on using the SPM in joint modeling of survival and longitudinal data focused on AIDS studies, and in particular, jointly modeling of survival data and univariate or multivariate longitudinal CD4 counts. These articles include [26, 27, 28]. Other researchers who have used SPM’s with a multivariate longitudinal response include [29, 30, 31, 32]. An excellent general review on joint modeling of longitudinal and survival data was given in [33]. Ibrahim, Chen, and Sinha ([34], Chapter 7) also gave an overview of joint modeling methods. Joint models for longitudinal and survival data in which the survival component of the model is a cure rate model were considered in [35, 10, 11], where the models focus on cancer clinical trials.

One important issue in the joint modeling of longitudinal and survival data concerns the separate contribution of the model components to the overall goodness-of-fit of the joint model. In this paper, we derive a novel decomposition of the AIC and BIC criteria into additive components that will allow us to assess the goodness of fit for each component of the joint model. More importantly, such a decomposition allows us to develop ΔAIC_Surv and ΔBIC_Surv to quantify the change of AIC and BIC in fitting the survival data with and without the longitudinal data. Thus, ΔAIC_Surv and ΔBIC_Surv can be used to determine the importance of the longitudinal data to the model fit of the survival data. In addition, ΔAIC_Surv and ΔBIC_Surv are also very useful in assessing whether a linear trajectory or quadratic trajectory is more suitable and in facilitating a direct comparison between TM’s and SPM’s. These proposed measures will help the data analyst in not only assessing each component of the joint model but also in determining the contribution of the longitudinal data to the fit of the survival data.

The rest of the paper is organized as follows. A detailed description of the longitudinal and survival data from a clinical trial is given in Section 2. The joint models, the time-varying covariates models, and the two-stage models are presented in Section 3 along with their properties. The proposed decomposition of AIC and BIC is developed in Section 4. An extensive simulation study is conducted in Section 5, and a comprehensive analysis of the longitudinal and survival data described in Section 2 is given in Section 6. We conclude the paper with a discussion including some proposed extensions to our research in Section 7.

2. The EMPHACIS Data

Our research was motivated by the large phase III multicenter, randomized, single-blind, EMPHACIS lung cancer clinical trial (Evaluation of MTA in Mesothelioma in a Phase 3 Study with Cisplatin). Although the details of this study have been published elsewhere [36], we provide the essential background information needed for contextual understanding of our proposed methodology. The study drug was pemetrexed (PEM), a multi-targeted antifolate (MTA), which was given in combination with cisplatin (Cis) (the PEM/Cis arm), and the active-treatment comparator was cisplatin alone (the Cis arm); respectively, 226 and 222 patients received at least one cycle of chemotherapy. The treatment for both arms was structured as six 21-day cycles of therapy; patients receiving treatment benefit could receive additional cycles based on investigator discretion.

Malignant pleural mesothelioma is characterized by rapid disease progression, high symptom burden, and a relatively short median survival of 12 months afiter diagnosis [37, 38]. Accordingly, patient-reported assessments are important for evaluation of disease progression and patients’ response to therapy. In oncology, the patients’ importance ratings on the magnitude of progression-free survival improvement has been shown to depend on the severity of disease-related symptoms [39]. We analyzed the disease-specifc patient-reported Lung Cancer Symptom Scales (LCSS) [40] to evaluate the patient-level association of five of the six instrument items (i.e., anorexia, cough, dyspnea, fatigue, and pain) with progression-free survival using the EMPHACIS trial data. The sixth LCSS symptom, hemoptysis, was not analyzed due to research suggesting that this phenomenon is not prevalent in MPM [41]. The three remaining LCSS items are global constructs (interference, quality of life, symptoms), and due to their non-specifcity, we also excluded these from our analysis. Each questionnaire item was assessed using l00-mm visual analogue scales (0=no symptoms, 100 = worst possible symptoms). There were two measurements at baseline. In our analysis, we took the average of the two baseline measurements of each longitudinal outcome as the baseline outcome and reset the measurement time so that the baseline measurement time is zero. Weekly measurements (at days 8 ± 1, 15 ± 1, l9) were taken in each 21-day therapy cycle. The LCSS was also assessed approximately every 3 months afiter the patient had received his or her last dose of treatment if the patient had not initiated subsequent therapy. Progression free survival time (PFS) is defined as the time from randomization to the time until documented progression or death from any cause. Beyond disease progression, very few LCSS assessments were available.

Previously, researchers have investigated the prognostic effect of baseline PRO outcomes on overall survival in patients with MPM [42]. We are, however, interested in the association between post-baseline PRO scores and PFS. The main goal of applying joint models in this study is to assess the association of each longitudinal LCSS symptom with PFS and the treatment effects on each LCSS item and PFS simultaneously. More importantly, with the novel decomposition of AIC and BIC, the longitudinal LCSS symptoms can be compared in terms of their contribution to the overall fit of the survival data.

Our cohort consists of 425 patients with at least one post-baseline value for each longitudinal outcome. The covariates we consider in this study include race/ethnicity, gender, age, Karnofsky status, baseline stage of disease, vitamin supplementation, and treatment assignment. Table 1 shows the baseline characteristics of the patients in each treatment group and the four descriptive statistics (minimum, median, maximum and mean) for PFS.

Table 1.

Baseline characteristics of patients and summary of PFS in each treatment group

		Pemetrexed/Cisplatin n=208 (%)	Cisplatin n=217 (%)
Covariates	Race: White	189 (91%)	202 (93%)
	Gender: Male	169 (81%)	177 (82%)
	Age: ≥ 65	80 (38%)	83 (38%)
	Karnofsky: High (90-100)	112 (54%)	124 (57%)
	Stage: I/II	48 (23%)	46 (21%)
	Vitamin Supplement: Full	156 (75%)	158 (73%)

PFS (in months)	Minimum	0.4	0
PFS (in months)	Median	6.1	3.6
	Maximum	27.1	21.8
	Mean	7.0	4.8

Open in a new tab

3. The Models

Suppose that there are n subjects. For the ith subject, let Y_i(t) denote the longitudinal outcome, which is observed at time t ∈{a_i1, a_i2,...., a_imi}, where a_i1 = 0 <a_i2 < ... <a_im_i and m_i > 1. Note that Y_i(0) corresponds to the baseline value. Let t_i denote the failure time, which may be right-censored, and let δ_i be the censoring indicator such that δ_i = 1 if t_i is a failure time and 0 if t_i is right-censored for the ith subject. Also let z_i be the treatment indicator such that z_i = 1 for the treatment and z_i = 0 for the control. We further let x_i denote the p-dimensional vector of covariates. We first consider the joint model for (Y_i, t_i), which consists of the longitudinal component and survival component presented in Subsections 3.1 and 3.2. We also consider a time-varying covariates (TVC) model for t_i, where Y_i(t) is treated as a time-varying covariate in Subsection 3.3.

3.1. Longitudinal Component of the Joint Model

For the ith subject, we assume a mixed effects regression model for the longitudinal outcome Y_i(t), which is given by

Y_{i} (a_{i j}) = θ_{i}^{'} g (a_{i j}) + γ_{1} z_{i} + γ_{2}^{'} x_{i} + ∊_{i} (a_{i j}),

(3.1)

where $g (a_{i j}) = {(1, a_{i j}, a_{i j}^{2}, \dots, a_{i j}^{q})}^{'}$ is a polynomial vector of order q for j = 1,..., m_i, θ_i is a (q+1)-dimensional vector of random effects, and γ₂ is a p-dimensional vector of regression coefficients. In (3.1), we further assume θ_i ~ N(θ, Σ), where θ is the (q+l)-dimensional vector of overall effects, Σ is a (q+1)×(q+1) positive definite covariance matrix with lower triangle consisting of {Σ₀₀, Σ₁₀, Σ₁₁,..., Σ_qq}, ε_i(a_ij) ~ N(0, σ²), and θ_i and ε_i(a_ij) are independent. We note that in (3.1), if q = 1, g(a_ij) = (1,a_ij)′ and $θ_{i}^{'} g (a_{i j})$ yields a linear trajectory, and if q = 2, $g (a_{i j}) = {(1, a_{i j}, a_{i j}^{2})}^{'}$ and $θ_{i}^{'} g (a_{i j})$ leads to a quadratic trajectory.

3.2. Survival Component of the Joint Model

For failure time t_i, we assume the hazard function is of the general form

λ (t ∣ λ_{0}, β, α, θ_{i}, g (t), γ, z_{i}, x_{i}) = λ_{0} (t) \exp {h (β, θ_{i}, g (t), γ_{1} z_{i}, γ_{2}^{'} x_{i}) + α_{1} z_{i} + α_{2}^{'} x_{i}},

(3.2)

where λ₀(t) is the baseline hazard function, h(·) is a linear function of θ_i, g(t), γ₁z_i, and $γ_{2}^{'} x_{i}$ with β being a vector of the corresponding regression coefcients, $γ = {(γ_{1}, γ_{2}^{'})}^{'}$ , and $α = {(α_{1}, α_{2}^{'})}^{'}$ . Note that in (3.2), θ_i, g(t), γ₁, and γ₂ are the parameters or the functions from the longitudinal component of the joint model in (3.1), and λ₀, β, α₁ and α₂ are the only parameters pertaining to the survival component. When

h (β, θ_{i}, g (t), γ_{1} z_{i}, γ_{2}^{'} x_{i}) = h^{*} (β, θ_{i}^{'} g (t),, γ_{1} z_{i}, γ_{2}^{'} x_{i}),

(3.3)

where h^*(·) is a linear function of $θ_{i}^{'} g (t)$ , γ₁z_i, and $γ_{2}^{'} x_{i}$ , (3.2) leads to the TM. In this case, the hazard function depends on θ_i and g only through $θ_{i}^{'} g$ . When h does not depend on g(t), that is,

h (β, θ_{i}, g (t), γ_{1} z_{i}, γ_{2}^{'} x_{i}) = h^{*} (β, θ_{i}, γ_{1} z_{i}, γ_{2}^{'} x_{i})

where h^*(·) is a linear function of $θ_{i}^{'}$ , γ₁z_i, and $γ_{2}^{'} x_{i}$ , (3.2) reduces to the SPM.

In (3.2), we further assume a piecewise constant hazard model for λ₀(t). Specifcally, we first construct a finite partition of the time axis, 0 = s₀ < s₁ < s₂ < ... < s_K−1 < s_K = ∞. Thus, we have K intervals (0,s₁], (s₁,s₂], ... , (s_K−1,s_K]. Then, we assume a constant baseline hazard within each of the K intervals, that is,

λ_{0} (t) = λ_{k}, t ∊ (s_{k - 1}, s_{k}] for k = 1, . . ., K .

(3.4)

Finally, we write λ = (λ₁,...,λ_K)′. Using (3.4), the complete-data likelihood function for the survival component for the ith subject can be written as

L (λ, β, α ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, γ) = {[λ (t_{i} ∣ λ_{0}, β, α, θ_{i}, g (t_{i}), γ, z_{i}, x_{i})]}^{δ_{i}} \times \exp {- \int_{0}^{t_{i}} λ (u ∣ λ_{0}, β, α, θ_{i}, g (u), γ, z_{i}, x_{i}) d u},

(3.5)

where λ(t|λ₀, β, α, θ_i, g(t), γ,z_i, x_i) is given in (3.2).

Remark 3.1 In (3.2), when β = 0, $h (β, θ_{i}, g (t), γ_{1} z_{i}, γ_{2}^{'} x_{i}) \equiv 0$ and the hazard function reduces to $λ (t ∣ λ_{0}, β = 0, α, θ_{i}, g (t), γ, z_{i}, x_{i}) = λ_{0} (t) \exp (α_{1} z_{i} + α_{2}^{'} x_{i})$ . In this case, we fit the survival data alone (without the longitudinal data) and the likelihood function in (3.5) for the ith subject reduces to

L_{0} (λ, α ∣ t_{i}, δ_{i}, z_{i}, x_{i}) = {λ_{0} (t_{i}) \exp {(α_{1} z_{i} + α_{2}^{'} x_{i})}^{δ_{i}}} \exp [- \exp (α_{1} z_{i} + α_{2}^{'} x_{i}) {\int_{0}^{t_{i}} λ_{0} (u) d u}] .

(3.6)

3.3. The Time-Varying Covariates (TVC) Model

If t_i is of primary interest, the time-varying covariates model (see, for example, [43, 44]) can be used to model the failure time t_i, in which Y_i(t) can be considered as a time-varying covariate. Under the TVC model, the hazard function is assumed to be

λ (t ∣ λ_{0}, β, α, z_{i}, x_{i}, Y_{i} (t)) = λ_{0} (t) \exp {β Y_{i} (t) + α_{1} z_{i} + α_{2}^{'} x_{i}},

(3.7)

where λ₀(t) is the baseline hazard function. Since the longitudinal outcome Y_i(t) is observed only at each of a_i1, ... , a_{im_i}, we let Y_i(t) = Y_i(a_ij) for a_ij ≤ t <a_i,j+1 for j = 1, ... , m_i, where a_i,mi+1 = ∞. Similar to (3.2), a piecewise constant hazard model in (3.4) is assumed for λ₀(t) in (3.7). Finally, we notice that in the TVC model (3.7), Y_i(t) is a one-dimensional covariate and therefore, β is one-dimensional as well.

3.4. The Two-Stage (TS) Model

Instead of directly using the longitudinal outcome Y_i(t) as a covariate in (3.7), (i) we first fit (3.1) to the longitudinal data alone, obtain the estimates of θ_i, γ₁, and γ₂, denoted by ${\hat{θ}}_{i}$ , ${\hat{γ}}_{1}$ and ${\hat{γ}}_{2}$ ; and compute ${\hat{Y}}_{i} (t) = {\hat{θ}}_{i}^{'} g (t) + {\hat{γ}}_{1} z_{i} + {\hat{γ}}_{2}^{'} x_{i}$ ; and (ii) we then use ${\hat{Y}}_{i} (t)$ as a time-varying covariate in the survival model, in which the hazard function is defined as $λ (t ∣ λ_{0}, β, α, z_{i}, x_{i}, {\hat{Y}}_{i} (t)) = λ_{0} (t) \exp {β {\hat{Y}}_{i} (t) + α_{1} z_{i} + α_{2}^{'} x_{i}}$ . At first, it appears that the above hazard function is similar to (3.7). However, there is a substantial difference between Y_i(t) and ${\hat{Y}}_{i} (t)$ . The longitudinal outcome Y_i(t) is observed only at each of the time points a_i1, ..., a_im_i while ${\hat{Y}}_{i} (t)$ is defined at any time t. In addition, ${\hat{Y}}_{i} (t)$ is much less variable than Y_i(t) since ${\hat{Y}}_{i} (t)$ is a smooth function of t and Y_i(t) is random. The model defined here is known as the two-stage (TS) model [33].

4. Assessing the Contribution of Longitudinal Data When Modeling the Survival Data

For the joint model discussed in the previous section, we develop a new method to assess the contribution of the longitudinal data when fitting the survival data. We first introduce some notation. We rewrite (3.1) as follows: $Y_{i} = W_{i} {(θ_{i}^{'}, γ^{'})}^{'} + ∊_{i}$ where Y _i =(Y_i(a_i1),...,Y_i(a_{im_i}))' , W_i is a m_i by (p + q + 2) matrix whose ith row is $(1, a_{i j}, \dots, a_{i j}^{q}, z_{i}, x_{i}^{'})$ , and ε_i =(ε_i(a_i1),...,ε_i(a_{im_i}))’ ~ N(0,σ²I_{m_i}). The complete-data likelihood function of the longitudinal outcomes for the ith subject is given by

L (γ, σ^{2} ∣ Y_{i}, W_{i}, θ_{i}) \propto σ^{- m_{i}} \exp {- \frac{1}{2 σ^{2}} {(Y_{i} - W_{i} {(θ_{i}^{'}, γ^{'})}^{'})}^{'} (Y_{i} - W_{i} {({θ^{'}}_{i}, γ^{'})}^{'})},

(4.1)

for i = 1,...,n. Note that the density of θ_i is given by

f (θ_{i} ∣ θ, \sum) = \frac{∣ \sum ∣^{- \frac{1}{2}}}{{(2 π)}^{\frac{q + 1}{2}}} \exp {- \frac{1}{2} {(θ_{i} - θ)}^{'} \sum^{- 1} (θ_{i} - θ)} .

(4.2)

Let φ = (λ, β, α, γ,σ², θ, Σ). Using (3.5), (4.1), and (4.2), the observed-data likelihood function for (Y_i,t_i,δ_i) for the ith subject is given by

L (φ ∣ Y_{i}, t_{i}, δ_{i}, z_{i}, x_{i}, W_{i}) = \int L (λ, β, α ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, γ) L (γ, σ^{2} ∣ Y_{i}, W_{i}, θ_{i}) f (θ_{i} ∣ θ, \sum) d θ_{i},

(4.3)

for i = 1,..., n. Let $\hat{φ}$ denote the maximum likelihood estimate (MLE) of φ from the joint model. Using (4.3), the Akaike Information Criterion (AIC) [45] for the joint model is given by

AIC = - 2 \sum_{i = 1}^{n} \log L (\hat{φ} ∣ Y_{i}, t_{i}, δ_{i}, z_{i}, x_{i}, W_{i}) + 2 \dim (φ)

(4.4)

and the Bayesian Information Criterion (BIC) [46] is defined as

BIC = - 2 \sum_{i = 1}^{n} \log L (\hat{φ} ∣ Y_{i}, t_{i}, δ_{i}, z_{i}, x_{i}, W_{i}) + \dim (φ) \log n .

(4.5)

4.1. AIC and BIC Decomposition

To assess the contribution of longitudinal data to the fit of the survival data, we need to decompose AIC in (4.4) into two parts: one part for the longitudinal data and the other part for the survival data conditional on the longitudinal data. Write φ₁ =(γ,σ², θ, Σ) and φ₂ =(λ, β, α). We are led to the following theorem.

Theorem 4.1 Let f(θ_i|Y _i,W_i, φ₁) be the conditional density of the random effects θ_i given Y _i, and also let $L (φ_{1} ∣ Y_{i}, W_{i}) = \int L (γ, σ^{2} ∣ Y_{i}, W_{i}, θ_{i}) f (θ_{i} ∣ θ, Σ) d θ_{i}$ , which is the likelihood function corresponding to the marginal distribution of Y_i. Then AIC in (4.4) has the following decomposition:

AIC = {AIC}_{Long} + {AIC}_{Surv ∣ Long},

(4.6)

where ${AIC}_{Long} = - 2 \sum_{i = 1}^{n} \log L ({\hat{φ}}_{1} ∣ Y_{i}, W_{i}) + 2 \dim (φ_{1})$ , ${AIC}_{Surv ∣ Long} = - 2 \sum_{i = 1}^{n} \log \int L ({\hat{φ}}_{2} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, \hat{γ}) f (θ_{i} ∣ Y_{i}, W_{i}, {\hat{φ}}_{1}) d θ_{i} + 2 \dim (φ_{2})$ , and ${\hat{φ}}_{1}$ and ${\hat{φ}}_{2}$ are the MLEs of $φ_{1}$ and $φ_{2}$ .

The proof of Theorem 4.1 is given in the Appendix. BIC in (4.5) has a similar decomposition as in (4.6); this result is stated in the following corollary, and the proof of this corollary directly follows that of Theorem 4.1.

Corollary 4.1 BIC in (4.5) can be decomposed as BIC = BIC_Long + BIC_Surv|Long, where BIC_Long = AIC_Long + dim(φ₁)(log n − 2), and BIC_Surv|Long = AIC_Surv|Long + dim(φ₂)(log n − 2).

Remark 4.1 We note that the AIC decomposition in (4.6) and the BIC decomposition in Corollary 4.1 hold for general longitudinal data models, which may not be normal. However, for normally distributed longitudinal data, f(θ_i|Y_i,W_i, φ₁) and L(φ₁|Y_i,W_i) in Theorem 4.1 are available in closed form. It is easy to see that

E (Y_{i}) = E [E (Y_{i} ∣ θ_{i})] = W_{i} {(θ^{'}, γ^{'})}^{'},

and

V a r (Y_{i}) = E [V a r (Y_{i} ∣ θ_{i})] + V a r [E (Y_{i} ∣ θ_{i})] = σ^{2} I_{m i} + W_{i} (\begin{matrix} Σ & 0 \\ 0 & 0 \end{matrix}) {W^{'}}_{i} .

Thus, the observed-data likelihood function of the longitudinal data takes the form:

L (φ_{1} ∣ Y_{i}, W_{i}) = \frac{{∣ σ^{2} I_{m i} + W_{i} (\begin{matrix} Σ & 0 \\ 0 & 0 \end{matrix}) W_{i}^{'} ∣}^{- \frac{1}{2}}}{(2 π) \frac{m_{i}}{2}} \exp {- \frac{1}{2} {(Y_{i} - W_{i} {(θ^{'}, γ^{'})}^{'})}^{'} \times {(σ^{2} I_{m i} + W_{i} (\begin{matrix} Σ & 0 \\ 0 & 0 \end{matrix}) W_{i}^{'})}^{- 1} (Y_{i} - W_{i} {(θ^{'}, γ^{'})}^{'})} .

(4.7)

After some algebra, we also obtain the conditional distribution of the random effects θ_i given the longitudinal data, which is given by

θ_{i} ∣ Y_{i}, W_{i}, φ_{1} ~ N ({(Σ^{- 1} + \frac{1}{σ^{2}} (I_{q + 1} 0) W_{i}^{'} W_{i} (\begin{matrix} I_{q + 1} \\ 0 \end{matrix}))}^{- 1} [\frac{1}{σ^{2}} (I_{q + 1} 0) {W_{i}}^{'} (Y_{i} - W_{i} (\begin{matrix} 0 \\ I_{p + 1} \end{matrix}) γ) + Σ^{- 1} θ] {(Σ^{- 1} + \frac{1}{σ^{2}} (I_{q + 1} 0) {W_{i}}^{'} W_{i} (\begin{matrix} I_{q + 1} \\ 0 \end{matrix}))}^{- 1}) .

Remark 4.2 SPM and TM discussed in Section 3 can be implemented in SAS via PROC NLMIXED. The NLMIXED procedure calculates ${\hat{φ}}_{1}$ , ${\hat{φ}}_{2}$ , and the overall AIC using adaptive Gaussian quadrature to approximate (4.3). For normally distributed longitudinal data, AIC_Long can be computed using (4.7) and ${\hat{φ}}_{1}$ , which may be implemented in SAS via PROC IML. Subsequently, AIC–AIC_Long gives AIC_Surv|Long. Alternatively, given ${\hat{φ}}_{1}$ and ${\hat{φ}}_{2}$ , we may use a Monte Carlo (MC) method to compute AIC_Surv|Long using (4.6) and an MC sample generated from $f (θ_{i} ∣ Y_{i}, W_{i}, {\hat{φ}}_{1})$ . This alternative approach can be used to validate the total AIC obtained from PROC NLMIXED.

4.2. ΔAIC_Surv and ΔBIC_Surv

AIC_Long (BIC_Long) measures the contribution to the total AIC (BIC) due to the longitudinal data while AIC_Surv|Long (BIC_Surv|Long) quantifes the contribution to the total AIC (BIC) due to the survival data with the additional information from the longitudinal data. Let

\begin{matrix} {AIC}_{Surv, 0} & = - 2 Σ_{i = 1}^{n} \log L_{0} (\hat{λ}, \hat{α} ∣ t_{i}, δ_{i}, z_{i}, x_{i}) + 2 \dim (λ, α), \\ {BIC}_{Surv, 0} & = - 2 Σ_{i = 1}^{n} \log L_{0} (\hat{λ}, \hat{α} ∣ t_{i}, δ_{i}, z_{i}, x_{i}) + \dim (λ, α) \log n, \end{matrix}

(4.8)

where L₀(λ, α|t_i,δ_i,z_i, x_i) is defined by (3.6). We now propose the following two model assessment criteria:

\begin{matrix} Δ {AIC}_{Surv} & = {AIC}_{Surv, 0} - {AIC}_{Surv ∣ Long}, \\ Δ {BIC}_{Surv} = {BIC}_{Surv, 0} - {BIC}_{Surv ∣ Long} . \end{matrix}

(4.9)

Both ΔAIC_Surv and ΔBIC_Surv measure the gain of the fit in the survival component due to the longitudinal data with a penalty for the additional parameters in the survival component of the joint model. The model with a large value of ΔAIC_Surv (ΔBIC_Surv) is more preferred. To address an important practical issue of how ΔAIC_Surv and ΔBIC_Surv are related to the magnitude of the longitudinal outcomes, we establish a useful result, which is formally stated in the following theorem.

Theorem 4.2 The criteria ΔAIC_Surv and ΔBIC_Surv are invariant to location and scale transformations of the longitudinal outcomes. Specifcally, consider a linear transformation: ${Y_{i}}^{*} (a_{i j}) = \frac{Y_{i} (a_{i j}) - b}{c}$ for j =1,...,m_i and i = 1,...,n, where b and c > 0 are two known constants. The resulting criteria corresponding to the transformed longitudinal outcomes $Y_{i}^{*} (a_{i j})$ 's are denoted by ΔAIC_Surv(b, c) and ΔBIC_Surv(b, c). Then, we have ΔAIC_Surv(b, c)= ΔAIC_Surv and ΔBIC_Surv(b, c) = ΔBIC_Surv for all −∞ <b< ∞ and c > 0.

The proof of Theorem 4.2 is given in the Appendix. We note that if $b = \overset{‒}{Y}$ and c = S, where $\overset{‒}{Y} = \frac{\sum_{i = 1}^{n} \sum_{j = 1}^{m_{i}} Y_{i} (a_{i j})}{\sum_{i = 1}^{n} m_{i}}$ and $S^{2} = \frac{\sum_{i = 1}^{n} \sum_{j = 1}^{m_{i}} {(Y_{i} (a_{i j}) - \overset{‒}{Y})}^{2}}{\sum_{i = 1}^{n} m_{i} - 1}$ , then the $Y_{i}^{*} (a_{i j})$ 's are the standardized longitudinal outcomes. This linear transformation invariant property of ΔAIC_Surv and ΔBIC_Surv allows us to standardize the longitudinal outcomes to improve numerical stability in fitting the joint model of the longitudinal and survival data as well as computing ΔAIC_Surv and ΔBIC_Surv using existing statistical sofitware such as SAS.

5. Simulation Studies

In this section, we conduct extensive simulation studies to examine the empirical performance of ΔAIC_Surv and ΔBIC_Surv in model comparison as well as in the determination of the contribution of the longitudinal data to the goodness-of-fit of the survival model. In the simulation studies, we consider four types of models, namely, SPM, TM, TS and TVC models. Although the definitions of ΔAIC_Surv and ΔBIC_Surv are based on the joint model, they can be extended to the TS and TVC models as well. Specifcally, for the TVC model, AIC is given by

{AIC}_{Surv ∣} Y = - 2 \sum_{i = 1}^{n} \log L (\hat{λ}, \hat{β}, \hat{α} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, Y_{i}) + 2 \dim (λ, β, α),

(5.1)

where $L (λ, β, α ∣ t_{i}, δ_{i}, z_{i}, x_{i}, Y_{i}) = {[λ (t_{i} ∣ λ_{0}, β, α, z_{i}, x_{i}, Y_{i} (t_{i}))]}^{δ_{i}} \exp {- \int_{0}^{t_{i}} λ (u ∣ λ_{0}, β, α, z_{i}, x_{i}, Y_{i} (u)) d u}$ and λ(t|λ₀, β, α,z_i, x_i, Y_i(t)) is given in (3.7). We define ΔAIC_Surv as follows: ΔAIC = AIC_Surv,0 − AIC_Surv|Y, where AIC_Surv,0 is given by (4.8). ΔBIC_Surv can be defined in a similar fashion. Replacing Y by $\hat{Y}$ in (5.1), ΔAIC_Surv and ΔBIC_Surv can be defined for the TS model.

Three simulation studies are considered (i) to examine the performance of ΔAIC_Surv and ΔBIC_Surv in selecting the true model (Simulations I and II) and determining the true longitudinal outcome that is most related to the survival model (Simulation III); (ii) to investigate the empirical properties of the maximum likelihood estimates of the parameters in the joint model (Simulations I and II); and (iii) to test the robustness of the computational procedure to the dimension of the model parameters (Simulation II). In all three simulation studies, we independently generate 500 simulated datasets, and in each dataset there are n = 400 subjects. The treatment indicator z_i is generated from a Bernoulli(0.5) distribution. The time points a_ij’s at which the longitudinal outcomes are taken are fxed at (0, 21, 42, 63, 84, 105, 126)/30.4375, where 30.4375 is the average number of days in each month and is obtained by 365.25/12. Other data generation details are given as follows.

Simulation I: The true model is SPM with linear trajectory denoted by SPML. Specifcally, the longitudinal data is simulated from a N(μ_i(a_ij),σ²) distribution, where μ_i(a_ij) = θ0_i + θ_1iaij + γz_i, and t_i^* is generated from [−λexp{β₁(θ_0i + γz_i)+ β₂θ_1i + αz_i}]⁻¹log(1 − U), where U ~ U(0, 1). The design values of the parameters are given in Table 2. The censoring time C_i is generated from an exponential distribution with mean 100. The right censoring percentage is roughly 8% which mimics the real data analysis. The failure time and censoring indicator are calculated as $t_{i} = \min {t_{i}^{*}, C_{i}}$ δ_i and $t_{i}^{*} \leq C_{i}$ and 0 otherwise.

Table 2.

Parameter estimates of SPML and SPMQ in Simulations I and II

Simulation	Param.	True	EST	SE	SD	RMSE	CP
I	α	−0.4	−0.399	0.105	0.104	0.104	0.956
	γ	0.05	0.050	0.087	0.088	0.088	0.950
	β ₁	0.3	0.301	0.067	0.066	0.066	0.960
	β ₂	1.2	1.211	0.246	0.234	0.234	0.958
	σ	0.5	0.500	0.008	0.008	0.008	0.954
	logλ	−1.7	−1.700	0.076	0.076	0.076	0.964
	θ ₀	−0.01	−0.007	0.063	0.063	0.063	0.944
	θ ₁	0.08	0.079	0.014	0.014	0.014	0.934
	Σ ₀₀	0.7	0.698	0.058	0.058	0.058	0.936
	Σ ₁₀	−0.03	−0.030	0.013	0.013	0.013	0.944
	Σ ₁₁	0.06	0.059	0.006	0.006	0.006	0.930

II	α	−0.4	−0.411	0.113	0.116	0.116	0.946
	γ	0.03	0.033	0.088	0.086	0.086	0.956
	β ₁	0.3	0.307	0.075	0.076	0.077	0.946
	β ₂	1	0.994	0.142	0.146	0.146	0.938
	β ₃	5	4.989	0.209	0.211	0.211	0.942
	σ	0.5	0.499	0.009	0.009	0.009	0.952
	logλ	−1.7	−1.690	0.082	0.081	0.082	0.942
	θ ₀	−0.02	−0.022	0.065	0.067	0.067	0.936
	θ ₁	0.1	0.097	0.037	0.038	0.038	0.950
	θ ₂	−0.1	0.099	0.017	0.017	0.017	0.944
	Σ ₀₀	0.7	0.700	0.063	0.062	0.062	0.954
	Σ ₁₀	−0.08	−0.080	0.037	0.038	0.038	0.952
	Σ ₁₁	0.3	0.298	0.039	0.038	0.038	0.968
	Σ ₂₀	0.01	0.009	0.016	0.016	0.016	0.942
	Σ ₂₁	−0.05	−0.049	0.014	0.013	0.013	0.946
	Σ ₂₂	0.1	0.099	0.008	0.008	0.008	0.936

Open in a new tab

Simulation II: The true model is SPM with quadratic trajectory denoted by SPMQ. The data generation process follows the same steps as in Simulation I. The design values of the parameters are shown in Table 2.

Simulation III: The same setting as in Simulation I is used to generate the longitudinal data and survival times under the true model SPML. This dataset is denoted by D_Long. We also generate three additional sets of longitudinal data, which are associated with the one generated in Simulation I. These additional longitudinal trajectories are simulated from $μ_{l i} (a_{i j}) = (θ_{0 i} + τ_{l 0 i}) + {(θ_{1 i} + τ_{l 1 i})}_{a i j} + γ z_{i}$ , where ${(τ_{l 0 i}, τ_{l 1 i})}^{'} ~ N (0, {σ_{l}}^{2} I_{2})$ , and then the longitudinal data are generated from N(μ_li(a_ij), 0.5²) for l =1, 2, 3, where σ1 = 0.1, σ2 = 0.5, and σ3 = 1. These three sets of longitudinal data each are coupled with the same survival times as in D_Long to form four additional datasets. These resulting datasets are denoted by D_Long1,...,D_Long3.

In Simulation I, we fit SPML, TML (TM with linear trajectory and $h^{*} (β, θ_{i}^{'} g (t), γ_{1} z_{i}, γ_{2}^{'} x_{i}) = β (θ_{i}^{'} g (t) + γ_{1} z_{i})$ in (3.3)), the TS model with linear trajectory, and the TVC model (all with K = 1, where K is defined in (3.4)) to each simulated dataset. In Simulation II, we fit SPML, SPMQ, TML, TMQ (TM with quadratic trajectory), the TS model, and the TVC model to each simulated dataset. In Simulation III, we fit SPML to each of the datasets D_Long, D_Long1, D_Long2, D_Long3 and the corresponding results are labeled as Long, Longl, Long2, and Long3.

In both Simulations I and II, we compute the estimates of the parameters under the true models. Let η denote a parameter in the true model. Also let ${\hat{η}}_{b}$ and $se ({\hat{η}}_{b})$ be the MLEs of η and the standard error of ${\hat{η}}_{b}$ from the b^th simulated dataset for b =1, 2,..., 500. We define the simulation estimate (EST) and the standard error (SE) to be $\overset{‒}{η} = \frac{1}{500} \sum_{i = 1}^{500} {\hat{η}}_{b}$ and $\frac{1}{500} \sum_{i = 1}^{500} se ({\hat{η}}_{b})$ . We also define the simulation standard deviation (SD) and the root of the mean squared error (RMSE) as ${[\frac{1}{500 - 1} \sum_{b = 1}^{500} {({\hat{η}}_{b} - \overset{‒}{η})}^{2}]}^{1 ∕ 2}$ and ${[\frac{1}{500} \sum_{b = 1}^{500} {({\hat{η}}_{b} - η^{*})}^{2}]}^{1 ∕ 2}$ , where η^*is the true value of η. Finally, we define the coverage probability (CP) of the 95% confidence intervals as $\frac{1}{500} \sum_{b = 1}^{500} 1 {L_{b} < η^{*} < U_{b}}$ , where Lb and Ub are the 95% lower and upper limits in the b^th simulation. The ESTs, SEs, SDs, RMSEs, and CPs for the parameters in SPML and SPMQ are reported in Table 2. From this table, we see that for all parameters, the ESTs are very close to the corresponding true values, the SEs are very close to the SDs, and the CPs are always around 95% under both SPML and SPMQ.

Suppose we compare a total of J candidate models. Let ΔAIC_Surv,jb and ΔBIC_Surv,jb denote the values of ΔAIC_Surv and ΔBIC_Surv from the b^th simulated dataset. Then, the frequency of ranking Model j as the best according to ΔAIC_Surv criterion is defined as $\sum_{b = 1}^{500} 1 {Δ {AIC}_{Surv, j b} w Δ {AIC}_{Surv, l b} for all l \neq j}$ A similar frequency can be defined for the ΔBICSurv criterion or the other criteria. If Model l is the true model, the average misspecification rate according to ΔAIC_Surv is given by $1 - \frac{1}{500} \sum_{b = 1}^{500} 1 {Δ {AIC}_{Surv, 1 b} > Δ {AIC}_{Surv, l b} for all l \geq 2}$ .

Table 3 shows the means of ΔAIC_Surv and ΔBIC_Surv as well as the frequencies of ranking each model as best based on ΔAIC_Surv and ΔBIC_Surv for 500 simulated datasets for all three simulations. In Simulations I and III, the true model is SPML with K = 1 while the true model is SPMQ with K = 1 in Simulation II. In all three simulations, the true model always has the largest mean of either ΔAIC_Surv or ΔBIC_Surv and the highest frequency of ranking the true model as best based on either ΔAIC_Surv or ΔBIC_Surv. The average misspecification rates according to ΔAIC_Surv and ΔBIC_Surv in Simulation I, II, and III are 0.24 and 0.538, 0.058 and 0.272, 0.136 and 0.136, respectively. We also see from Table 3 that the differences in the means of ΔAIC_Surv between SPML and TML or SPMQ and TMQ are greater than the differences in the means of ΔBIC_Surv. These results are expected since TML and TMQ have fewer regression coefcients than SPML and SPMQ, and BIC penalizes the dimension of the parameters more than AIC. Similar results are observed based on the frequency of ranking each model as best. Thus, the performance of ΔAIC_Surv is slightly better than ΔBIC_Surv in correctly identifying the true model. It is interesting to note that although neither TM nor the TS model is the true model, TM outperforms the TS model in both Simulations I and II. In Simulations I and II, the TVC model fits the data the most poorly based on both ΔAIC_Surv and ΔBIC_Surv.

Table 3.

Mean of ΔAIC_Surv (ΔBIC_Surv) and frequency of ranking each model as best based on ΔAIC_Surv (ΔBIC_Surv)

		ΔAIC_Surv		ΔBIC_Surv
Simulation	Model	Mean	Frequency	Mean	Frequency
I	SPML	42.55	380	34.57	231
	TML	38.38	76	34.39	181
	TS	38.16	32	34.17	70
	TVC	30.70	12	26.71	18

II	SPML	588.66	29	580.68	136
	SPMQ	596.68	471	584.70	364
	TML	419.99	0	416.00	0
	TMQ	419.52	0	415.53	0
	TS	418.56	0	414.57	0
	TVC	399.51	0	395.52	0

III	Long	42.55	432
	Long1	38.37	67
	Long2	15.64	1
	Long3	6.46	0

Open in a new tab

In Simulation III, the true longitudinal outcome is Long, and Longl is obtained by adding random errors to both the random intercept and slope of the linear trajectory in Long for l = 1,..., 3. Longl to Long3 become gradually further apart from Long since the standard deviation of the random errors increases from 0.1 to l. Since we fit the same model to each of these five datasets, the differences between ΔAIC_Surv and ΔBIC_Surv are the same for Long, Longl, ... , Long3. Thus, only the results based on ΔAIC_Surv are reported in Table 3. We see from this table that Long has the largest mean of ΔAIC_Surv and the highest frequency of ranking Long as best, and the mean and frequency corresponding to Longl decrease as l increases. The results for Long and Long1 are very close, which is expected since Longl is obtained by adding a very small amount of noise to Long.

In addition, Figure 1 shows the boxplots of the ΔAIC_Surv and ΔBIC_Surv differences between the true model and the competing models, respectively. From Figure 1 (a), (b), and (c), we see that most of these boxes are above zero, which indicates that the true model does fit the data much better than the competing models based on ΔAIC_Surv. All boxes for ΔBIC_Surv differences in Figure 1 (e) are also above O. However, the medians of the ΔBIC differences between SPML and TML or between SPML and TS shown in Figure 1 (d) are very close to O. These results are consistent with those based on the means of ΔAIC_Surv and ΔBIC_Surv and the frequencies of ranking each model as best.

Boxplots of the ΔAIC_Surv differences ((a), (b), and (c)) and the ΔBIC_Surv differences ((d) and (e)) between the true and competing models.

6. Analysis of the EMPHACIS Data

In this section, we carry out a detailed analysis of the EMPHACIS data using the models discussed in Section 3 and the ΔAIC_Surv and ΔBIC_Surv criteria proposed in Section 4. As stated in Section 2, data from n = 425 patients are used, and the longitudinal and survival data we consider are one of five patient-reported LCSS outcomes corresponding to anorexia, cough, dyspnea, fatigue, and pain along with progression free survival time in months. The treatment indicator z_i = 1 if the ith patient received pemetrexed/cisplatin and z_i = 0 if the ith patient received cisplatin alone, and the covariates (Table l) include race (x_i1), gender (x_i2), age (x_i3), Karnofsky status (x_i4), baseline stage of disease (x_i5), and vitamin supplementation (x_i6). All six covariates (p = 6) are binary, each taking a value of 0 or 1. Specifcally, x_i1 = 1 if white, x_i2 = 1 if male, x_i3 = 1 if age ≥ 65, x_i4 = 1 if Karnofsky status is high, x_i5 = 1 if stage I/II, and x_i6 = 1 if full vitamin supplementation. In all calculations, we standardized all five patient-reported LCSS outcomes. The LCSS original-scaled item means (standard deviations) were 30.79 (27.19), 11.48 (17.93), 31.41 (26.33), 39.38 (27.06), and 24.64 (24.90) for anorexia, cough, dyspnea, fatigue, and pain, respectively. The total numbers of completed longitudinal assessments (i.e., $\sum_{i = 1}^{n} m_{i}$ ) including the baseline measurements were 5504, 5544, 5553, 5530, and 5546 for anorexia, cough, dyspnea, fatigue, and pain.

We fit the SPML, SPMQ, TML, TMQ, TS and TVC models, where SPML, SPMQ, TML, and TMQ are defined in Section 5, to the PFS data paired with one of the five LCSS longitudinal outcomes corresponding to anorexia, cough, dyspnea, fatigue, and pain. As suggested by an anonymous referee, we also considered the joint model with a quadratic trajectory in the longitudinal component and only a linear trajectory in the the survival component, where the models corresponding to SPMQ and TMQ are denoted by SPMQL and TMQL. The six covariates (x_i’s) and the treatment indicator were included in all the models we estimated. As shown in Table 1, the maximum values of PFS were 27.1 and 21.8 months for the pemetrexed/cisplatin arm and the cisplatin alone arm, respectively. For all the models, we used the piecewise constant hazard model given in (3.4) for the baseline hazard, and the partition intervals were constructed based on the percentiles such as the first (Q₁), second (Q₂), and third (Q₃) quartiles of the PFS times.

We used the ΔAIC_Surv and ΔBIC_Surv criteria as well as the AIC_Surv|Long and BIC_Surv|Long criteria to determine the number of intervals (K) in (3.4). We first fit the PFS data alone using (3.6). The values of AIC_Surv,0 and BIC_Surv,0 defined by (4.8) were 2225.80 and 2258.22 for K = 1, 2206.29 and 2242.76 for K = 2, and 2209.71 and 2254.28 for K = 4. We considered two methods for constructing the three intervals: inserting two intervals within the first interval or the second interval using the piecewise constant hazard model with K = 2. The resulting piecewise constant hazard models with (s₀ = 0, s₁ = Q₁, s₂ = Q₂) and (s₀ = 0, s₁ = Q₂, s₂ = Q₃) are denoted by K = 3⁽¹⁾ and K = 3⁽²⁾. Similarly, we constructed the partitions for K > 3. This approach is desirable when more events occur early in the follow-up. Another advantage of this approach is that the resulting partitions are nested and, hence, the log-likelihood of the joint model increases in K when the longitudinal component remains fxed. The values of AIC_Surv,0 and BIC_Surv,0 were 2208.27 and 2248.79 for K = 3⁽¹⁾ and 2207.73 and 2248.25 for K = 3⁽²⁾. These results indicate that when we fit the PFS data alone, the piecewise constant hazard model with K = 2 fits best according to both AIC_Surv,0 and BIC_Surv,0. We then fit the PFS data and the LCSS longitudinal data jointly. For ease of presentation, we discuss the case for the longitudinal outcomes corresponding to pain only since the results were similar for anorexia, cough, dyspnea, and fatigue. Figure 2 shows the MLEs of λ and the values of AIC_Surv|Long and BIC_Surv|Long for K = 1, 2, 3, and 4 under SPML. From Figure 2, we see that AIC_Surv|Long and BIC_Surv|Long were 2199.30 and 2239.82 for K = 1; 2161.84 and 2206.42 for K = 2; 2l63.01 and 2211.64 for K =3⁽¹⁾; 2163.84 and 2212.47 for K = 3⁽²⁾; and 2164.98 and 2217.66 for K = 4, respectively. Clearly, the best values of AIC_Surv|Long and BIC_Surv|Long were obtained under SPML with K = 2. Thus, according to AIC_Surv|Long and BIC_Surv|Long, SPML with K = 2 fits the PFS data the best. We also see from Figure 2 that for K = 2, λ₂ =0.276 is much larger than λ1 = 0.138, indicating that the exponential model (i.e., K = 1) did not fit the PFS data well. All of the above results suggest that it is sufcient to choose K = 2 in fitting the PFS data. We note that the issue of interval choice has also been discussed in [47], [48], and [49].

The diagrams of the MLEs of λ, AIC_Surv|Long and BIC_Surv|Long with various values of K under SPML for pain.

Table 4 shows AIC, AIC_Long, ΔAIC_Surv, BIC, BIC_Long, and ΔBIC_Surv for SPML, SPMQ, SPMQL, TML, TMQ, and TMQL and ΔAIC_Surv and ΔBIC_Surv for the TVC and TS models. The ΔAIC_Surv’s and ΔBIC_Surv’s are plotted in Figure 3. We see from Table 4 that pain had the largest values of ΔAIC_Surv and ΔBIC_Surv under SPML, SPMQ, SPMQL, TML, TMQ, TVC, and TS; fatigue had the largest values of ΔAIC_Surv and ΔBIC_Surv under TMQL and the second largest values of ΔAIC_Surv and ΔBIC_Surv under the other seven models; and cough had the smallest values of ΔAIC_Surv and ΔBIC_Surv. These results indicate that pain led to the most gain in fitting the PFS data while cough had the least contribution to the fit of the PFS data. However, AIC and BIC were not able to determine the contribution of the longitudinal data in fitting the survival data for these five sets of LCSS longitudinal outcomes under the joint modeling framework. We observe from Table 4 that the smallest values of AIC and BIC were attained by dyspnea under SPML, SPMQ, TML, and TMQ. Afiter examining AIC Long and BICLong, we found that dyspnea had the smallest values of AICLong and BICLong. Thus, AIC_Long and BIC_Long were the main contributions to the smallest values of AIC and BIC for dyspnea. From Table 4, we also see that (i) the values of ΔAIC_Surv under SPMQ are greater than those under SPMQL for anorexia, cough, dyspnea, and pain while the value of ΔAIC_Surv under SPMQ is very similar to the one under SPMQL for fatigue; and (ii) the values of ΔBIC_Surv under SPMQL are greater than those under SPMQ for anorexia, cough, dyspnea, and fatigue due to the extra parameter in the survival component under SPMQ. However, the values of ΔAIC_Surv and ΔBIC_Surv under TMQ are consistently higher than those under TMQL since under the TM, both TMQ and TMQL share the same number of parameters in the survival component. We note that on the one hand, ΔAIC_Surv and ΔBIC_Surv are defined primarily based on the likelihood function of the survival data; on the other hand, AIC and BIC are constructed using the likelihood function of both the longitudinal and survival data. Thus, since the total number of longitudinal outcomes were different among these five symptoms, AIC and BIC were indeed not comparable among them. Within each of these five symptoms, AIC and BIC selected SPMQ over SPML and TMQ over TML, due to the fact that AIC _Long and BIC _Long were in favor of quadratic trajectories over linear trajectories. These results indicate that the quadratic trajectories fit the longitudinal data better.

Table 4.

AICs and BICs for models with K = 2

Model		Anorexia	Cough	Dyspnea	Fatigue	Pain
SPML	AIC	14205.13	14451.48	12101.25	13184.26	13030.39
	AIC_Long	12017.95	12248.34	9912.26	11005.53	10868.55
	ΔAIC_Surv	19.11	3.15	17.30	27.56	44.46

	BIC	14302.38	14548.73	12198.50	13281.51	13127.64
	BIC_Long	12070.63	12301.02	9964.94	11058.20	10921.23
	ΔBIC_Surv	11.01	−4.95	9.20	19.45	36.35

SPMQ	AIC	14123.05	14250.06	11908.13	13058.18	12778.41
	AIC_Long	11933.53	12046.68	9714.28	10873.93	10610.07
	ΔAIC_Surv	16.77	2.91	12.44	22.04	37.95

	BIC	14240.57	14367.57	12025.64	13175.69	12895.92
	BIC_Long	12002.42	12115.56	9783.17	10942.82	10678.96
	ΔBIC_Surv	4.62	−9.25	0.29	9.89	25.80

SPMQL	AIC	14124.36	14251.34	11911.54	13057.48	12788.89
	AIC_Long	11933.24	12046.45	9713.94	10873.82	10609.26
	ΔAIC_Surv	15.17	1.40	8.70	22.63	26.67

	BIC	14237.82	14364.80	12025.00	13170.94	12902.35
	BIC_Long	12002.12	12115.34	9782.83	10942.70	10678.15
	ΔBIC_Surv	7.06	−6.70	0.60	14.53	18.56

TML	AIC	14204.92	14449.26	12106.26	13185.81	13038.62
	AIC_Long	12017.75	12248.29	9911.77	11005.07	10867.68
	ΔAIC_Surv	19.12	5.32	11.81	25.56	35.35

	BIC	14298.12	14542.46	12199.45	13279.01	13131.82
	BIC_Long	12070.43	12300.97	9964.45	11057.75	10920.36
	ΔBIC_Surv	15.07	1.27	7.76	21.50	31.30

TMQ	AIC	14118.40	14244.30	11904.94	13049.48	12773.67
	AIC_Long	11933.39	12046.50	9713.94	10873.95	10609.29
	ΔAIC_Surv	21.29	8.50	15.29	30.77	41.92

	BIC	14227.80	14353.70	12014.34	13158.89	12883.07
	BIC_Long	12002.27	12115.39	9782.82	10942.84	10678.17
	ΔBIC_Surv	17.23	4.45	11.24	26.72	37.86

TMQL	AIC	14128.97	14254.40	11916.77	13067.97	12805.13
	AIC_Long	11933.74	12046.45	9713.94	10874.12	10609.33
	ΔAIC_Surv	11.07	−1.65	3.47	12.45	10.49

	BIC	14238.38	14363.81	12026.18	13177.38	12914.54
	BIC_Long	12002.63	12115.34	9782.83	10943.01	10678.21
	ΔBIC_Surv	7.02	−5.70	−0.58	8.40	6.44

TS	ΔAIC_Surv	18.45	5.17	11.55	24.77	34.32

	ΔBIC_Surv	14.40	1.11	7.50	20.72	30.27

TVC	ΔAIC_Surv	19.52	4.25	15.04	27.76	48.60

	ΔBIC_Surv	15.47	0.20	10.99	23.71	44.54

Open in a new tab

Plots of ΔAIC_Surv (a) and ΔBIC_Surv (b) under the SPML, SPMQ, SPMQL, TML, TMQ, TMQL, TVC, and TS models with K = 2.

Tables 5 and 6 show the hazard ratios (HR’s, the exponentiated parameters) and p-values of the direct treatment effect on PFS (α), the overall treatment effect (α^* = α₁ + β₁γ₁ or α^* = α₁ + βγ₁), and the regression coefficients β associated with random trajectories under SPML, SPMQ, SPMQL, TML, TMQ, and TMQL. From these two tables, we see that except for cough and dyspnea, the HR’s for the overall treatment effect on PFS that ranged from 0.620 to 0.645 for the joint model were smaller than the HR of 0.647 when we fit the PFS data alone. It is interesting to mention that under TML and TMQ, the order in the magnitude of the HR’s for β is consistent with the values of ΔAIC_Surv and ΔBIC_Surv. For example, pain had the largest HR’s, namely, 1.464 and 1.5O4 under TML and TMQ, while cough had the smallest HR’s, namely, 1.194, 1.237, and 1.019 under TML, TMQ and TMQL.

Table 5.

Estimates for the survival components of SPMs with K = 2

	Longitudinal Symptom	α*		α ₁		β ₁		β ₂		β ₃
Model	Longitudinal Symptom	HR	P-value	HR	P-value	HR	P-value	HR	P-value	HR	P-value
SPML	Anorexia	0.628	<.0001	0.604	<.0001	1.403	<.0001	2.850	0.0024
	Cough	0.646	<.0001	0.643	<.0001	1.209	0.0080	1.722	0.0892
	Dyspnea	0.654	<.0001	0.666	0.0001	1.218	0.0043	3.560	<.0001
	Fatigue	0.635	<.0001	0.632	<.0001	1.418	<.0001	3.539	0.0003
	Pain	0.645	<.0001	0.669	0.0002	1.380	<.0001	5.331	<.0001

SPMQ	Anorexia	0.622	<.0001	0.597	<.0001	1.512	<.0001	3.087	0.0439	152.189	0.1499
	Cough	0.638	<.0001	0.632	<.0001	1.271	0.0054	1.811	0.0940	12.470	0.1317
	Dyspnea	0.646	<.0001	0.661	<.0001	1.228	0.0053	2.896	0.0093	44.221	0.0688
	Fatigue	0.620	<.0001	0.619	<.0001	1.446	<.0001	2.275	0.0873	9.179	0.4013
	Pain	0.645	<.0001	0.662	<.0001	1.403	<.0001	4.293	<.0001	144.619	0.0035

SPMQL	Anorexia	0.618	<.0001	0.599	<.0001	1.382	0.0001	1.382	0.0378
	Cough	0.632	<.0001	0.628	<.0001	1.178	0.0193	1.087	0.4370
	Dyspnea	0.637	<.0001	0.649	<.0001	1.176	0.0203	1.416	0.0058
	Fatigue	0.614	<.0001	0.612	<.0001	1.407	<.0001	1.542	0.0018
	Pain	0.638	<.0001	0.652	<.0001	1.308	0.0001	1.618	<.0001

Open in a new tab

When fitting the PFS data alone using (3.6), the HR for the treatment effect is 0.647 with p-value <.0001.

Table 6.

Estimates for the survival components of TMs with K = 2

	Longitudinal Symptom	α*		α ₁		β
Model	Longitudinal Symptom	HR	P-value	HR	P-value	HR	P-value
TML	Anorexia	0.631	<.0001	0.609	<.0001	1.365	<.0001
	Cough	0.650	<.0001	0.647	<.0001	1.194	0.0056
	Dyspnea	0.642	<.0001	0.655	<.0001	1.255	0.0002
	Fatigue	0.630	<.0001	0.627	<.0001	1.417	<.0001
	Pain	0.628	<.0001	0.655	<.0001	1.464	<.0001

TMQ	Anorexia	0.629	<.0001	0.607	<.0001	1.409	<.0001
	Cough	0.647	<.0001	0.642	<.0001	1.237	0.0010
	Dyspnea	0.640	<.0001	0.657	<.0001	1.291	<.0001
	Fatigue	0.628	<.0001	0.626	<.0001	1.486	<.0001
	Pain	0.635	<.0001	0.655	<.0001	1.504	<.0001

TMQL	Anorexia	0.630	<.0001	0.621	<.0001	1.164	0.0008
	Cough	0.650	<.0001	0.650	<.0001	1.019	0.5561
	Dyspnea	0.640	<.0001	0.645	<.0001	1.089	0.0220
	Fatigue	0.629	<.0001	0.628	<.0001	1.161	0.0003
	Pain	0.642	<.0001	0.649	<.0001	1.135	0.0007

Open in a new tab

7. Discussion

We have proposed novel decompositions of AIC and BIC to individually assess the contributions of each component in joint models of longitudinal and survival data, and we use ΔAIC_Surv and ΔBIC_Surv to determine the contribution of the longitudinal data to the fit of the survival data. We conducted extensive simulation studies to examine the empirical performance of ΔAIC_Surv and ΔBIC_Surv and demonstrated our proposed methodology on a detailed case study in mesothelioma. The proposed methodology is quite useful in also comparing and choosing between a trajectory-based model or a shared parameter model, which is important, since these two classes of models are often used and the choice of which one to use is not always clear. Our proposed criteria also help in the assessment of the survival model, in determining how many intervals to choose, for example, in the piecewise constant hazard model.

All computations in Section 5 and 6 were done in SAS. PROC NLMIXED was used to obtain the MLEs and AIC, and PROC IML was used to compute AIC_Long. The Riemann integral was used to compute the cumulative hazard function in (3.5) for the trajectory models. The Monte Carlo (MC) method for estimating AIC_Surv|Long discussed in Remark 4.2 was also implemented in PROC IML. As an illustration, for anorexia, cough, dyspnea, fatigue, and pain under TML with K = 1, the AIC_Surv|Long’s computed from AIC−AIC_Long using PROC NLMIXED were 2210.94, 2223.57, 2217.17, 2205.12, and 2200.94, while those obtained by the Monte Carlo method with an MC sample of size 10,000 were 2211.00, 2223.58, 2217.17, 2205.09, and 2200.93, respectively, which empirically validates the AIC obtained from PROC NLMIXED. Due to the nature of the joint model, there are more random effects in SPMQ than SPML, and hence, the computations for SPMQ are much more intensive than that for SPML. When the patients were followed much longer for the survival times than for the longitudinal outcomes, we used a constant extrapolation afiter the time of the last observed longitudinal outcome for the trajectory $θ_{i}^{'} g (t)$ to adjust the hazard functions for all TM’s and TS. The results shown in Table 2 empirically confrm that the NLMIXED procedure is quite reliable in computing the MLEs of the parameters in the joint model and they are robust with respect to the dimension of the model parameters. Finally, we note that each simulation with 500 simulated datasets took about l hour on a Dell PC with an Intel i5 processor, 2.40 GHz CPU, and 6 GB of memory. The SAS macros are available from the authors upon request.

In our simulation study, we observed that ΔAIC_Surv and ΔBIC_Surv correctly identify the true survival component in the joint model. In the analysis of the EMPHACIS data, we showed that the survival model with K = 2 fit the data the best along with SPML and TMQ. There are several potential extensions of the proposed method. The proposed methodology would be quite useful in situations where we wish to simultaneously jointly model a multivariate longitudinal marker, such as several PRO outcomes, with a time-to-event outcome, such as PFS. The proposed ΔAIC_Surv and ΔBIC_Surv can be very useful in this context as they can tell us about the overall contribution of the multivariate longitudinal marker to the fit of the survival data. The proposed methodology is also useful for multivariate survival data, such as PFS and OS for example, and then ΔAIC_Surv and ΔBIC_Surv can be used in assessing the contribution of the longitudinal data to the fit of the multivariate survival data. Finally, ΔAIC_Surv and ΔBIC_Surv can also be used in joint models for multivariate longitudinal and multivariate survival data and hence identify the combinations of longitudinal outcomes that are most highly associated with a multivariate time-to-event. These extensions are currently under investigation.

Acknowledgments

We would like to thank the Editor, the Associate Editor, and the two anonymous reviewers for their very helpful comments and suggestions, which have led to a much improved version of the paper. Dr. M.-H. Chen and Dr. J. G. Ibrahim's research was partially supported by NIH grants #GM 70335 and #CA 74015.

Appendix: Proofs of Theorems

Proof of Theorem 4.1

Let D_obs = {(Y_i,t_i,δ_i,z_i, x_i), i =1,…,n} denote the observed data. We frst observe that

L (γ, σ^{2} ∣ Y_{i}, W_{i}, θ_{i}) f (θ_{i} ∣ θ, Σ) = f (θ_{i} ∣ Y_{i}, W_{i}, φ_{1}) L (φ_{1} ∣ Y_{i}, W_{i}) .

(A.1)

Using (4.3) and (A.1), the joint likelihood for all subjects is given by

\begin{matrix} L (φ ∣ g, D_{o b s}) & = \prod_{i = 1}^{n} \int L (φ_{2} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, γ) L (γ, σ^{2} ∣ Y_{i}, W_{i}, θ_{i}) f (θ_{i} ∣ θ, Σ) d θ_{i} \\ = \prod_{i = 1}^{n} \int L (φ_{2} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, γ) f (θ_{i} ∣ Y_{i}, W_{i}, φ_{1}) L (φ_{1} ∣ Y_{i}, W_{i}) d θ_{i} \\ = \prod_{i = 1}^{n} L (φ_{i} ∣ Y_{i}, W_{i}) \prod_{i = 1}^{n} \int L (φ_{2} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, γ) f (θ_{i} ∣ Y_{i}, W_{i}, φ_{1}) d θ_{i} . \end{matrix}

(A.2)

Taking -2log of (A.2) yields

- 2 \log L (φ ∣ g, D_{o b s}) = - 2 \sum_{i = 1}^{n} \log L (φ_{1} ∣ Y_{i}, W_{i}) - 2 \sum_{i = 1}^{n} \log \int L (φ_{2} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, γ) f (θ_{i} ∣ Y_{i}, W_{i}, φ_{1}) d θ_{i} .

(A.3)

The AIC decomposition in (4.6) directly follows from (4.4) and (A.3), which completes the proof.

Proof of Theorem 4.2

Since AIC_Surv,0 depends on the survival data alone and BIC_Surv|Long = AIC_Surv|Long + dim(φ₂)(log n − 2), it is sufcient to show

{AIC}_{Surv ∣ Long} (b, c) = {AIC}_{Surv ∣ Long} .

Let Y_i =(Y_i(a_i1),…,Y_i(a_{im_i}))′ and $Y_{i}^{*} = {(Y_{i}^{*} (a_{i 1}), \dots, Y_{i}^{*} (a_{i m_{i}}))}^{'}$ denote the original and transformed longitudinal outcomes, respectively. Then we have $Y_{i}^{*} = \frac{1}{c} Y_{i} - \frac{b}{c} 1_{m_{i}}$ , where $1_{m_{i}} = {(1, \dots, 1^{'})}_{m_{i} \times 1}$ . Write $W_{i} = (W_{i 1} W_{i 2})$ , where W_i1 is a m_i × (q + 1) matrix and W_i2 is a m_i × (p + 1) matrix. The conditional density of Y_i is given by

Y_{i} ∣ W_{i}, θ_{i}, γ, σ^{2} ~ N (W_{i} (\begin{matrix} θ_{i} \\ γ \end{matrix}), σ^{2} I_{m_{i}}) = N (W_{i 1} θ_{i} + W_{i 2 γ}, σ^{2} I_{m_{i}}),

where I_{m_i} is the m_i × m_i identity matrix. We then obtain

Y_{i}^{*} ∣ W_{i}, θ_{i}, γ, σ^{2} ~ N ((W_{i 1} θ_{i} + W_{i 2 γ}, - b 1_{m i}) ∕ c, {(σ ∕ c)}^{2} I_{m i}) .

Write $θ_{i} = {(θ_{i 0}, θ_{i 1}, \dots, θ_{i q})}^{'}$ , $θ_{i q} = {(θ_{i 1}, \dots, θ_{i q})}^{'}$ , $θ = {(θ_{0}, θ_{1}, \dots, θ_{q})}^{'}$ and $θ_{q} = {(θ_{1}, \dots, θ_{q})}^{'}$ . Let $θ_{i}^{*} = (\begin{matrix} (θ_{i 0} - b) ∕ c \\ θ_{i q} ∕ c \end{matrix}) = \frac{1}{c} θ_{i} - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix})$ , $θ^{*} = (\begin{matrix} (θ_{0} - b) ∕ c \\ θ_{q} ∕ c \end{matrix}) = \frac{1}{c} θ - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix})$ , $γ^{*} = γ ∕ c, σ^{*} = σ ∕ c$ and $Σ^{*} = Σ ∕ c^{2}$ . Since θ_i ~ N (θ, ∑), we have

Y_{i}^{*} ∣ W_{i}, θ_{i}^{*}, γ^{*}, {(σ^{*})}^{2} ~ N (W_{i 1} θ_{i}^{*} + W_{i 2} γ^{*}, {(σ^{*})}^{2} I_{m_{i}}), and θ_{i}^{*} ~ N (θ^{*}, Σ^{*}) .

(A.4)

Similar to (3.2), we write the hazard function corresponding to Y_^{i ;* _^} as

λ (t ∣ λ_{0}^{*}, β^{*}, α^{*}, θ_{i}^{*}, g (t), γ^{*}, z_{i}, x_{i}) = λ_{0}^{*} (t) \exp {h (β^{*}, θ_{i}^{*}, g (t), γ_{1}^{*} z_{i}, {(γ_{2}^{'})}^{'} x_{i}) + α_{1}^{*} z_{i} + ({α_{2}^{'}}^{'}) x_{i}},

(A.5)

where λ₀^*(t) is the baseline hazard function, which is assumed to take the same form as λ₀(t) given by (3.4). Then complete-data likelihood function for the survival component in (3.5) under the transformed longitudinal data becomes

\begin{matrix} L^{*} (λ^{*}, β^{*}, α^{*} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}^{*}, g, γ^{*}) & = {[λ (t_{i} ∣ λ_{0}^{*}, β^{*}, α^{*}, θ_{i}^{*}, g (t_{i}), γ^{*}, z_{i}, x_{i})]}^{δ_{i}} \\ \times \exp {- \int_{0}^{t_{i}} λ (u ∣ λ_{0}^{*}, β^{*}, α^{*}, θ_{i}^{*}, g (u), γ^{*}, z_{i}, x_{i}) d u} . \end{matrix}

(A.6)

By comparing (A.4) and (A.5) to (4.1), (4.2), and (3.2), we obtain that $λ^{*} = λ \exp ({\hat{β}}_{1} b)$ for SPM, $λ^{*} = λ \exp (\hat{β} b)$ for TM, β^* = cβ, and α^* = α.

Let $\hat{φ}$ and ${\hat{φ}}^{*} = ({\hat{θ}}^{*}, {\hat{γ}}^{*}, {\hat{σ}}^{*}, {\hat{Σ}}^{*}, {\hat{λ}}^{*}, {\hat{β}}^{*}, {\hat{α}}^{*})$ denote the MLEs of the model parameters under the original longitudinal data and the transformed longitudinal data coupled with the same survival data, respectively. Using the transformation invariance principle of MLE, we have ${\hat{θ}}^{*} = (\begin{matrix} ({\hat{θ}}_{0} - b) ∕ c \\ {\hat{θ}}_{q} ∕ c \end{matrix}) = \frac{1}{c} \hat{θ} - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix})$ , ${\hat{γ}}^{*} = \hat{γ} ∕ c$ , ${\hat{σ}}^{*} = \hat{σ} ∕ c$ , ${\hat{Σ}}^{*} = \hat{Σ} ∕ c^{2}$ , ${\hat{λ}}^{*} = \hat{λ} \exp ({\hat{β}}_{1} b)$ for SPM, ${\hat{λ}}^{*} = \hat{λ} \exp (\hat{β} b)$ for TM, ${\hat{β}}^{*} = c \hat{β}$ and ${\hat{α}}^{*} = \hat{α}$ .

Corresponding to $φ_{1} = (γ, σ^{2}, θ, Σ)$ and $φ_{2} = (λ, β, α)$ , we write $φ_{1}^{*} = (γ^{*} {(σ^{*})}^{2}, θ^{*}, Σ^{*})$ and $φ_{2}^{*} = (λ^{*}, β^{*}, α^{*})$ . Write $\hat{φ} = ({\hat{φ}}_{1}, {\hat{φ}}_{2})$ and ${\hat{φ}}^{*} = ({\hat{φ}}_{1}^{*}, {\hat{φ}}_{2}^{*})$ . Let $Σ_{r} = {(Σ^{- 1} + \frac{1}{σ^{2}} W_{i 1}^{'} W_{i 1})}^{- 1}$ and ${Σ_{r}}^{*} = Σ_{r} ∕ c^{2}$ . The conditional distribution of the random effects θ_i given the original longitudinal data takes the form

θ_{i} ∣ Y_{i}, W_{i}, φ_{1} ~ N (Σ_{r} [\frac{1}{σ^{2}} W_{i 1}^{'} (Y_{i} - W_{i 2 γ}) + Σ^{- 1} θ], Σ_{r}),

(A.7)

and the corresponding density is given by

f (θ_{i} ∣ Y_{i}, W_{i}, φ_{i}) = \frac{1}{{(2 π)}^{(q + 1) ∕ 2} ∣ Σ_{r} ∣^{1 ∕ 2}} \exp {- \frac{1}{2} {(θ_{i} - Σ_{r} [\frac{1}{σ^{2}} {W_{i 1}}^{'} (Y_{i} - W_{i 2} γ) + Σ^{- 1} θ])}^{'} \times {Σ_{r}}^{- 1} (θ_{i} - Σ_{r} [\frac{1}{σ^{2}} W_{i 1}^{'} (Y_{i} - W_{i 2} γ) Σ^{- 1} θ])} .

(A.8)

Note that ${(Σ^{- 1} + \frac{1}{σ^{2}} W_{i 1}^{'} W_{i 1})}^{- 1} = Σ - \frac{1}{σ^{2}} Σ {W_{i 1}}^{'} (I + \frac{1}{σ^{2}} W_{i 1} Σ {W_{i 1}^{'}}^{- 1}) W_{i 1} Σ$ and $W_{i 1} = (\begin{matrix} b \\ 0 \end{matrix}) = b 1_{m_{i}}$ . Replacing the original data and parameters with the transformed ones in (A.7) yields

θ_{i}^{*} ∣ Y_{i}^{*}, W_{i}, φ_{1}^{*} ~ N ({Σ_{r}}^{*} [\frac{1}{({σ^{*}}^{2})} {W_{i 1}}^{'} (Y_{i}^{*} - W_{i 2} γ^{*}) + {(Σ^{*})}^{- 1} θ^{*}], {Σ_{r}}^{*}),

(A.9)

and the conditional density of θ^*_i is given by

f^{*} (θ_{i}^{*} ∣ Y_{i}^{*}, W_{i}, φ_{1}^{*}) = \frac{1}{{(2 π)}^{(q + 1) ∕ 2} ∣ Σ_{r}^{*} ∣^{1 ∕ 2}} = \exp {- \frac{1}{2} {(θ_{i}^{*} - Σ_{r}^{*} [\frac{1}{({σ^{*}}^{2})} W_{i 1}^{'} (Y_{i}^{*} - W_{i 2} γ^{*}) + {(Σ^{*})}^{- 1} θ^{*}])}^{'} \times {({Σ_{r}}^{*})}^{- 1} (θ_{i}^{*} - {Σ_{r}}^{*} [\frac{1}{{(σ^{*})}^{2}} W_{i 1}^{'} (Y_{i}^{*} - W_{i 2} γ^{*}) + {(Σ^{*})}^{- 1} θ^{*}])} .

(A.10)

Using (A.6), (A.8), and (A.10) and after some algebra, we can show that

L^{*} ({\hat{φ}}_{2}^{*} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, \frac{1}{c} θ_{i} - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix}), g, {\hat{γ}}^{*}) = L ({\hat{φ}}_{2} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, \hat{γ})

(A.11)

and

f^{*} (\frac{1}{c} θ_{i} - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix}) ∣ Y_{i}^{*}, W_{i}, {\hat{φ}}_{1}^{*}) = c f (θ_{i} ∣ Y_{i}, W_{i}, {\hat{φ}}_{1}) .

(A.12)

Using (A.11) and (A.12), we have AIC_Surv|Long (b, c)

\begin{matrix} = - 2 \sum_{i = 1}^{n} \log \int L^{*} ({\hat{φ}}_{2}^{*} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}^{*}, g, {\hat{γ}}^{*}) f^{*} (θ_{i}^{*} ∣ Y_{i}^{*}, W_{i}, {\hat{φ}}_{1}^{*}) d θ_{i}^{*} + 2 dim (φ_{2}^{*}) \\ = - 2 \sum_{i = 1}^{n} \log \int L^{*} ({\hat{φ}}_{2}^{*} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, \frac{1}{c} θ_{i} - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix}), g, {\hat{γ}}^{*}) f^{*} (\frac{1}{c} θ_{i} - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix}) ∣ Y_{i}^{*}, W_{i}, {\hat{φ}}_{1}^{*}) d [\frac{1}{c} θ_{i} - \frac{1}{c} (\begin{matrix} b \\ 0 \end{matrix})] + 2 \dim (φ_{2}^{*}) \\ = - 2 \sum_{i = 1}^{n} \log \int L ({\hat{φ}}_{2} ∣ t_{i}, δ_{i}, z_{i}, x_{i}, θ_{i}, g, \hat{γ}) f (θ_{i} ∣ Y_{i}, W_{i}, {\hat{φ}}_{1}) d θ_{i} + 2 dim (φ_{2}) \\ = {AIC}_{Surv ∣ Long}, \end{matrix}

which completes the proof.

References

1.Hsieh F, Tseng Y, Wang J. Joint modeling of survival and longitudinal data: Likelihood approach revisited. Biometrics. 2006;62:1037–1043. doi: 10.1111/j.1541-0420.2006.00570.x. [DOI] [PubMed] [Google Scholar]
2.Chen LM, Ibrahim JG, Chu H. Sample size and power determination in joint modeling of longitudinal and survival data. Statistics in Medicine. 2011;30:2295–2309. doi: 10.1002/sim.4263. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Wang P, Shen W, Boye ME. Joint modeling of longitudinal outcomes and survival using latent growth modeling approach in a mesothelioma trial. Health Services and Outcomes Research Methodology. 2012;12:182–199. doi: 10.1007/s10742-012-0092-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Hatfield LA, Boye ME, Hackshaw MD, Carlin BP. Multilevel Bayesian models for survival times and longitudinal patient-reported outcomes with many zeros. Journal of the American Statistical Association. 2012;107:875–885. [Google Scholar]
5.Rothman M, Burke L, Erickson P, Leidy NK, Patrick DL, Petrie CD. Use of existing patient-reported outcome (PRO) instruments and their modification: the ISPOR good research practices for evaluating and documenting content validity for the use of existing instruments and their modification PRO task force report. Value in Health. 2009;12:1075–1083. doi: 10.1111/j.1524-4733.2009.00603.x. [DOI] [PubMed] [Google Scholar]
6.DeMuro C, Clark M, Doward L, Mordin M, Gnanasakthy A. Assessment of PRO label claims granted by the FDA as compared to the EMA (2006-2010) Value in Health. 2013;16:1150–1155. doi: 10.1016/j.jval.2013.08.2293. [DOI] [PubMed] [Google Scholar]
7.Joly F, Vardy J, Pintilie M, Tannock IF. Quality of life and/or symptom control in randomized clinical trials for patients with advanced cancer. Annals of Oncology. 2007;18:1935–1942. doi: 10.1093/annonc/mdm121. [DOI] [PubMed] [Google Scholar]
8.Hatfield LA, Boye ME, Carlin BP. Joint modeling of multiple longitudinal patient-reported outcomes and survival. Journal of Biopharmaceutical Statistics. 2011;21:971–991. doi: 10.1080/10543406.2011.590922. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Brown ER, Ibrahim JG. A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. Biometrics. 2003;59:221–228. doi: 10.1111/1541-0420.00028. [DOI] [PubMed] [Google Scholar]
10.Brown ER, Ibrahim JG. Bayesian approaches to joint cure rate and longitudinal models with applications to cancer vaccine trials. Biometrics. 2003;59:686–693. doi: 10.1111/1541-0420.00079. [DOI] [PubMed] [Google Scholar]
11.Chen MH, Ibrahim JG, Sinha D. A new joint model for longitudinal and survival data with a cure fraction. Journal of Multivariate Analysis. 2004;91:18–34. [Google Scholar]
12.Ibrahim JG, Chen MH, Sinha D. Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine studies. Statistica Sinica. 2004;14:863–883. [Google Scholar]
13.Chi YY, Ibrahim JG. Joint models for multivariate longitudinal and multivariate survival data. Biometrics. 2006;62:432–445. doi: 10.1111/j.1541-0420.2005.00448.x. [DOI] [PubMed] [Google Scholar]
14.Chi YY, Ibrahim JG. Bayesian approaches to joint longitudinal and survival models accommodating both zero and nonzero cure fractions. Statistica Sinica. 2007;17:445–462. [Google Scholar]
15.Ibrahim JG, Chu H, Chen LM. Basic concepts and methods for joint models of longitudinal and survival data. Journal of Clinical Oncology. 2010;28:2796–2801. doi: 10.1200/JCO.2009.25.0654. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Terrin N, Rodday AM, Parsons SK. Joint models for predicting transplant-related mortality from quality of life data. Quality Life Research. doi: 10.1007/s11136-013-0550-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Taylor JMG, Cumberland WG, Sy JP. A stochastic model for analysis of longitudinal AIDS data. Journal of the American Statistical Association. 1994;89:727–736. [Google Scholar]
18.Tsiatis AA, DeGruttola V, Wulfsohn MS. Modelling the relationship of survival to longitudinal data measured with error. Applications to survival and CD4 counts in patients with AIDS. Journal of the American Statistical Association. 1995;90:27–37. [Google Scholar]
19.Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53:330–339. [PubMed] [Google Scholar]
20.Faucett CJ, Thomas CC. Simultaneously modelling censored survival data and repeatedly measured covariates: A Gibbs sampling approach. Statistics in Medicine. 1996;15:1663–1685. doi: 10.1002/(SICI)1097-0258(19960815)15:15<1663::AID-SIM294>3.0.CO;2-1. [DOI] [PubMed] [Google Scholar]
21.Wang Y, Taylor JMG. Jointly modelling longitudinal and event time data, with applications to AIDS studies. Journal of the American Statistical Association. 2001;96:895–905. [Google Scholar]
22.Faucett CL, Schenker N, Taylor JMG. Survival analysis using auxiliary variables via multiple imputation, with application to AIDS clinical trial data. Biometrics. 2002;58:37–47. doi: 10.1111/j.0006-341x.2002.00037.x. [DOI] [PubMed] [Google Scholar]
23.Brown ER, Ibrahim JG, DeGruttola V. A flexible B-spline model for multiple longitudinal biomarkers and survival. Biometrics. 2005;61:64–73. doi: 10.1111/j.0006-341X.2005.030929.x. [DOI] [PubMed] [Google Scholar]
24.Schluchter MD. Methods for the analysis of informatively censored longitudinal data. Statistics in Medicine. 1992;11:1861–1870. doi: 10.1002/sim.4780111408. [DOI] [PubMed] [Google Scholar]
25.Hogan JW, Laird NM. Mixture models for the joint distribution or repeated measures and event times. Statistics in Medicine. 1997;16:239–257. doi: 10.1002/(sici)1097-0258(19970215)16:3<239::aid-sim483>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]
26.Pawitan Y, Self S. Modeling disease marker processes in AIDS. Journal of the American Statistical Association. 1993;88:719–726. [Google Scholar]
27.DeGruttola V, Tu XM. Modeling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics. 1994;50:1003–1014. [PubMed] [Google Scholar]
28.LaValley MP, DeGruttola V. Model for empirical Bayes estimators of longitudinal CD4 counts. Statistics in Medicine. 1996;15:2289–2305. doi: 10.1002/(SICI)1097-0258(19961115)15:21<2289::AID-SIM449>3.0.CO;2-I. [DOI] [PubMed] [Google Scholar]
29.Henderson R, Diggle PJ, Dobson A. Joint modelling of longitudinal measurements and event time data. Biostatistics. 2000;1:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]
30.Xu J, Zeger SL. Joint analysis of longitudinal data comprising repeated measures and times to events. Applied Statistics. 2001;50:375–387. [Google Scholar]
31.Xu J, Zeger SL. The evaluation of multiple surrogate endpoints. Biometrics. 2001;57:81–87. doi: 10.1111/j.0006-341x.2001.00081.x. [DOI] [PubMed] [Google Scholar]
32.Song X, Davidian M, Tsiatis AA. An estimator for the proportional hazards model with multiple longitudinal covariates measured with error. Biostatistics. 2002;3:511–528. doi: 10.1093/biostatistics/3.4.511. [DOI] [PubMed] [Google Scholar]
33.Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: An overview. Statistica Sinica. 2004;14:809–834. [Google Scholar]
34.Ibrahim JG, Chen MH, Sinha D. Bayesian Survival Analysis. Springer-Verlag; New York: 2001. [Google Scholar]
35.Law NJ, Taylor JMG, Sandler H. The joint modeling of a longitudinal disease progression marker and the failure time process in the presence of cure. Biostatistics. 2002;3:547–563. doi: 10.1093/biostatistics/3.4.547. [DOI] [PubMed] [Google Scholar]
36.Vogelzang NJ, Rusthoven JJ, Symanowski J, Denham C, Kaukel E, Ruffie P, Gatzemeier U, Boyer M, Emri S, Manegold C, Niyikiza C, Paoletti P. Phase III study of pemetrexed in combination with cisplatin versus cisplatin alone in patients with malignant pleural mesothelioma. Journal of Clinical Oncology. 2003;21:2636–2644. doi: 10.1200/JCO.2003.11.136. [DOI] [PubMed] [Google Scholar]
37.Robinson BWS, Lake RA. Advances in malignant mesothelioma. New England Journal of Medicine. 2009;353:1591–603. doi: 10.1056/NEJMra050152. [DOI] [PubMed] [Google Scholar]
38.Thompson JK, Westbom CM, Shukla A. Malignant mesothelioma: development to therapy. Journal of Cellular Biochemistry. 2014;115:1–7. doi: 10.1002/jcb.24642. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Bridges JFP, Mohamed AF, Finnern HW, Woehl A, Hauber AB. Patients’ preferences for treatment outcomes for advanced non-small cell lung cancer: A conjoint analysis. Lung Cancer. 2012;77:224–231. doi: 10.1016/j.lungcan.2012.01.016. [DOI] [PubMed] [Google Scholar]
40.Patricia HJ, Gralla RJ, Liepa AM, Symanowski JT, Rusthoven JJ. Measuring quality of life in patients with pleural mesothelioma using a modified version of the Lung Cancer Symptom Scale (LCSS): psychometric properties of the LCSS-Meso. Supportive Care in Cancer. 2006;14:11–21. doi: 10.1007/s00520-005-0837-0. [DOI] [PubMed] [Google Scholar]
41.Hollen PJ, Gralla RJ, Liepa AM, Symanowski JT, Rusthoven JJ. Adapting the Lung Cancer Symptom Scale (LCSS) to mesothelioma: Using the LCSS-Meso conceptual model for validation. Cancer. 2004;101:587–595. doi: 10.1002/cncr.20315. [DOI] [PubMed] [Google Scholar]
42.Bottomley A, Coens C, Efficace F, Gaafar R, Manegold C, Burgers S, Vincent M, Legrand C, van Meerbeeck J. EORTC-NCIC: symptoms and patient-reported well-being: do they predict survival in malignant pleural mesothelioma? A prognostic factor analysis of EORTC-NCIC 08983: randomized phase III study of cisplatin with or without raltitrexed in patients with malignant pleural mesothelioma. Journal of Clinical Oncology. 2007;25:5770–5776. doi: 10.1200/JCO.2007.12.5294. [DOI] [PubMed] [Google Scholar]
43.Cox DR, Oakes D. Analysis of Survival Data. Chapman and Hall; London: 1984. [Google Scholar]
44.Fisher LD, Lin D. Time-dependent covariates in the Cox proportional-hazards regression model. Annual Review of Public Health. 1999;20:145–157. doi: 10.1146/annurev.publhealth.20.1.145. [DOI] [PubMed] [Google Scholar]
45.Akaike H. Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csaki F, editors. Proceedings of the Second International Symposium on Information Theory. Akademiai Kiado; Budapest: 1973. pp. 267–281. [Google Scholar]
46.Schwarz G. Estimating the dimension of a model. The Annals of Statistics. 1978;6:461–464. [Google Scholar]
47.Lawless JF, Zhan M. Analysis of interval-grouped recurrent event data using piecewise constant rate function. Canadian Journal of Statistics. 1998;26:549–565. [Google Scholar]
48.Feng S, Wolfe RA, Port PK. Frailty survival model analysis of the National Deceased Donor Kidney Transplant Dataset using Poisson variance structures. Journal of the American Statistical Association. 2005;100:728–735. [Google Scholar]
49.Liu L. Joint modeling longitudinal semi-continuous data and survival, with application to longitudinal medical cost data. Statistics in Medicine. 2009;28:972–986. doi: 10.1002/sim.3497. [DOI] [PubMed] [Google Scholar]

[R1] 1.Hsieh F, Tseng Y, Wang J. Joint modeling of survival and longitudinal data: Likelihood approach revisited. Biometrics. 2006;62:1037–1043. doi: 10.1111/j.1541-0420.2006.00570.x. [DOI] [PubMed] [Google Scholar]

[R2] 2.Chen LM, Ibrahim JG, Chu H. Sample size and power determination in joint modeling of longitudinal and survival data. Statistics in Medicine. 2011;30:2295–2309. doi: 10.1002/sim.4263. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Wang P, Shen W, Boye ME. Joint modeling of longitudinal outcomes and survival using latent growth modeling approach in a mesothelioma trial. Health Services and Outcomes Research Methodology. 2012;12:182–199. doi: 10.1007/s10742-012-0092-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Hatfield LA, Boye ME, Hackshaw MD, Carlin BP. Multilevel Bayesian models for survival times and longitudinal patient-reported outcomes with many zeros. Journal of the American Statistical Association. 2012;107:875–885. [Google Scholar]

[R5] 5.Rothman M, Burke L, Erickson P, Leidy NK, Patrick DL, Petrie CD. Use of existing patient-reported outcome (PRO) instruments and their modification: the ISPOR good research practices for evaluating and documenting content validity for the use of existing instruments and their modification PRO task force report. Value in Health. 2009;12:1075–1083. doi: 10.1111/j.1524-4733.2009.00603.x. [DOI] [PubMed] [Google Scholar]

[R6] 6.DeMuro C, Clark M, Doward L, Mordin M, Gnanasakthy A. Assessment of PRO label claims granted by the FDA as compared to the EMA (2006-2010) Value in Health. 2013;16:1150–1155. doi: 10.1016/j.jval.2013.08.2293. [DOI] [PubMed] [Google Scholar]

[R7] 7.Joly F, Vardy J, Pintilie M, Tannock IF. Quality of life and/or symptom control in randomized clinical trials for patients with advanced cancer. Annals of Oncology. 2007;18:1935–1942. doi: 10.1093/annonc/mdm121. [DOI] [PubMed] [Google Scholar]

[R8] 8.Hatfield LA, Boye ME, Carlin BP. Joint modeling of multiple longitudinal patient-reported outcomes and survival. Journal of Biopharmaceutical Statistics. 2011;21:971–991. doi: 10.1080/10543406.2011.590922. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Brown ER, Ibrahim JG. A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. Biometrics. 2003;59:221–228. doi: 10.1111/1541-0420.00028. [DOI] [PubMed] [Google Scholar]

[R10] 10.Brown ER, Ibrahim JG. Bayesian approaches to joint cure rate and longitudinal models with applications to cancer vaccine trials. Biometrics. 2003;59:686–693. doi: 10.1111/1541-0420.00079. [DOI] [PubMed] [Google Scholar]

[R11] 11.Chen MH, Ibrahim JG, Sinha D. A new joint model for longitudinal and survival data with a cure fraction. Journal of Multivariate Analysis. 2004;91:18–34. [Google Scholar]

[R12] 12.Ibrahim JG, Chen MH, Sinha D. Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine studies. Statistica Sinica. 2004;14:863–883. [Google Scholar]

[R13] 13.Chi YY, Ibrahim JG. Joint models for multivariate longitudinal and multivariate survival data. Biometrics. 2006;62:432–445. doi: 10.1111/j.1541-0420.2005.00448.x. [DOI] [PubMed] [Google Scholar]

[R14] 14.Chi YY, Ibrahim JG. Bayesian approaches to joint longitudinal and survival models accommodating both zero and nonzero cure fractions. Statistica Sinica. 2007;17:445–462. [Google Scholar]

[R15] 15.Ibrahim JG, Chu H, Chen LM. Basic concepts and methods for joint models of longitudinal and survival data. Journal of Clinical Oncology. 2010;28:2796–2801. doi: 10.1200/JCO.2009.25.0654. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Terrin N, Rodday AM, Parsons SK. Joint models for predicting transplant-related mortality from quality of life data. Quality Life Research. doi: 10.1007/s11136-013-0550-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] 17.Taylor JMG, Cumberland WG, Sy JP. A stochastic model for analysis of longitudinal AIDS data. Journal of the American Statistical Association. 1994;89:727–736. [Google Scholar]

[R18] 18.Tsiatis AA, DeGruttola V, Wulfsohn MS. Modelling the relationship of survival to longitudinal data measured with error. Applications to survival and CD4 counts in patients with AIDS. Journal of the American Statistical Association. 1995;90:27–37. [Google Scholar]

[R19] 19.Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53:330–339. [PubMed] [Google Scholar]

[R20] 20.Faucett CJ, Thomas CC. Simultaneously modelling censored survival data and repeatedly measured covariates: A Gibbs sampling approach. Statistics in Medicine. 1996;15:1663–1685. doi: 10.1002/(SICI)1097-0258(19960815)15:15<1663::AID-SIM294>3.0.CO;2-1. [DOI] [PubMed] [Google Scholar]

[R21] 21.Wang Y, Taylor JMG. Jointly modelling longitudinal and event time data, with applications to AIDS studies. Journal of the American Statistical Association. 2001;96:895–905. [Google Scholar]

[R22] 22.Faucett CL, Schenker N, Taylor JMG. Survival analysis using auxiliary variables via multiple imputation, with application to AIDS clinical trial data. Biometrics. 2002;58:37–47. doi: 10.1111/j.0006-341x.2002.00037.x. [DOI] [PubMed] [Google Scholar]

[R23] 23.Brown ER, Ibrahim JG, DeGruttola V. A flexible B-spline model for multiple longitudinal biomarkers and survival. Biometrics. 2005;61:64–73. doi: 10.1111/j.0006-341X.2005.030929.x. [DOI] [PubMed] [Google Scholar]

[R24] 24.Schluchter MD. Methods for the analysis of informatively censored longitudinal data. Statistics in Medicine. 1992;11:1861–1870. doi: 10.1002/sim.4780111408. [DOI] [PubMed] [Google Scholar]

[R25] 25.Hogan JW, Laird NM. Mixture models for the joint distribution or repeated measures and event times. Statistics in Medicine. 1997;16:239–257. doi: 10.1002/(sici)1097-0258(19970215)16:3<239::aid-sim483>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]

[R26] 26.Pawitan Y, Self S. Modeling disease marker processes in AIDS. Journal of the American Statistical Association. 1993;88:719–726. [Google Scholar]

[R27] 27.DeGruttola V, Tu XM. Modeling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics. 1994;50:1003–1014. [PubMed] [Google Scholar]

[R28] 28.LaValley MP, DeGruttola V. Model for empirical Bayes estimators of longitudinal CD4 counts. Statistics in Medicine. 1996;15:2289–2305. doi: 10.1002/(SICI)1097-0258(19961115)15:21<2289::AID-SIM449>3.0.CO;2-I. [DOI] [PubMed] [Google Scholar]

[R29] 29.Henderson R, Diggle PJ, Dobson A. Joint modelling of longitudinal measurements and event time data. Biostatistics. 2000;1:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]

[R30] 30.Xu J, Zeger SL. Joint analysis of longitudinal data comprising repeated measures and times to events. Applied Statistics. 2001;50:375–387. [Google Scholar]

[R31] 31.Xu J, Zeger SL. The evaluation of multiple surrogate endpoints. Biometrics. 2001;57:81–87. doi: 10.1111/j.0006-341x.2001.00081.x. [DOI] [PubMed] [Google Scholar]

[R32] 32.Song X, Davidian M, Tsiatis AA. An estimator for the proportional hazards model with multiple longitudinal covariates measured with error. Biostatistics. 2002;3:511–528. doi: 10.1093/biostatistics/3.4.511. [DOI] [PubMed] [Google Scholar]

[R33] 33.Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: An overview. Statistica Sinica. 2004;14:809–834. [Google Scholar]

[R34] 34.Ibrahim JG, Chen MH, Sinha D. Bayesian Survival Analysis. Springer-Verlag; New York: 2001. [Google Scholar]

[R35] 35.Law NJ, Taylor JMG, Sandler H. The joint modeling of a longitudinal disease progression marker and the failure time process in the presence of cure. Biostatistics. 2002;3:547–563. doi: 10.1093/biostatistics/3.4.547. [DOI] [PubMed] [Google Scholar]

[R36] 36.Vogelzang NJ, Rusthoven JJ, Symanowski J, Denham C, Kaukel E, Ruffie P, Gatzemeier U, Boyer M, Emri S, Manegold C, Niyikiza C, Paoletti P. Phase III study of pemetrexed in combination with cisplatin versus cisplatin alone in patients with malignant pleural mesothelioma. Journal of Clinical Oncology. 2003;21:2636–2644. doi: 10.1200/JCO.2003.11.136. [DOI] [PubMed] [Google Scholar]

[R37] 37.Robinson BWS, Lake RA. Advances in malignant mesothelioma. New England Journal of Medicine. 2009;353:1591–603. doi: 10.1056/NEJMra050152. [DOI] [PubMed] [Google Scholar]

[R38] 38.Thompson JK, Westbom CM, Shukla A. Malignant mesothelioma: development to therapy. Journal of Cellular Biochemistry. 2014;115:1–7. doi: 10.1002/jcb.24642. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] 39.Bridges JFP, Mohamed AF, Finnern HW, Woehl A, Hauber AB. Patients’ preferences for treatment outcomes for advanced non-small cell lung cancer: A conjoint analysis. Lung Cancer. 2012;77:224–231. doi: 10.1016/j.lungcan.2012.01.016. [DOI] [PubMed] [Google Scholar]

[R40] 40.Patricia HJ, Gralla RJ, Liepa AM, Symanowski JT, Rusthoven JJ. Measuring quality of life in patients with pleural mesothelioma using a modified version of the Lung Cancer Symptom Scale (LCSS): psychometric properties of the LCSS-Meso. Supportive Care in Cancer. 2006;14:11–21. doi: 10.1007/s00520-005-0837-0. [DOI] [PubMed] [Google Scholar]

[R41] 41.Hollen PJ, Gralla RJ, Liepa AM, Symanowski JT, Rusthoven JJ. Adapting the Lung Cancer Symptom Scale (LCSS) to mesothelioma: Using the LCSS-Meso conceptual model for validation. Cancer. 2004;101:587–595. doi: 10.1002/cncr.20315. [DOI] [PubMed] [Google Scholar]

[R42] 42.Bottomley A, Coens C, Efficace F, Gaafar R, Manegold C, Burgers S, Vincent M, Legrand C, van Meerbeeck J. EORTC-NCIC: symptoms and patient-reported well-being: do they predict survival in malignant pleural mesothelioma? A prognostic factor analysis of EORTC-NCIC 08983: randomized phase III study of cisplatin with or without raltitrexed in patients with malignant pleural mesothelioma. Journal of Clinical Oncology. 2007;25:5770–5776. doi: 10.1200/JCO.2007.12.5294. [DOI] [PubMed] [Google Scholar]

[R43] 43.Cox DR, Oakes D. Analysis of Survival Data. Chapman and Hall; London: 1984. [Google Scholar]

[R44] 44.Fisher LD, Lin D. Time-dependent covariates in the Cox proportional-hazards regression model. Annual Review of Public Health. 1999;20:145–157. doi: 10.1146/annurev.publhealth.20.1.145. [DOI] [PubMed] [Google Scholar]

[R45] 45.Akaike H. Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csaki F, editors. Proceedings of the Second International Symposium on Information Theory. Akademiai Kiado; Budapest: 1973. pp. 267–281. [Google Scholar]

[R46] 46.Schwarz G. Estimating the dimension of a model. The Annals of Statistics. 1978;6:461–464. [Google Scholar]

[R47] 47.Lawless JF, Zhan M. Analysis of interval-grouped recurrent event data using piecewise constant rate function. Canadian Journal of Statistics. 1998;26:549–565. [Google Scholar]

[R48] 48.Feng S, Wolfe RA, Port PK. Frailty survival model analysis of the National Deceased Donor Kidney Transplant Dataset using Poisson variance structures. Journal of the American Statistical Association. 2005;100:728–735. [Google Scholar]

[R49] 49.Liu L. Joint modeling longitudinal semi-continuous data and survival, with application to longitudinal medical cost data. Statistics in Medicine. 2009;28:972–986. doi: 10.1002/sim.3497. [DOI] [PubMed] [Google Scholar]

PERMALINK

Assessing model fit in joint models of longitudinal and survival data with applications to cancer clinical trials

Danjie Zhang

Ming-Hui Chen

Joseph G Ibrahim

Mark E Boye

Ping Wang

Wei Shen

Abstract

1. Introduction

2. The EMPHACIS Data

Table 1.

3. The Models

3.1. Longitudinal Component of the Joint Model

3.2. Survival Component of the Joint Model

3.3. The Time-Varying Covariates (TVC) Model

3.4. The Two-Stage (TS) Model

4. Assessing the Contribution of Longitudinal Data When Modeling the Survival Data

4.1. AIC and BIC Decomposition

4.2. ΔAICSurv and ΔBICSurv

5. Simulation Studies

Table 2.

Table 3.

Figure 1.

6. Analysis of the EMPHACIS Data

Figure 2.

Table 4.

Figure 3.

Table 5.

Table 6.

7. Discussion

Acknowledgments

Appendix: Proofs of Theorems

Proof of Theorem 4.1

Proof of Theorem 4.2

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

4.2. ΔAIC_Surv and ΔBIC_Surv