Prediction of transplant-free survival in idiopathic pulmonary fibrosis patients using joint models for event times and mixed multivariate longitudinal data

Jiin Choi; Stewart J Anderson; Thomas J Richards; Wesley K Thompson

doi:10.1080/02664763.2014.909784

. Author manuscript; available in PMC: 2015 Jan 1.

Published in final edited form as: J Appl Stat. 2014 Apr 23;41(10):2192–2205. doi: 10.1080/02664763.2014.909784

Prediction of transplant-free survival in idiopathic pulmonary fibrosis patients using joint models for event times and mixed multivariate longitudinal data

Jiin Choi ¹, Stewart J Anderson ², Thomas J Richards ³, Wesley K Thompson ⁴

PMCID: PMC4157686 NIHMSID: NIHMS588767 PMID: 25214700

SUMMARY

We implement a joint model for mixed multivariate longitudinal measurements, applied to the prediction of time until lung transplant or death in idiopathic pulmonary fibrosis. Specifically, we formulate a unified Bayesian joint model for the mixed longitudinal responses and time-to-event outcomes. For the longitudinal model of continuous and binary responses, we investigate multivariate generalized linear mixed models using shared random effects. Longitudinal and time-to-event data are assumed to be independent conditional on available covariates and shared parameters. A Markov chain Monte Carlo (MCMC) algorithm, implemented in OpenBUGS, is used for parameter estimation. To illustrate practical considerations in choosing a final model, we fit 37 different candidate models using all possible combinations of random effects and employ a Deviance Information Criterion (DIC) to select a best fitting model. We demonstrate the prediction of future event probabilities within a fixed time interval for patients utilizing baseline data, post-baseline longitudinal responses, and the time-to-event outcome. The performance of our joint model is also evaluated in simulation studies.

Keywords: Idiopathic Pulmonary Fibrosis, Joint model, Mixed continuous and binary data, Multivariate longitudinal data, Prediction model, Shared parameter model, Survival analysis

1. Introduction

While longitudinal data from biomarkers or intermediate clinical assessments are often collected in Phase II and III clinical trials, the primary clinical trial outcome typically consists of time until the occurrence of a specified event, e.g., time to onset of illness or death. An example from the field of chronic lung disease research is the study of transplant-free survival in idiopathic pulmonary fibrosis (IPF). IPF is a chronic, progressive lung disease characterized by fibrosis of unknown etiology [24]. Here, our primary concern is the prediction of failure times by modeling the relationship between longitudinally measured indicators of pulmonary function and time to either lung transplant or death. Specifically, we utilize longitudinally assessed forced vital capacity (FVC), a continuous measure of pulmonary function, as a predictor of event times. We also utilize a second, binary, disease progression indicator defined as a decline of 5% or more in FVC from baseline FVC. Since these measures are potentially conjointly informative of time to lung transplant or death, it would be desirable to implement a modeling approach that allows prediction of event times using two or more longitudinal responses simultaneously.

Several approaches have been developed for handling multivariate longitudinal outcomes [4,20,16,23,29]. For example, some approaches factorize the joint distribution as the product of the marginal distribution of one variable times the conditional distribution of the second variable given the first [3,11]. Other approaches use random effects to model relationships between outcomes [8,13]. To date, however, most researchers have implemented joint models in which a univariate continuous longitudinal response is related to the time-to-event outcome [7,14,15,27]. In this paper, we develop a joint model for mixed continuous and binary longitudinal responses and a time-to-event outcome. For longitudinal responses we investigate mixed models with shared random effects. To build a joint model with a time-to-event outcome, we assume shared parameters exist between the longitudinal mixed models and the time-to-event outcome. The longitudinal responses and the time-to-event outcome are assumed to be independent conditional on the shared parameters and measured covariates.

We apply the proposed joint model to the dynamic prediction of mortality occurring within a fixed time for a subject still at risk just before time t [30]. A number of authors have proposed the prediction of future event probabilities for subjects based on the joint modeling of longitudinal measurements, time-to-event outcomes, and other covariates [9,25]. Fieuws et al. [9] investigated predicting renal graft failure using multiple longitudinal outcomes of biochemical and physiological markers utilizing a multivariate mixed model [22] and a pattern-mixture approach [19]. Rizopoulos [25] provided individualized prediction models of survival in AIDS patients who also had longitudinal CD4 cell count measurements. Our prediction approach is most closely related to that of Rizopoulos [25]. However, as is outlined in Section 2, we use a fully parametric model and a Bayesian approach similar to that of Guo and Carlin [14].

The remainder of the paper is organized as follows. In Section 2, we present our joint model assuming shared random effects for the longitudinal outcomes. In Section 3, we describe parameter estimation approach and model selection methods. In Section 4, we illustrate the dynamic prediction of event probabilities within a fixed time window. Section 5 presents simulation studies using the proposed model, whereas Section 6 applies the joint models to prediction of failure times in idiopathic pulmonary fibrosis data. Section 7 concludes with a brief discussion.

2. Joint modeling approach

2.1 Notation and definitions

For brevity, we focus the development of the model on the case of bivariate longitudinal responses, one continuous and normally distributed and the other binary. The generalization to higher dimensions and other members of the exponential family of distributions is conceptually straightforward (see Discussion). For subject i, i = 1,···, n, let y₁_ij and y₂_ij denote the j^th outcome at time point t_ij consisting of continuous and binary components, respectively. Further, let $y_{i} = {(y_{1 i}^{⊤}, y_{2 i}^{⊤})}^{⊤}$ denote the bivariate longitudinal outcome vector for subject i, where y_hi = (y_hi₁,···, y_hij)^⊤, h = 1,2, j = 1,···, n_hi is an n_hi-dimensional column vector giving the h^th longitudinal outcomes for subject i. For the longitudinal bivariate response vector, y_i, with different data types, we assume a generalized linear mixed effects model

E (y_{h i} ∣ b_{h i}) = g_{h} (X_{h i} β_{h} + Z_{h i} b_{h i}), h = 1, 2

(1)

where g_h(·) denotes known bijective link functions that differ across data types, X_hi and β_h denote an n_hi × p_h design matrix of covariate values and a p_h-dimensional vector of fixed effects, respectively, and Z_hi and b_hi denote n_hi × q₁ design matrix of covariates and a q_h-dimensional vector of normally distributed random effects with zero mean and covariance matrix as described below. We assume that the elements of y_hi are independent conditional on b_hi [10,12].

Here, we choose the identity link for the continuous response and the logit link for the binary response. Thus the generalized linear mixed effects model (1) can be written in the form

E (y_{1 i} ∣ b_{1 i}) = X_{1 i} β_{1} + Z_{1 i} b_{1 i}

(2a)

logit (Pr (y_{2 i} = 1) ∣ b_{2 i}) = X_{2 i} β_{2} + Z_{2 i} b_{2 i} .

(2b)

We assume that b₁_i follows a normal distribution with a mean vector of zeros and variance-covariance matrix, Σ, and that b₂_i is proportional to b₁_i, i.e., b₂_i = A₀b₁_i, where A₀ is a diagonal matrix of unknown constants. A joint model with this assumption is termed a shared random effects joint model. We briefly outline another alternative, also implemented in our code, in the Discussion section.

2.2 Joint model of multivariate longitudinal outcomes and a time-to-event outcome

Let T_i denote the true event time for subject i, C_i be the censoring time, and δ_i = I(T_i ≤ C_i) be the event indicator. Let $T_{i}^{*} = min (T_{i}, C_{i})$ be the observed event time for subject i. We assume that censoring is non-informative [6]. A proportional hazards model is given by

λ_{i} (t ∣ x_{3 i}, U_{3 i}) = λ_{0} (t) exp {x_{3 i}^{⊤} β_{3} + U_{3 i}},

(3)

where x₃_i is a p₃-dimensional vector of covariates with regression coefficients β₃, and λ₀(t) is the baseline hazard function, which can be assumed to be of a parametric form or left unspecified. To express the effects of longitudinal outcomes on the time-to-event outcome, the shared parameters, U₃_i, are associated with the random effects of longitudinal outcomes b₁_i and b₂_i. The joint model connects the longitudinal response submodels (2a,b) and the event time outcome submodel (3)

U_{3 i} = a_{1}^{⊤} b_{1 i} + a_{2}^{⊤} b_{2 i} + b_{3 i},

(4)

where $a = {(a_{1}^{⊤}, a_{2}^{⊤})}^{⊤}$ is a set of unknown constants and b₃_i is a normally distributed frailty term with mean zero and variance $σ_{3}^{2}$ , independent of the $b_{i} = {(b_{1 i}^{⊤}, b_{2 i}^{⊤})}^{⊤}$ . The hazard function of the time-to-event depends on the longitudinal outcomes through the shared b₁_i and b₂_i. Thus, the parameter a quantifies the degree of association explained by the random effects in (2a,b). In terms of a shared random effects joint model, Eq. (4) can be reduced in the form

U_{3 i} = a_{1}^{⊤} b_{1 i} + b_{3 i} .

The components of x₁, x₂, and x₃ may or may not all be the same, allowing the longitudinal continuous and binary responses, and the time-to-event outcome to depend on different and/or overlapping covariate information.

3. Estimation and model selection

3.1 Parameter estimation

We assume that the longitudinal responses vectors y_i and the time-to-event outcome T are independent conditional on covariates X, Z, and the random effects vector, b. The observed data for the i^th subject, with n_i repeated measurements, for the h^th response are denoted by

{y_{h i} (t_{i j}), X (t_{i j}), Z (t_{i j}), T_{i}^{*}, δ_{i}; h = 1, 2, i = 1, \dots, n, j = 1, \dots, n_{h i}} .

Based on a full conditional independence assumption, we can express the joint distribution of the observed data for the i^th subject as

f (y_{i}, T_{i}^{*}, δ_{i} ∣ b_{i}; θ) = f (y_{1 i} ∣ b_{1 i}; β_{1}, σ_{ε_{1}}^{2}) f (y_{2 i} ∣ b_{2 i}; β_{2}) f (T_{i}^{*}, δ_{i} ∣ b_{i}; β_{3}, a, σ_{3}^{2})

where θ denotes the complete parameter vector and f(·) denotes a generic probability density function. Thus, the log-likelihood for the observed data is given by

\begin{array}{l} \sum_{i = 1}^{n} log \int_{b_{i}} \prod_{j = 1}^{n_{1 i}} [\frac{1}{\sqrt{2 π σ_{ε_{1}}^{2}}} exp {- \frac{1}{2 σ_{ε_{1}}^{2}} {(y_{1 i j} - x_{1 i j} β_{1} - z_{1 i j} b_{1 i})}^{2}}] \\ \times \prod_{j = 1}^{n_{2 i}} [{\frac{1}{1 + exp (- x_{2 i j} β_{2} - z_{2 i j} b_{2 i})}}^{y_{2 i j}} {\frac{exp (- x_{2 i j} β_{2} - z_{2 i j} b_{2 i})}{1 + exp (- x_{2 i j} β_{2} - z_{2 i j} b_{2 i})}}^{1 - y_{2 i j}}] \\ \times {[λ_{0} (T_{i}^{*}) exp {x_{3 i}^{⊤} β_{3} + U_{i}}]}^{δ_{i}} exp {- \int_{0}^{T_{i}^{*}} λ_{0} (s) exp {x_{3 i}^{⊤} β_{3} + U_{i}} d s} \\ \times f (b_{i} ∣ \sum) d b_{i} \end{array}

where x_hij and z_hij are j^th row of matrices x_hi and z_hi, respectively, and f(b_i|Σ) is the multivariate normal density function of b_i conditional on the covariance parameter matrix, Σ.

We implement a Bayesian approach for parameter inferences, using a Gibbs sampling algorithm. The algorithm was programmed using the R interface, rbugs, which accesses the software ‘OpenBUGS’ [21]. We implement the Gibbs sampler using weakly informative priors for all parameters. Fixed effects are normal with large variance components. Error variances and random effect covariances are distributed as inverse gamma and inverse Wishart distributions, respectively [2]. Code used in the analyses is available from the first author upon request.

3.2 Model selection criteria

Spiegelhalter et al. [28] proposed the Deviance Information Criterion (DIC) for model selection. The DIC is a generalization of the Akaike Information Criterion (AIC) [1] for hierarchical models based on the deviance of the posterior distribution. For parameter vector θ and observed data vector, y, let D(θ) be the deviance, D(θ) = −2 log f(y|θ) + 2 log h(y), where f(y|θ) is the likelihood function and h(y) is a standardizing function of the data alone [2]. We define D(θ) to be $D (θ) = - 2 \sum_{i = 1}^{n} log f (y_{i}, T_{i}^{*}, δ_{i}; θ)$ in our joint model, $\bar{D (θ)} = E_{θ ∣ y} [D (θ)]$ to denote the posterior expected deviance and θ̄ = E_θ_|_y[θ] to denote the mean of the posterior distribution of the parameters. The effective number of parameters, p_D, which can capture the complexity of a model is $p_{D} = \bar{D (θ)} - D (\bar{θ})$ . Then, the DIC is defined as $DIC = D (\bar{θ}) + 2 p_{D} = \bar{D (θ)} + p_{D}$ . Smaller values of the DIC indicate better fitting models. As with the AIC, differences of the DIC between models are a tool used for model selection. Differences of 3 to 5 are considered meaningful. In this paper, we use the DIC for model selection.

4. Dynamic prediction of event probabilities

We compute dynamic probabilities of an event occurring within a fixed window as proposed by van Houwelingen and Putter [30]. Specifically, we considered the conditional survival function of time v ≥ t given survival to time t:

S (v ∣ t) = P (T > v ∣ T \geq t, y, X, Z, b; θ, v \geq t) = \frac{S (v ∣ y, X, Z, b; θ, v \geq t)}{S (t ∣ y, X, Z, b; θ)} .

(5)

Equation (5) can also be expressed using a hazard function as

\begin{array}{l} S (v ∣ t) = exp [- {Λ (v ∣ y, X, Z, b; θ) - Λ (t ∣ y, X, Z, b; θ)}] \\ = exp (- \int_{t}^{v} λ (s ∣ y, X, Z, b; θ) d s) \end{array}

(6)

where Λ(·) denotes the cumulative hazard function and λ(·) denotes the instantaneous hazard function. Equation (6) implies that only the hazard on the interval [t, v] is necessary to predict the probability of event up to time v for a subject at risk just before time t. Let v = t + w be a fixed window of width w. We can relate the survival function to the cumulative distribution function as follows:

F_{w} (t) = 1 - S (t + w ∣ t) .

F_w(t) is evaluated at all-time points t where the estimates change value. The variance of this function is based on the Nelson-Aalen estimate of that cumulative hazard that is given as

{s e}^{2} [ln {{\hat{F}}_{w} (t)}] = \sum_{\begin{matrix} t \leq t_{i} \leq t + w \\ t_{i} \in D \end{matrix}} \frac{1}{R {(t_{i})}^{2}}

where Inline graphic denotes the set of event times and R(t) denotes the size of the risk set, i.e., the number of subjects with no event and still being followed just before time t.

5. Simulation Studies

We conducted simulation studies to investigate the performance of the proposed joint models with the longitudinal continuous and binary outcomes and the time-to-event outcome. The simulated longitudinal data consisted of a quantitative outcome and a dichotomous outcome with seven repeated measurements at fixed times 0, 0.5, 1, 1.5, 2, 2.5, and 3 years. We considered three sample sizes, N=100, N = 200 and N = 500 subjects. Both longitudinal outcomes depended on the same fixed covariates: a continuous variable (x₁) sampled from the normal distribution, x₁ ~ N(0, 5²), a dichotomous variable (x₂) sampled from the Bernoulli distribution with probability 0.5, x₂ ~ Bernoulli(0.5), and time (t), t = {0, 0.5, 1, 1.5, 2, 2.5, 3}, were included as fixed covariates. Subject-specific random intercepts and slopes were assumed. For the longitudinal continuous outcome, the measurement error term was normally distributed with mean zero and variance, $σ_{ε_{1}}^{2} = 1$ . The continuous longitudinal outcome was generated from the model

y_{1} (t) = β_{11} + β_{12} x_{1} + β_{13} x_{2} + β_{14} t + b_{10} + b_{11} t + ε_{1} (t)

The binary longitudinal outcome was generated from the model

logit (Pr (y_{2} (t) = 1)) = β_{21} + β_{22} x_{1} + β_{23} x_{2} + β_{24} t + α_{1} b_{10} + α_{2} b_{11} t

and the random effects were assumed to have distribution

(\begin{matrix} b_{10} \\ b_{11} \end{matrix}) ~ N ((\begin{matrix} 0 \\ 0 \end{matrix}), [\begin{matrix} σ_{b_{10}}^{2} & ρ_{12} σ_{b_{10}} σ_{b_{11}} \\ σ_{b_{11}}^{2} \end{matrix}]) .

The time-to-event outcome was generated from an exponential distribution, T ~ exp(μ₃), where μ₃ = exp(β₃₁ + β₃₂x₂ + α₃b₁₀ + α₄b₁₁ + b₃) and $b_{3} ~ N (0, σ_{b_{3}}^{2})$ with $σ_{b_{3}}^{2} = 0.25$ . A non-informative censoring time C was generated from a uniform distribution on [0.2, 2] which resulted in roughly 35% censoring on average. For each simulation study, 200 replications were performed. In each analysis, a total of 15,000 MCMC iterations were used, discarding the first 5,000 iterations as a burn-in.

Table 1 shows the results of the simulation studies, including true parameter values, bias (defined as the true parameter minus the mean estimated parameter), standard errors of the parameter estimates (SE), mean squared error (MSE), and the coverage probability of the estimated 95% credibility intervals (CP). With a few exceptions, most parameters in the joint models show acceptably low levels of bias and good coverage probabilities. For N=100, the biases of variance and covariance of random effects are relatively higher than for larger sample sizes, and it might create very low coverage probabilities. For N=200, CP averages close to 95% for all parameters. Most parameters have very small biases, with the exception of somewhat higher biases in the α parameters. However, biases in α are fairly small as a proportion of the size of the parameter, and decrease with larger sample sizes suggesting that a bias of the links between the longitudinal and time-to-event processes are strongly related to the number of events in the survival outcome. In general, as expected, larger sample sizes (N=500) show better results as indicated by the smaller bias, SE, MSE, and less variable CP that average very close to the nominal 95% level.

Table 1.

Simulation results

Parameter

True

N = 100

N = 200

N = 500

Bias

MSE

Bias

MSE

Bias

MSE

β₁₁

5.0

−0.002

0.104

0.024

0.930

−0.015

0.078

0.014

0.915

0.003

0.051

0.005

0.935

β₁₂

0.8

−0.005

0.138

0.047

0.920

0.008

0.107

0.029

0.880

−0.010

0.071

0.010

0.955

β₁₃

−0.2

0.000

0.012

0.000

0.925

0.001

0.009

0.000

0.925

0.000

0.006

0.000

0.950

β₁₄

−0.2

−0.004

0.056

0.007

0.930

−0.005

0.044

0.004

0.910

−0.004

0.028

0.002

0.945

β₂₁

0.3

0.018

0.223

0.109

0.940

−0.009

0.164

0.054

0.945

−0.003

0.103

0.022

0.960

β₂₂

0.3

−0.016

0.266

0.163

0.920

0.013

0.198

0.084

0.930

0.001

0.125

0.029

0.965

β₂₃

−0.2

−0.005

0.028

0.002

0.965

−0.001

0.020

0.001

0.970

−0.001

0.012

0.000

0.940

β₂₄

−0.25

−0.013

0.114

0.028

0.930

−0.015

0.083

0.015

0.920

−0.004

0.052

0.006

0.925

β₃₁

0.2

−0.023

0.345

0.266

0.930

−0.074

0.266

0.156

0.940

−0.021

0.171

0.059

0.950

β₃₂

1.0

0.003

0.419

0.398

0.945

0.031

0.307

0.216

0.915

−0.002

0.194

0.072

0.985

α₁

1.6

0.224

0.536

0.632

0.945

0.280

0.331

0.296

0.890

0.147

0.198

0.098

0.870

α₂

1.2

0.154

0.244

0.147

0.935

0.040

0.161

0.055

0.935

−0.012

0.100

0.022

0.930

α₃

2.0

0.181

0.891

1.308

0.975

0.164

0.585

0.595

0.980

0.052

0.344

0.211

0.995

α₄

3.5

0.369

0.647

0.836

0.975

0.153

0.457

0.379

0.985

0.087

0.291

0.158

0.960

σ_{ε_{1}}^{2}

1.0

0.029

0.061

0.008

0.925

0.023

0.043

0.004

0.940

0.008

0.027

0.001

0.975

σ_{b_{3}}^{2}

0.25

0.005

0.058

0.003

1.000

0.001

0.055

0.003

1.000

−0.011

0.049

0.003

1.000

σ_{b_{10}}^{2}

0.2

−0.073

0.020

0.006

0.010

−0.024

0.026

0.002

0.865

0.019

0.026

0.001

0.955

σ_{b_{11}}^{2}

0.25

−0.081

0.022

0.007

0.080

−0.016

0.025

0.001

0.895

0.013

0.021

0.001

0.950

ρ₁₂

0.5

−0.220

0.088

0.060

0.165

−0.027

0.067

0.007

0.995

0.043

0.052

0.006

0.945

Open in a new tab

6. Application to mortality in idiopathic pulmonary fibrosis outcomes study

6.1 Data description and Preliminary Analyses

Idiopathic pulmonary fibrosis (IPF) is a chronic, progressive lung disease characterized by fibrosis of unknown etiology [24]. Our primary concern is prediction of failure times, defined as time to either lung transplant or death, utilizing longitudinally measured forced vital capacity (FVC), a continuous measure of pulmonary function. Individual profiles for the pulmonary function assessed by FVC are shown in Figure 1, with subjects randomly apportioned to six subplots in order to more clearly depict these trajectories. An additional disease progression indicator, defined as a decline of 5% or more in FVC from the baseline FVC measurement, is also used as a (binary) longitudinal response. Baseline covariates also considered for inclusion in the models were: age centered at time of diagnosis, baseline FVC, smoking, gender, and time in years.

Subject-specific trajectories of FVC measurements. (PFT: pulmonary function tests)

Of the 125 patients considered in the study, 64 (51.2%) patients died or had lung transplants. Patient characteristics are summarized in Table 3. This dataset was comprised of irregular follow-up times across patients, but generally patients were followed every 3–4 months according to standard of care. The time origin for each patient was the baseline assessment of pulmonary function, performed upon evaluation at the Simmons Center for Interstitial Lung Disease (ILD). The FVC of each patient was repeatedly measured about 11 times on average, up to 43 times. Follow-up information for the study was reported through December 31, 2011. The average follow-up time was 3.7 years among all patients and 4.1 years among patients who did not die nor received a lung transplant. All IPF patients were evaluated at the University of Pittsburgh Medical Center and clinical data were obtained from the Simmons Center for ILD.

Table 3.

Bayesian Model Selection of shared random effects joint model

Model	U₁(t)	U₂(t)	U₃	DIC_y1	DIC_y2	DIC_T	D̄	p_D	DIC_total
	no random effects
1	0	0	0	1795	1715	356.2	3856	11.02	3867
2	0	0	b₃	1795	1715	354.3	3842	23.11	3865
	random intercepts
3	b₁₀	0	0	763.8	1715	356.1	2719	115.7	2835
4	b₁₀	0	b₃	763.9	1715	354.5	2706	127.9	2833
5	b₁₀	0	α₃b₁₀	763.6	1715	357.3	2720	116.3	2836
6	b₁₀	0	α₃b₁₀ + b₃	763.6	1715	355.2	2706	128.2	2834
7	b₁₀	α₁b₁₀	0	719.5	1189	356.2	2143	122.5	2265
8	b₁₀	α₁b₁₀	b₃	718.8	1189	354.5	2128	134.3	2263
9	b₁₀	α₁b₁₀	α₃b₁₀	719.4	1189	357.8	2143	122.9	2266
10	b₁₀	α₁b₁₀	α₃b₁₀ + b₃	718.8	1189	355.3	2128	134.7	2263
	random intercepts and random slopes
11	b₁₀ + b₁₁t	0	0	241.8	1715	356.2	2113	200.4	2313
12	b₁₀ + b₁₁t	0	b₃	241.9	1715	354.2	2099	212.5	2311
13	b₁₀ + b₁₁t	0	α₃b₁₀	241.7	1715	355.2	2111	200.8	2312
14	b₁₀ + b₁₁t	0	α₄b₁₁	241.9	1715	350.4	2104	203.1	2307
15	b₁₀ + b₁₁t	0	α₃b₁₀ + b₃	241.9	1715	353.0	2098	212.2	2310
16	b₁₀ + b₁₁t	0	α₄b₁₁ + b₃	241.0	1715	351.5	2095	212.1	2307
17	b₁₀ + b₁₁t	0	α₃b₁₀ + α₄b₁₁	242.3	1715	351.1	2106	202.1	2308
18	b₁₀ + b₁₁t	0	α₃(b₁₀ + b₁₁)	238.9	1715	352.7	2106	200.7	2307
19	b₁₀ + b₁₁t	0	α₃b₁₀ + α₄b₁₁ + b₃	238.9	1715	349.7	2092	211.4	2304
20	b₁₀ + b₁₁t	α₁b₁₀	0	374.3	1227	355.9	1742	215.1	1957
21	b₁₀ + b₁₁t	α₁b₁₀	b₃	373.7	1227	354.6	1728	227.3	1955
22	b₁₀ + b₁₁t	α₁b₁₀	α₃b₁₀	374.4	1226	357.5	1743	215.8	1958
23	b₁₀ + b₁₁t	α₁b₁₀	α₄b₁₁	372.6	1226	346.6	1727	217.7	1945
24	b₁₀ + b₁₁t	α₁b₁₀	α₃b₁₀ + b₃	375.0	1228	355.1	1729	228.6	1958
25	b₁₀ + b₁₁t	α₁b₁₀	α₄b₁₁ + b₃	372.6	1227	348.0	1719	228.6	1947
26	b₁₀ + b₁₁t	α₁b₁₀	α₃b₁₀ + α₄b₁₁	372.8	1225	347.0	1726	218.9	1945
27	b₁₀ + b₁₁t	α₁b₁₀	α₃(b₁₀ + b₁₁)	373.8	1227	356.2	1741	216.2	1957
28	b₁₀ + b₁₁t	α₁b₁₀	α₃b₁₀ + α₄b₁₁ + b₃	373.6	1226	348.9	1719	229.4	1949
29	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	0	167.2	923.2	356.1	1235	211.6	1447
30	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	b₃	167.0	923.2	354.4	1221	223.9	1445
31	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	α₃b₁₀	168.6	922.4	356.3	1235	211.9	1447
32	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	α₄b₁₁	164.1	923.0	352.9	1229	210.6	1440
33	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	α₃b₁₀ + b₃	166.0	923.1	353.4	1218	224.0	1442
34	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	α₄b₁₁ + b₃	167.2	924.0	351.8	1219	224.3	1443
35	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	α₃b₁₀ + α₄b₁₁	166.2	923.8	352.8	1230	212.7	1443
36	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	α₃(b₁₀ + b₁₁)	165.3	922.5	354.8	1232	211.0	1443
37	b₁₀ + b₁₁t	α₁b₁₀ + α₂b₁₁t	α₃b₁₀ + α₄b₁₁ + b₃	167.6	924.3	351.0	1218	224.5	1443

Open in a new tab

Fitted survival curves of the Weibull model and exponential model are shown in Figure 2. The survival curves of the Weibull model and exponential both show a good fit to the marginal survival function. Because the Weibull and exponential models are almost identical we used the more parsimonious exponential model. The estimated median transplant-free survival time from the initial visit date was 4.4 years.

The Kaplan-Meier plot of transplant-free survival (*black solid curve*); Weibull survival curve (*grey solid curve*) and exponential survival curve (*black broken curve*).

6.2 Joint model

As before, let y₁_ij be the continuous longitudinal outcome indicating FVC measurements, y₂_ij be the binary longitudinal outcome indicating disease progression indicator, and T_i be the transplant-free survival time. More specifically, the model is given by

\begin{array}{l} y_{1 i j} = μ_{1 i} (t_{i j}) + U_{1 i} (t_{i j}) + ε_{1 i j} \\ = β_{11} + β_{12} {BaselineFVC}_{i} + β_{13} {Time}_{i j} + b_{10 i} + b_{11 i} {Time}_{i j} + ε_{1} (t), \\ (\begin{matrix} b_{10 i} \\ b_{11 i} \end{matrix}) ~ N ((\begin{matrix} 0 \\ 0 \end{matrix}), [\begin{matrix} σ_{b_{10}}^{2} & ρ_{b_{1}} σ_{b_{10}} σ_{b_{11}} \\ ρ_{b_{1}} σ_{b_{10}} σ_{b_{11}} & σ_{b_{11}}^{2} \end{matrix}]), ε_{1 i j} ~ N (0, σ_{ε_{1}}^{2}) \\ logit (Pr (y_{2 i j} = 1)) = μ_{2 i} (t_{i j}) + U_{2 i} (t_{i j}) \\ = β_{21} + β_{22} {Age}_{i} + β_{23} {Time}_{i j} + α_{1} b_{10 i} + α_{2} b_{11 i} {Time}_{i j}, \\ T_{i} ~ Exponential (μ_{3}), \\ log (μ_{3 i}) = x_{3 i}^{T} β_{3} + U_{3 i} = β_{31} + β_{32} {Gender}_{i} + β_{33} {Smoking}_{i} + β_{34} {BaselineFVC}_{i} + α_{3} b_{10 i} + α_{4} b_{11 i} + b_{3 i} . \end{array}

Under the shared random effects joint model, the random effects terms of the three submodels are defined as follows:

U₁_i(t_ij) = b₁₀_i + b₁₁_it_ij
U₂_i(t_ij) = α₁b₁₀_i + α₂b₁₁_it_ij
U₃_i = α₃b₁₀_i + α₄b₁₁_i + b₃_i

The association between longitudinal responses and time-to-event outcome is explained by the parameter α = (α₁, α₂, α₃, α₄). For the continuous longitudinal response, we included the covariates of baseline FVC and time in years. For the binary longitudinal response, age and time in years were found to be significant covariates. Gender, smoking, and baseline FVC were significant in the model for transplant-free survival.

Non-informative priors were used for all parameters. For the coefficients of the fixed effects β₁, β₂, and β₃, we assume a multivariate normal distribution with mean zero and variance-covariance matrices 100I₃, 100I₃, and 100I₄, respectively, where I_k indicates an identity matrix with dimension k. We assumed that the association coefficient, α, had a normal distribution with mean zero and variance equal to 100. A total of 15,000 MCMC iterations were used discarding the first 5,000 iterations for the burn-in period.

6.3 Results

We considered all possible combinations of random effects resulting in 37 assessed models. Table 3 summarizes DICs for each submodel, the posterior expected deviance, D̄, the effective number of parameters, p_D, and total DIC scores (DIC_total) for each joint model. Note that DIC_y1, DIC_y2, and DIC_T denote DIC from the longitudinal continuous outcome, the longitudinal binary outcome, and the time-to-event outcome submodels, respectively. These scores show relative contributions of each component to the overall model DIC. DIC_total, was used to choose the “best” model.

Joint models with both subject-specific random intercepts and slopes in both the longitudinal continuous outcome submodel and binary outcome submodel (Models 29–37) have the smallest DICs, and hence the smallest DIC_total compared with other joint models with random effect structures. In the time-to-event submodel, adding a frailty term leads to only a very slight decrease in DIC_T. Model 32 with the smallest DIC_total was selected as our best model. Under Model 32, the longitudinal binary outcome shares the subject-specific random intercept and slope of the longitudinal continuous outcome and the time-to-event outcome is related to the subject-specific random slope of the longitudinal continuous outcome.

In Table 4, the posterior estimates for Model 32 parameters along with their standard errors and 95% credibility intervals are presented. For longitudinal continuous FVC, both the baseline FVC and time are statistically significant. Hence, patients with high values of baseline FVC have high FVC measurements and FVC measurements tend to decrease as time progresses. For the longitudinal binary outcome, there is a significant time effect. For the time-to-event outcome, male smokers with lower baseline FVC have a higher risk of death or transplant. The association parameter, α₄ indicates that there is a nearly significant negative association between the subject-specific random slopes of FVC measurements and the hazard of the transplant-free survival outcome.

Table 4.

Posterior estimates for Model 32

Parameter	Posterior Mean	Std. Error	95% CI
Longitudinal continuous submodel
Intercept (β₁₁)	0.167	0.048	(0.072, 0.263)
BaselineFVC (β₁₂)	0.933	0.014	(0.906, 0.961)
Time (β₁₃)	−0.094	0.016	(−0.122, −0.061)
$σ_{b_{10}}^{2}$	0.081	0.008	(0.066, 0.098)
$σ_{b_{11}}^{2}$	0.068	0.007	(0.056, 0.084)
ρ_b₁	−0.068	0.069	(−0.200, 0.067)
$σ_{ε}^{2}$	0.060	0.002	(0.055, 0.064)
Longitudinal binary submodel
Intercept (β₂₁)	−1.424	0.325	(−2.043, −0.796)
Age (β₂₂)	−0.015	0.014	(−0.044, 0.012)
Time (β₂₃)	1.019	0.180	(0.6513, 1.341)
α₁	−10.67	0.835	(−12.38, −9.145)
α₂	−10.31	0.727	(−11.82, −8.961)
Time-to-event submodel
Intercept (β₃₁)	−1.967	0.530	(−3.003, −0.939)
Male (β₃₂)	0.903	0.360	(0.214, 1.648)
Smoking (β₃₃)	0.719	0.341	(0.087, 1.422)
BaselineFVC (β₃₄)	−0.405	0.203	(−0.807, −0.013)
α₄	−1.613	0.810	(−3.201, −0.025)

Open in a new tab

Figure 3 shows the predicted four-year-ahead failure probabilities for the IPF data set along with 95% confidence intervals, obtained from the best-fitting Model 32. In general, the probability of event within the next four years decreases slowly, with a deceleration apparent if the patient survives longer than two years, so that at baseline subjects have ~45% chance of failure in four years, whereas if a patient does not experience failure by year five, the probability of a failure in the next four years decreases to ~35%. As time progresses, confidence intervals become wider because of the sparsity of the data.

Probability of transplant or death within the next four years using shared random effects joint model 32: Point estimates (*solid curve*) and 95 % confidence intervals (*dotted curves*).

7. Discussion

In this paper, we have developed a joint model for multivariate longitudinal continuous and binary responses and a time-to-event outcome, applying this model to the prediction of failure times from idiopathic pulmonary fibrosis. This general approach was introduced in Rizopoulos and Ghosh [26] who used natural splines to model three mixed longitudinal processes (two continuous and one binary) as predictors of a time-to-event outcome. In our case, we used parametric models for the longitudinal processes as the number of longitudinal observations per individual was relatively small and hence parametric random effect models were more tractable. Simulation studies indicate good performance for our models.

Our joint model was developed utilizing a unified Bayesian framework, including model fitting, parameter inferences, computation of dynamic prediction probabilities with posterior credible intervals, and a DIC model selection criterion that parses out the contributions of the joint sub-models to the overall fit. Results from the best fitting model indicate that the two longitudinal responses jointly contribute nearly significantly to prediction of failure times in IPF as indicated by the value of the shared parameter, α₄, and its 95% credibility interval (Table 4). Our approach for choosing prior distributions followed that of Guo and Carlin [14], which is quite general and easier to implement than that used by Rizopoulos and Ghosh [26].

In our present investigation, we utilized shared random effects to characterize the relationship between the longitudinal continuous and binary outcomes. A disadvantage of the shared random effects joint model is that the structure of the random effects for both the longitudinal outcomes is limited. Nevertheless, shared parameter random effects models do gain efficiency by using a smaller number of random effects parameters compared to other joint model formulations.

A practical purpose our investigation was to illustrate the utility of joint models for computing dynamic predictions of the probabilities of events occurring within a fixed window of time. Given a subject is at risk just before time t, the probability of an event occurring within the next fixed window of time is predicted using the survival function obtained from our proposed joint models. By predicting the probability of event within a fixed window of time we can easily visualize conditional probabilities of events. Plots such as these can be an effective means for presenting dynamic risk of failure times to clinician researchers. Finally, we have implemented our shared random effects joint models in an R interface using OpenBugs, available from the first author upon request.

Finally, in related work, we have implemented a correlated random effects joint model [5] which we plan to investigate further. As part of this future investigation, we wish to extend models proposed by Lee and Nelder [17,18] that expand the correlated random effects models by introducing a general method called maximum h-likelihood estimation. In the correlated random effects model, the relationship between the continuous and binary outcomes can be investigated through correlations among the random effects b₁_i and b₂_i.

Table 2.

Patient characteristics of Idiopathic Pulmonary Fibrosis (IPF) outcome data

	N(=125)	%
Gender
Male	84	67.2
Female	41	32.8
Race
White	122	97.6
Black	1	0.8
American Indian	1	0.8
Oriental	1	0.8
Smoking
Ever	88	70.4
Never	37	29.6
Diagnosis made
Clinically	63	50.4
Historically	62	49.6
Transplant
Yes	23	18.4
No	102	81.6
	Mean	SD
Age (year)
Overall	65.2	9.34
Male	65.7	9.00
Female	64.2	10.04
Follow-up (year)
All patients	3.7	2.32
Alive and not transplanted	4.1	2.35
Baseline PFTs
FVC	2.7	0.85
FEV1	2.2	0.65
DLCO	12.7	4.36

Open in a new tab

Definition of abbreviations: PFT = pulmonary function tests; FVC = forced vital capacity; FEV1 = forced expiratory volume in 1 second; DLCO = diffusing capacity of carbon monoxide.

Acknowledgments

The authors would like to thank Drs. Naftali Kaminski and Charles F. Reynolds III for providing data which helped motivate the methods demonstrated in this manuscript. This work was partially supported by NCI grant numbers U10-CA069974 and U10-CA069651 (Anderson) and NIMH grant number P30-MH-090333 (Anderson and Thompson).

References

1.Akaike H. Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csáki F, editors. Proc 2nd Int Symp Information Theory. Budapest: Akadémiai Kiadó; 1973. pp. 267–281. [Google Scholar]
2.Carlin BP, Louis TA. Bayesian Methods for Data Analysis. 3. Chapman and Hall/CRC Press; Boca Raton, FL: 2009. [Google Scholar]
3.Catalano PJ, Ryan LM. Bivariate latent variable models for clustered discrete and continuous outcomes. J Amer Statist Assoc. 1992;87:651–658. [Google Scholar]
4.Chi YY, Ibrahim JG. Joint models for multivariate longitudinal and multivariate survival data. Biometrics. 2006;62:432–445. doi: 10.1111/j.1541-0420.2005.00448.x. [DOI] [PubMed] [Google Scholar]
5.Choi J-I. PhD dissertation. University of Pittsburgh; 2012. Prediction in the Joint Modeling of Mixed Types of Multivariate Longitudinal Outcomes and a Time-to-Event Outcome. [Google Scholar]
6.Collett D. Modelling Survival Data in Medical Research. Chapman & Hall; London: 1994. [Google Scholar]
7.Faucett CJ, Thomas DC. Simultaneously modeling censored survival data and repeated measured covariates: A Gibbs sampling approach. Stat Med. 1996;15:1663–1685. doi: 10.1002/(SICI)1097-0258(19960815)15:15<1663::AID-SIM294>3.0.CO;2-1. [DOI] [PubMed] [Google Scholar]
8.Fieuws S, Verbeke G. Joint modelling of multivariate longitudinal profiles: pitfalls of the random-effects approach. Stat Med. 2004;23:3093–3104. doi: 10.1002/sim.1885. [DOI] [PubMed] [Google Scholar]
9.Fieuws S, Verbeke G, Maes B, Vanrenterghem Y. Predicting renal graft failure using multivariate longitudinal profiles. Biostatistics. 2008;9:419–431. doi: 10.1093/biostatistics/kxm041. [DOI] [PubMed] [Google Scholar]
10.Fitzmaurice GM, Davidian M, Verbeke G, Molenberghs G. Longitudinal Data Analysis. Chapman and Hall/CRC Press; Boca Raton, FL: 2009. [Google Scholar]
11.Fitzmaurice GM, Laird NM. Regression models for a bivariate discrete and continuous outcome with clustering. J Amer Statist Assoc. 1995;90:845–852. [Google Scholar]
12.Fitzmaurice GM, Laird NM, Ware J. Applied Longitudinal Data. Wiley; Hoboken, New Jersey: 2004. [Google Scholar]
13.Gueorguieva RV, Sanacora G. Joint analysis of repeatedly observed continuous and ordinal measures of disease severity. Stat Med. 2006;25:1307–1322. doi: 10.1002/sim.2270. [DOI] [PubMed] [Google Scholar]
14.Guo X, Carlin BP. Separate and joint modeling of longitudinal and event time data using standard computer packages. Amer Statist. 2004;58:16–24. [Google Scholar]
15.Henderson R, Diggle P, Dobson A. Joint modelling of longitudinal measurements and event time data. Biostatistics. 2000;1:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]
16.Hogan JW, Laird NM. Mixture models for the joint distribution of repeated measures and event times. Stat Med. 1997;16:239–257. doi: 10.1002/(sici)1097-0258(19970215)16:3<239::aid-sim483>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]
17.Lee Y, Nelder JA. Hierarchical generalized linear models. Journal of the Royal Statistical Society Series B (Methodological) 1996:619–678. [Google Scholar]
18.Lee Y, Nelder JA, Pawitan Y. Generalized linear models with random effects: unified analysis via H-likelihood. CRC Press; 2006. [Google Scholar]
19.Little RJA. Pattern-mixture models for multivariate incomplete data. J Amer Statist Assoc. 1993;88:125–134. [Google Scholar]
20.Liu X, Daniels MJ, Marcus B. Joint models for the association of longitudinal binary and continuous processes with application to a smoking cessation trial. J Amer Statist Assoc. 2010;104:429–438. doi: 10.1198/016214508000000904. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Lunn D, Spiegelhalter D, Thomas A, Best N. The BUGS project: Evolution, critique and future directions (with discussion) Stat Med. 2009;28:3049–3082. doi: 10.1002/sim.3680. [DOI] [PubMed] [Google Scholar]
22.Molenberghs G, Verbeke G. Models for Discrete Longitudinal Data. Springer; New York: 2005. [Google Scholar]
23.Molenberghs G, Verbeke G, Demetrio CGB, Vieira AM. A family of generalized linear models for repeated measures with normal and conjugate random effects. Statist Sci. 2010;25:325–347. [Google Scholar]
24.Richards TJ, Kaminski N, Baribaud F, Flavin S, Brodmerkel C, Horowitz D, Li K, Choi J, Vuga LJ, Lindell KO, Klesen M, Zhang Y, Gibson KF. Peripheral Blood Proteins Predict Mortality in Idiopathic Pulmonary Fibrosis. American Journal of Respiratory and Critical Care Medicine. 2012;185:67–76. doi: 10.1164/rccm.201101-0058OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Rizopoulos D. Dynamic predictions and prospective accuracy in joint models for longitudinal and time-to-event data. Biometrics. 2011;67:819–829. doi: 10.1111/j.1541-0420.2010.01546.x. [DOI] [PubMed] [Google Scholar]
26.Rizopoulos D, Ghosh P. A Bayesian semiparametric multivariate joint model for multiple longitudinal outcomes and a time-to-event. Stat Med. 2011;30:1366–1380. doi: 10.1002/sim.4205. [DOI] [PubMed] [Google Scholar]
27.Song X, Davidian M, Tsiatis AA. A semiparametric likelihood approach to joint modeling of longitudinal and time-to-event data. Biometrics. 2002;58:742–753. doi: 10.1111/j.0006-341x.2002.00742.x. [DOI] [PubMed] [Google Scholar]
28.Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian Measures of Model Complexity and Fit. J R Stat Soc Ser B Stat Methodol. 2002;64:583–639. [Google Scholar]
29.Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: An overview. Statist Sinica. 2004;14:809–834. [Google Scholar]
30.van Houwelingen HC, Putter H. Dynamic Prediction in Clinical Survival Analysis. Chapman and Hall/CRC Press; Boca Raton, FL: 2011. [Google Scholar]

[R1] 1.Akaike H. Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csáki F, editors. Proc 2nd Int Symp Information Theory. Budapest: Akadémiai Kiadó; 1973. pp. 267–281. [Google Scholar]

[R2] 2.Carlin BP, Louis TA. Bayesian Methods for Data Analysis. 3. Chapman and Hall/CRC Press; Boca Raton, FL: 2009. [Google Scholar]

[R3] 3.Catalano PJ, Ryan LM. Bivariate latent variable models for clustered discrete and continuous outcomes. J Amer Statist Assoc. 1992;87:651–658. [Google Scholar]

[R4] 4.Chi YY, Ibrahim JG. Joint models for multivariate longitudinal and multivariate survival data. Biometrics. 2006;62:432–445. doi: 10.1111/j.1541-0420.2005.00448.x. [DOI] [PubMed] [Google Scholar]

[R5] 5.Choi J-I. PhD dissertation. University of Pittsburgh; 2012. Prediction in the Joint Modeling of Mixed Types of Multivariate Longitudinal Outcomes and a Time-to-Event Outcome. [Google Scholar]

[R6] 6.Collett D. Modelling Survival Data in Medical Research. Chapman & Hall; London: 1994. [Google Scholar]

[R7] 7.Faucett CJ, Thomas DC. Simultaneously modeling censored survival data and repeated measured covariates: A Gibbs sampling approach. Stat Med. 1996;15:1663–1685. doi: 10.1002/(SICI)1097-0258(19960815)15:15<1663::AID-SIM294>3.0.CO;2-1. [DOI] [PubMed] [Google Scholar]

[R8] 8.Fieuws S, Verbeke G. Joint modelling of multivariate longitudinal profiles: pitfalls of the random-effects approach. Stat Med. 2004;23:3093–3104. doi: 10.1002/sim.1885. [DOI] [PubMed] [Google Scholar]

[R9] 9.Fieuws S, Verbeke G, Maes B, Vanrenterghem Y. Predicting renal graft failure using multivariate longitudinal profiles. Biostatistics. 2008;9:419–431. doi: 10.1093/biostatistics/kxm041. [DOI] [PubMed] [Google Scholar]

[R10] 10.Fitzmaurice GM, Davidian M, Verbeke G, Molenberghs G. Longitudinal Data Analysis. Chapman and Hall/CRC Press; Boca Raton, FL: 2009. [Google Scholar]

[R11] 11.Fitzmaurice GM, Laird NM. Regression models for a bivariate discrete and continuous outcome with clustering. J Amer Statist Assoc. 1995;90:845–852. [Google Scholar]

[R12] 12.Fitzmaurice GM, Laird NM, Ware J. Applied Longitudinal Data. Wiley; Hoboken, New Jersey: 2004. [Google Scholar]

[R13] 13.Gueorguieva RV, Sanacora G. Joint analysis of repeatedly observed continuous and ordinal measures of disease severity. Stat Med. 2006;25:1307–1322. doi: 10.1002/sim.2270. [DOI] [PubMed] [Google Scholar]

[R14] 14.Guo X, Carlin BP. Separate and joint modeling of longitudinal and event time data using standard computer packages. Amer Statist. 2004;58:16–24. [Google Scholar]

[R15] 15.Henderson R, Diggle P, Dobson A. Joint modelling of longitudinal measurements and event time data. Biostatistics. 2000;1:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]

[R16] 16.Hogan JW, Laird NM. Mixture models for the joint distribution of repeated measures and event times. Stat Med. 1997;16:239–257. doi: 10.1002/(sici)1097-0258(19970215)16:3<239::aid-sim483>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]

[R17] 17.Lee Y, Nelder JA. Hierarchical generalized linear models. Journal of the Royal Statistical Society Series B (Methodological) 1996:619–678. [Google Scholar]

[R18] 18.Lee Y, Nelder JA, Pawitan Y. Generalized linear models with random effects: unified analysis via H-likelihood. CRC Press; 2006. [Google Scholar]

[R19] 19.Little RJA. Pattern-mixture models for multivariate incomplete data. J Amer Statist Assoc. 1993;88:125–134. [Google Scholar]

[R20] 20.Liu X, Daniels MJ, Marcus B. Joint models for the association of longitudinal binary and continuous processes with application to a smoking cessation trial. J Amer Statist Assoc. 2010;104:429–438. doi: 10.1198/016214508000000904. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Lunn D, Spiegelhalter D, Thomas A, Best N. The BUGS project: Evolution, critique and future directions (with discussion) Stat Med. 2009;28:3049–3082. doi: 10.1002/sim.3680. [DOI] [PubMed] [Google Scholar]

[R22] 22.Molenberghs G, Verbeke G. Models for Discrete Longitudinal Data. Springer; New York: 2005. [Google Scholar]

[R23] 23.Molenberghs G, Verbeke G, Demetrio CGB, Vieira AM. A family of generalized linear models for repeated measures with normal and conjugate random effects. Statist Sci. 2010;25:325–347. [Google Scholar]

[R24] 24.Richards TJ, Kaminski N, Baribaud F, Flavin S, Brodmerkel C, Horowitz D, Li K, Choi J, Vuga LJ, Lindell KO, Klesen M, Zhang Y, Gibson KF. Peripheral Blood Proteins Predict Mortality in Idiopathic Pulmonary Fibrosis. American Journal of Respiratory and Critical Care Medicine. 2012;185:67–76. doi: 10.1164/rccm.201101-0058OC. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Rizopoulos D. Dynamic predictions and prospective accuracy in joint models for longitudinal and time-to-event data. Biometrics. 2011;67:819–829. doi: 10.1111/j.1541-0420.2010.01546.x. [DOI] [PubMed] [Google Scholar]

[R26] 26.Rizopoulos D, Ghosh P. A Bayesian semiparametric multivariate joint model for multiple longitudinal outcomes and a time-to-event. Stat Med. 2011;30:1366–1380. doi: 10.1002/sim.4205. [DOI] [PubMed] [Google Scholar]

[R27] 27.Song X, Davidian M, Tsiatis AA. A semiparametric likelihood approach to joint modeling of longitudinal and time-to-event data. Biometrics. 2002;58:742–753. doi: 10.1111/j.0006-341x.2002.00742.x. [DOI] [PubMed] [Google Scholar]

[R28] 28.Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian Measures of Model Complexity and Fit. J R Stat Soc Ser B Stat Methodol. 2002;64:583–639. [Google Scholar]

[R29] 29.Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: An overview. Statist Sinica. 2004;14:809–834. [Google Scholar]

[R30] 30.van Houwelingen HC, Putter H. Dynamic Prediction in Clinical Survival Analysis. Chapman and Hall/CRC Press; Boca Raton, FL: 2011. [Google Scholar]

PERMALINK

Prediction of transplant-free survival in idiopathic pulmonary fibrosis patients using joint models for event times and mixed multivariate longitudinal data

Jiin Choi, Ph.D.

Stewart J Anderson, Ph.D.

Thomas J Richards, Ph.D.

Wesley K Thompson, Ph.D.

SUMMARY

1. Introduction

2. Joint modeling approach

2.1 Notation and definitions

2.2 Joint model of multivariate longitudinal outcomes and a time-to-event outcome

3. Estimation and model selection

3.1 Parameter estimation

3.2 Model selection criteria

4. Dynamic prediction of event probabilities

5. Simulation Studies

Table 1.

6. Application to mortality in idiopathic pulmonary fibrosis outcomes study

6.1 Data description and Preliminary Analyses

Figure 1.

Table 3.

Figure 2.

6.2 Joint model

6.3 Results

Table 4.

Figure 3.

7. Discussion

Table 2.

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Prediction of transplant-free survival in idiopathic pulmonary fibrosis patients using joint models for event times and mixed multivariate longitudinal data

Jiin Choi, Ph.D.

Stewart J Anderson, Ph.D.

Thomas J Richards, Ph.D.

Wesley K Thompson, Ph.D.

SUMMARY

1. Introduction

2. Joint modeling approach

2.1 Notation and definitions

2.2 Joint model of multivariate longitudinal outcomes and a time-to-event outcome

3. Estimation and model selection

3.1 Parameter estimation

3.2 Model selection criteria

4. Dynamic prediction of event probabilities

5. Simulation Studies

Table 1.

6. Application to mortality in idiopathic pulmonary fibrosis outcomes study

6.1 Data description and Preliminary Analyses

Figure 1.

Table 3.

Figure 2.

6.2 Joint model

6.3 Results

Table 4.

Figure 3.

7. Discussion

Table 2.

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases