Abstract
Over the last 20 or more years a lot of clinical applications and methodological development in the area of joint models of longitudinal and time-to-event outcomes have come up. In these studies, patients are followed until an event, such as death, occurs. In most of the work, using subject-specific random-effects as frailty, the dependency of these two processes has been established. In this article, we propose a new joint model that consists of a linear mixed-effects model for longitudinal data and an accelerated failure time model for the time-to-event data. These two sub-models are linked via a latent random process. This model will capture the dependency of the time-to-event on the longitudinal measurements more directly. Using standard priors, a Bayesian method has been developed for estimation. All computations are implemented using OpenBUGS. Our proposed method is evaluated by a simulation study, which compares the conditional model with a joint model with local independence by way of calibration. Data on Duchenne muscular dystrophy (DMD) syndrome and a set of data in AIDS patients have been analysed.
Keywords: AFT model, Bartlett decomposition, Bayesian, conditional distribution, muscular dystrophy
2010 Mathematics Subject Classifications: 62F15, 62N01
1. Introduction
In many clinical studies, information is collected on subjects at regular time intervals. Sometimes an event time, for example, death or dropout, also becomes a point of interest. If these two responses are correlated then it may be of interest to examine the association structure between longitudinal measurements and event times. To address this issue, a class of statistical models has been developed, referred to as joint models for longitudinal and time-to-event data. This area of research has received considerable attention in recent years. A standard joint model was developed by Faucett and Thomas [16] and Wulfsohn and Tsiatis [53]. Prior to their work, joint modelling approaches with application in AIDS research have been carried out by Self and Pawitan [49] and De Gruttola and Tu [15]. An overview of this can be found in [33,34,39].
This work is motivated by a study on DMD syndrome data. This genetic disease cannot be cured and affected individuals die after some time. Some steroids are given to improve their quality of life as measured by their muscle scores based on six important muscles. Composite muscle scores may vary from 0 to 10. Severity of the disease may be assessed from these scores where low score indicates bad health condition. Two highly correlated time-to-event indicators (time to walk four steps and time to get up from lying state) were also noted. Furthermore, these indicators were subject to right censoring. It may be noted that these time-to-event indicators are highly dependent on muscle scores. In this situation joint modelling of these two correlated processes is highly desirable. Another data set, which has been used by a number of researchers, is also analysed. This is known as ddI/ ddC study and is available in JM package in R. In this data set, square root of CD4 cell count which is a biomarker for AIDS is used as longitudinal response. Death due to AIDS may depend on the CD4 cell count and hence a joint model approach may be considered.
A well known technique for joint modelling of two simultaneous processes is based on introducing a partly observed random variable following a bivariate normal distribution, where one component defines the longitudinal outcome and the second component defines time-to-event. This approach has been used by many authors (see, e.g. [10,13,17,18,37,46]). Extensions of bivariate models to incorporate a multivariate model generalization have been implemented by Molenberghs and Lesaffre [32]. Further, Fieuws et al. [17] proposed an approach for modelling multivariate longitudinal data where all the possible pairs of longitudinal data were separately modelled and were later combined in the final step. Again, Rizopoulos et al. [45] proposed a new method for calculating residuals and producing diagnostic plots in joint models, based on the idea of multiplying and imputing the missing longitudinal responses under the fitted joint model, thus creating random versions of the completed data set, which can then be used to extract conclusions regarding model assumptions.
In the case of joint models for longitudinal and time-to-event data, the Bayesian perspective enhances and improves estimation and prediction, incorporating prior information into the study [22,25,27,48,54]. However, our main objective is to estimate the survival function, the prediction of survival times, or the computation of other posterior distributions for relevant probabilities or rates. The frequentist approach is not adequate to answer all these questions because the probabilistic characteristics of the relevant estimators are practically impossible to compute [26].
Considerable attention has been given to the area of Bayesian approaches to joint models. The primary source of such work comes from medical science (see [3–7,24,28,52,56]). Conceptual simplicity and dynamic inference, based on posterior distributions, may be considered as main factors contributing to this approach, forming a natural framework for joint models. Again, Rizopoulos et al. [42] used dynamic predictions with time-dependent covariates where the main interest was in optimally utilizing the recorded information and providing the medically relevant summary measures, such as survival probabilities, which eventually aided in decision making. Two measures were presented by Rizopoulos et al. [43] based on information theory that can be used to dynamically select models in time and optimally schedule longitudinal biomarker measurements. Further, Andrinopoulou et al. [2] proposed a Bayesian joint model that allows for a time-varying coefficient to link the longitudinal and survival processes, using P-splines to improve the dynamic predictions.
In our present work, we extend the bivariate approach to a data set with n + 1 components where the first n components define the longitudinal process, and the later components describe the time-to-event. In this work, we are using fully parametric accelerated failure time (AFT) model for the time-to-event response. The main concern is to see how the time-to-event is dependent on the longitudinal process. We have used a Bartlett decomposition of the covariance matrix to define the association between these two associated processes. This modelling approach may be considered as an alternative to the existing joint models where a strong assumption regarding the independence of multiple processes is made. In traditional joint models, it is assumed that if the random effects, present in both the models, is given, the processes become independent. Assuming a structural dependency is more general; it does reduce to the standard approach whenever the correlation between two processes becomes zero. In many situations, the longitudinal response may directly influence the time-to-event process. In previous research, Chakraborty and Das [12] used the longitudinal response as a covariate in the exponent part of the Cox proportional hazard model. In case of fully parametric models, like the model we are using here, this dependency can easily be obtained from the conditional mean of the AFT variable.
Most of the packages like JM [39] for joint models have been implemented from a frequentist perspective. In the R language, the only package that uses a Bayesian perspective is JMbayes. The R package JMbayes was introduced by Rizopoulos [40] that fits joint models under a Bayesian approach, which can fit a wide range of joint models, including joint models for continuous and categorical longitudinal responses. It provided several options for modelling the association structure between two outcomes. Though it does not draw fully Bayesian dynamic inferences, it does propose a mechanism for dynamic prediction [38]. The BUGS language Win/OpenBUGS [30] and JAGS [35], Stan [9], and INLA [47] can also be used to fit joint models using Bayesian inference.
This paper is organized as follows. In Section 2, we propose our joint model and discuss Bayesian inferences. We also mention different model selection criteria. Section 3 deals with analyses of two real data sets, one obtained from a study in muscular dystrophy syndrome and the other drawing from an AIDS study. In Section 4, the performance of the proposed methodology is described in some detail and we conclude with some discussions in Section 5.
2. Joint models and estimation
2.1. Joint models
Joint models for the analysis of longitudinal and time-to-event data have been developed by several authors. The most common method is to consider a latent variable model (or a shared random-effects model [23,51]). Again, Rizopoulos and Ghosh [41] developed a spline-based semi-parametric multivariate joint model for multiple longitudinal outcomes and a time-to-event, and the Dirichlet process prior formulation for the latent terms that allow for general shapes of their distribution.
Let denote the longitudinal response for subject at time j for . Let denotes the event time for individual i, with being the censoring indicator. Then the joint distribution of longitudinal and time-to-event data, given subject-specific random-effects , can be modelled by using the multivariate normal specification:
where and are design matrices for fixed effects and for random-effects, respectively, is the vector of regression parameters and
The vector denotes the association between the longitudinal outcome vector and the time-to-event observation for the individual, i.e. . In the existing literature, the association between these two simultaneous processes has been captured through the subject-specific random-effects . However, when random-effects are present, the estimation involves intractable integrals thus necessitating numerical integration. A new computational approach for fitting such models was proposed by Rizopoulos et al. [44] which was based on the Laplace method for integrals, facilitating the inclusion of high-dimensional random-effects structures. In our model, mainly the within-subject longitudinal correlation is captured by . So, even if the two processes do not share a common , dependency can still be captured using the conditional distribution.
It is difficult to model the entire covariance matrix because of two reasons: positive-definiteness constraints [14,36] and high-dimensionality [29] for each subject. Factorization of the joint distribution of may be used to address this problem. In our setting, we factor the joint distribution of and into two components: a marginal model for and a correlated regression model for given . In the context of joint modelling of longitudinal discrete and continuous processes, a similar type of modelling has been used [18,21,29]. In the presence of subject-specific random-effects , let
Then, using a Bartlett decomposition of a covariance matrix, the new models can be expressed as:
and
(1) |
where is the vector reflecting structural association between these two processes and is used to capture within-subject longitudinal correlation. Again, we have assumed . We have also considered the local dependency through non-zero and . Further let us assume, and . Thus it can be clearly seen that the two submodels are linked via the structural association as well as the by the shared random-effects association.
To capture longitudinal correlation, we assume to be a bivariate normal vector following where is structured as with the variance of being set to 1 for identifiability issues also addressed by Luo [31]. Again, Zhu et al. [55] also considered similar structure for capturing the dispersion of the random-effects.
We incorporate the censoring information for the individual through where takes value 1 in case the event time is censored and 0 in the case of non-censored. Thus the time-to-event observation can be expressed as where represents the censored time point for the individual and the time to event observations obtained from the individual.
The contribution to the likelihood can be expressed as:
where and signifies the conditional hazard and conditional survival function. It is easy to note that,
Under the log-normal assumption, we have
where
and
signifies the conditional mean and variability of the time-to-event observations of the individual when the longitudinal and subject-specific random-effects are given.
It is more difficult to handle the AFT structure [11] in the joint modelling setting than for the Cox model, since is more complicated and the baseline function involves unknown quantities, which is completely different from the case in the Cox model. Moreover, for the baseline hazard function, it is not possible to use the point mass function with masses assigned to all uncensored survival times . Now, the complete data likelihood for the individual:
We take with the assumption of independence among subjects.
We let the combined observed data from all the submodels to be with be the unknown parameter vector. Here, is a coefficient row vector reflecting the association strength between the longitudinal and time-to-event submodels captured via subject-specific random-effects. It can be noted that, if , the two submodels are linked only via the structural association. This fact has been explored by us in Section 4 where we consider two parallel setups for demonstrating the efficiency of the structural association in the context of linking the two submodels. The complete data likelihood for the individual:
where, denotes the longitudinal contribution for the individual; denotes the time-to-event contribution for the individual given longitudinal response; denotes the probability density function of the standard normal variate ; cumulative distribution function of the standard normal variate ; signifies the contribution of the subject-specific random-effects for the individual; and .
2.2. Bayesian inference
In this paper, to estimate the unknown parameters we have adopted a fully Bayesian approach via the Markov Chain Monte Carlo (MCMC) method. We need to consider prior distributions for all unknown parameters in the parameter space . Due to the absence of prior information on the parameters of interest, non-informative priors are assigned. Non-informative priors for all the elements of were chosen.
The prior distribution of all the elements of , and are assumed to be normal with high degree of precision. We use prior distribution of and . The priors of variability parameters and are assumed to be Inverse Gamma with high degree of precision.
The model fitting is implemented with the help of OpenBUGS (OpenBUGS version 3.2.2). The joint posterior density finally takes the form:
The posterior distributions of the parameters do not yield standard forms and hence are sampled by a Metropolis-Hastings Acceptance-Rejection algorithm to obtain suitable estimates. The Gelman–Rubin diagnostic [19] is used to ensure the scale reduction of all parameters are smaller than 1.1. We have run multiple chains with over-dispersed initial values to analyse data. Moreover, to ensure the chain convergence we have used the trace plots and auto-correlation functions as mentioned by Gelman et al. [19]. Other choices of priors such as half-Cauchy priors for variability parameters and normal priors varying in mean and degree of precision for regression parameters are explored. The results thus studied show no significant difference indicating robustness when it comes to prior specifications.
2.3. Model selection
A wide variety of model selection criteria in Bayesian inference are available. Among these, we have adopted the deviance information criterion (DIC) [50], the expected Akaike information criterion (EAIC), the expected Bayesian (or Schwarz) information criterion [8]. The deviance information criterion (DIC) may be considered a hierarchical modelling generalization of the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). When the posterior distributions of the models have been obtained by Markov chain Monte Carlo (MCMC) simulation, DIC may be used effectively in Bayesian model selection problems. This quantity is based on a deviance, defined by:
where is the likelihood function for the observed data and where is some fully specified standardizing term, a function of the data alone. Then, the effective dimension is defined as
where
is the posterior mean deviance, and is an estimate of depending on . The Deviance Information Criterion (DIC) for model comparison is defined as
A small DIC value indicates a better fit of the model. Alternatively one may use expected AIC and expected BIC as suggested by Carlin and Louis [8]. Both criteria are of the form , where the mean is averaged over all MCMC samples. EAIC applies a penalty of 2, while EBIC applies a penalty of . Smaller EAIC and EBIC indicate that the model has a better predictive ability.
3. Data analysis
3.1. Duchenne muscular dystrophy data
The motivating data set is from a study on muscular dystrophy conducted in children and collected by the National Neurosciences Centre (NNC), India. Duchenne and Becker muscular dystrophy (DMD/BMD) is a rapidly progressive form of genetic neuromuscular disease of childhood, caused by the alteration of the dystrophin gene, called the DMD gene located at the short arm of the X chromosome ( ). The dystrophin plays a crucial role in maintaining the membrane of muscle cells. Inherited as X-linked recessive diseases, it also often occurs in people from families without a known history of this condition. This basically signifies that it manifests in males whereas females carrying two X chromosomes can only be carriers of this genetic condition (where any one X chromosome carries the defective or mutated gene). Males having one X and one Y chromosome become affected if the X chromosome carrying the defective gene is inherited from their carrier mother. Symptoms can be perceived before the child reaches age six or even in the early infancy characterized by a steady decline in muscle strength (biomarker) leading to a condition where braces will be required by the child for walking before the age of 10. By the time of 12 years of age most of the patients are compelled to be dependent on wheelchairs. Other clinical conditions include abnormal development of bones leading to skeletal deformities in the body, wasting of thigh and pectoral muscles, breathing troubles, cardiomyopathy (enlarged heart condition) and even intellectual or cognitive impairment. Patients usually die around age 25, the main contributing causes of death being respiratory failure and cardiomyopathy.
The data set of our study consists of composite scores based on longitudinal observations on 6 different muscles, i.e. Neck, Deltoid, Biceps, Iliopsas, Quadriceps, and Hamstrings, which are mainly responsible for all movements of an individual. Time-to-event indicators constitute: (a) time taken by the patient to walk four steps; (b) time to get up from lying state.
A censoring mechanism is incorporated in such a way that censored observations occur when the patient is unable to complete four steps in within 90 seconds or fails to stand up within 60 seconds. The two time-to-event indicators are highly correlated as can be seen form Figure 1. It can be concluded from the figure that it might be sufficient to consider any one of the time-to-event indicators for our analysis. We have considered indicator (a), which is regarded as severer condition by the clinicians. The basic objective of the study was to observe how quality of life, as indicated by time taken to walk four steps, depends on composite muscle scores.
Figure 1.
DMD study. Comparing survival curves.
The profile plot in Figure 2(a) exhibits the highly unbalanced nature of the data. The number of observations for an individual ranges over 3–10. There are no covariates in the data set. Figure 2(b) reveals the relationship of the composite muscle score with Visit obtained using lowess smoothing. There exists an inverse relationship between the composite score and months from baseline for a patient; it is quite evident in the case of a progressive genetic neuromuscular disease. Hence we have included the effect of time (i.e. Obstime) in the longitudinal profile of each individual expressed in the form of a linear mixed-effects model:
where denotes the composite muscle score vector (obtained by taking the mean of six major muscles responsible for movement) for the patient at the time point, β the intercept for that patient for the time point, whereas and signify the intercept and slope components of the subject-specific random-effects for the patient. Here, and Obstime signifies the inclusion of the effect of time in the longitudinal profile, as suggested by the lowess smoothing curve. For the time-to-event submodel, we have the AFT model:
where is the time to climb four steps within 90 seconds. The data exhibited varying values of over different time points for a particular patient, hence, we have used the maximum of them as the final time-to-event observation in the sense of capturing the worst possible condition. Here, denotes the intercept for individual time-to-event observation, and the regression coefficients corresponding to and for the particular subject and . Again, captures the structural bond between the two sub-processes.
Figure 2.
Exploratory data analysis on 26 patients suffering from DMD. (a) DMD study. Profile plot and (b) DMD study. Smoothing.
We explore this data set by fitting our proposed model with the help of two setups as described in Section 4. The results under joint model (with both forms of associations between the two processes) and joint model with local independence JM (with only parametric structural association) are displayed in Table 1 displaying the posterior mean, standard deviation (SD) of the estimated posterior means, and the 95% credible intervals (CI). Both the models display low bias and standard deviation (SD) and SD is pretty close to SE.
Table 1.
DMD study.
JM | JM | ||||||||
---|---|---|---|---|---|---|---|---|---|
Parameter | Name of the effect | Mean | SD | 95% CI | Mean | SD | 95% CI | ||
For longitudinal | |||||||||
Intercept | 1.377 | 0.097 | 1.189 | 1.566 | 1.377 | 0.098 | 1.188 | 1.572 | |
Precision | 0.947 | 0.091 | 0.779 | 1.133 | 0.950 | 0.090 | 0.780 | 1.132 | |
Both processes | |||||||||
Covariance between two processes | 0.301 | 0.172 | 0.017 | 0.586 | 0.302 | 0.173 | 0.014 | 0.585 | |
Variance component | 0.342 | 0.178 | 0.122 | 0.737 | 0.339 | 0.163 | 0.127 | 0.731 | |
Correlation | 0.634 | 0.148 | 0.345 | 0.900 | 0.634 | 0.141 | 0.346 | 0.878 | |
For time-to-event | |||||||||
Intercept | 1.000 | 0.101 | 0.801 | 1.196 | 0.991 | 0.100 | 0.803 | 1.195 | |
Presicion | 1.000 | 0.100 | 0.813 | 1.206 | 0.999 | 0.099 | 0.813 | 1.199 | |
Random intercept | 1.000 | 0.102 | 0.803 | 1.201 | |||||
Random slope | 0.999 | 0.100 | 0.802 | 1.194 |
Note: Results of model fitting. Bold implies significant covariates.
It can be clearly seen that both the joint models report comparable estimates for all the parameters thus establishing the effectiveness of the parametric structural association. Again, parameters such as , , ρ in JM are seen to be performing better in terms of lower SD. Same can be exhibited in the time-to-event submodel for the intercept and , the latter signifying the unconditional variability of the time-to-event observations. The parameters and display positive mean values with low standard deviations signifying the role of subject-specific random-effects in binding the two sub models in JM . All the parameters for both the joint models are seen to be significant.
Table 1 also shows for both JM and JM indicating the fact that longitudinal correlation can also be captured by the subject-specific random-effects. The structural dependence parameter is non-zero and significant for both the joint models. This implies the fact that in situations where are not considered, can efficiently capture the assocation between the two processes. This idea is also confirmed by Table 2 where the Dbar, EAIC, EBIC and DIC values for both the joint models are reported and JM reveals more efficiency in terms of model fitting
Table 2.
Model comparison.
DMD study | Aids study | |||||||
---|---|---|---|---|---|---|---|---|
Dbar | EAIC | EBIC | DIC | Dbar | EAIC | EBIC | DIC | |
JM | 203.000 | 221.000 | 215.735 | 232.700 | 1165.00 | 1193.00 | 1202.37 | 1741.00 |
JM | 201.800 | 215.800 | 211.705 | 231.900 | 1158.00 | 1182.00 | 1190.03 | 1738.00 |
The trace plots for each model parameter display evidence of good mixing properties of the chains. This claim is also supported by (Gelman–Rubin Diagnostic), which displays values around 1.001 in the case of both the joint models.
3.2. AIDS data
Four hundred and sixty-seven patients were enrolled for a ddI/ddC study where 230 patients were randomized, each of them to receive didanosine (ddI) at a dose of 500 mg/day and each of 237 patients were to receive zalcitabine (ddC) at a dose of 2.25 mg/day. The study conducted by the Terry Beirn Community Programs for Clinical Research on AIDS was a randomized open-label clinical trial carried out in order to compare the effectiveness of two alternative antiretroviral drugs, ddI and ddC, in patients affected with HIV who were intolerant to ZDV (zidovudine) or had a symptom of disease progression (a CD4 cell count of 300 or fewer) during the period of ZDV intake.
The study revealed that about 66% (309) patients had experienced progression of disease or death, and about 40% (188) had died after an average follow-up of of 15.6 months. A segment of patients of about 35% (164) had become study drug intolerant, and about 57% (143) of the still living and followed-up had been discontinued permanently from the original study drug. For only 1% (4) patients the vital status was not known and about 7% (31) patients were no longer participating in the study. The CD4 cells are a type of white blood cells made in the spleen, lymph nodes, and thymus gland and are part of the infection fighting system. The absolute number of CD4 lymphocyte cells per cubic millimeter of peripheral blood serves as a reasonable biomarker for progression of AIDS and so it has been measured repeatedly over follow-up periods of 2, 6, 12, and 18 month visits. A decrease in the CD4 cell count over time is an indication of the worsened condition of the immune system of the patient and thus enhancing the risk to infection. We intended to study about how the CD4 count in the patient's blood contribute to the disease progression in the patient. More detail about this study can be found in Abrams et al. [1] and Goldman et al. [20].
The CD4 trajectories for each individual in the study acting as indicator of progression of AIDS can be expressed in terms of a mixed-effects model with covariate Observation time (Obstime) and binary explanatory variables Gender ( , ), AZT-Status ( , ), Drug ( , ), Previous Opportunistic Infection, i.e. PrevOI ( diagnosis at baseline, 0=no AIDS diagnosis at baseline) acting as influencing factors on the CD4 cell count in blood and can be expressed as:
where denotes the square root of the CD4 cell count for the patient at the time point with . Here, is the vector of regression coefficients for the longitudinal trajectory. The square root transformation has been adopted as the CD4 cell count data exhibited right skewness. Again, according to our model, the time-to-event outcome variable, i.e. death due to acquired immunodeficiency syndrome can be expressed as:
where denotes the time-to-event observation indicating the time upto which patient has survived (if death occurs) or time at which the observation was censored (if the patient was alive at the end of the study), being the intercept of the time-to-event trajectory, and the coefficients of the intercept and slope components of the random-effects and . Here, captures the structural covariance that is present between the CD4 cell count observations of a patient and the time to death or progression in disease severity.
As our proposed model relies on conditional dependence of the longitudinal biomarker directly on the event time process, it is clear that all the variables affecting the CD4 biomarker in the blood cell are directly affecting the time-to-event outcome, i.e. the event of death or deteriorated condition, even if the covariates were not included in the time-to-event submodel separately. We have also checked this fact by including covariates such as Drug and Previous Opportunistic Infection (i.e. AIDS or no AIDS at the baseline) in the time-to-event submodel (as these two can be considered capable of influencing the time-to-event directly) and obtained comparable results.
We have adopted a fully parametric accelerated failure time model for modelling the time-to-event part of the process where censoring occurs if the person is alive till the end of the study period. The analysis was carried out under two setups as described in Section 4. The models were compared and the findings displayed in Table 3, which shows that the proposed joint model (JM ) and the joint model with local independence (JM ) are equally competent with comparable estimates and low SD for all the parameters.
Table 3.
AIDS study.
JM | JM | ||||||||
---|---|---|---|---|---|---|---|---|---|
Parameter | Name of the effect | Mean | SD | 95% CI | Mean | SD | 95% CI | ||
For longitudinal | |||||||||
Intercept | 2.599 | 0.102 | 2.403 | 2.798 | 2.604 | 0.094 | 2.425 | 2.793 | |
Obs. time | −0.398 | 0.039 | −0.475 | −0.322 | −0.401 | 0.039 | −0.479 | −0.323 | |
Drug | 0.088 | 0.095 | −0.094 | 0.278 | 0.075 | 0.095 | −0.106 | 0.259 | |
Gender | −0.003 | 0.081 | −0.159 | 0.156 | −0.010 | 0.082 | −0.173 | 0.148 | |
Prev opp infection | −0.442 | 0.059 | −0.553 | −0.326 | −0.433 | 0.059 | −0.548 | −0.324 | |
Intol to ZDV | −0.028 | 0.059 | −0.143 | 0.089 | −0.042 | 0.057 | −0.156 | 0.065 | |
Precision | 3.239 | 0.134 | 2.985 | 3.506 | 3.232 | 0.134 | 2.976 | 3.902 | |
For both processes | |||||||||
covariance between two processes | 0.501 | 0.289 | 0.024 | 0.974 | 0.497 | 0.289 | 0.023 | 0.975 | |
variance component | 0.203 | 0.046 | 0.123 | 0.294 | 0.211 | 0.046 | 0.126 | 0.308 | |
correlation | −0.216 | 0.099 | −0.395 | −0.005 | −0.209 | 0.097 | −0.392 | −0.021 | |
For time-to-event | |||||||||
Intercept | 0.904 | 1.118 | −1.277 | 3.069 | 0.904 | 1.110 | −1.272 | 3.064 | |
precision | 1.002 | 0.101 | 0.812 | 1.208 | 0.999 | 0.099 | 0.813 | 1.204 | |
random intercept | 1.000 | 0.100 | 0.806 | 1.195 | |||||
random slope | 1.000 | 0.099 | 0.805 | 1.193 |
Note: Results of model fitting. Bold implies significant covariates
Further, Table 2 displaying the values of Dbar, EAIC, EBIC and DIC suggests that JM is a preferable model fit than JM . This asserts the fact that structural association between the two submodels can solely serve the role of association structure between the two submodels in the joint model framework.
The estimated regression coefficient of the follow up time was found to be significant with for JM and for JM implying detoriating CD4 cell counts of the patients under study. The covariate Drug held no significance thus conveying the ineffectiveness of one of the drug (ddI/ ddC) over the other in improving the clinical condition. The diagnosis of AIDS at study entry level is found to be significant with estimated coefficients for JM and for JM . Therefore, the patients who were diagnosed with the disease at the start of the study were showing detoriated CD4 cell count as the study progressed. Other factors such as gender and the the patient's intolerance to zidovudine therapy recorded at the entry of the study were found to be insignificant.
It may be noted that, in the time-to-event model, we have not considered any covariate directly. Effects of covariates are incorporated through the conditional mean which involves all the covariates used in longitudinal submodel. This implies that the presence of structural association is very much needed. The structural association parameter was significant with an estimate of for JM and for JM . This reflects the fact that structural association between the observed CD4 cell counts and the time-to-event, i.e. death or deteriorated condition of the patient under this modelling framework is alone sufficient to arrest the correlation between two processes. The precision parameter for the unconditional variability of the time-to-event observations is found to be significant with low standard deviation. The estimated regression coefficients for the intercept and slope for the subject-specific random-effects show significance with positive values and low standard deviation thus conveying the importance of random-effects in binding the two submodels in JM .
4. Simulation study
We conduct extensive simulation studies to evaluate the performance of the proposed joint model and to examine the effectiveness of the structural association based on conditional distribution. Thus we consider two parallel setups and design the simulation study as the proposed joint model (JM ) and the joint model with local independence (JM ). It is to be noted that, JM has the two submodels linked by the structural association captured by as well as by the shared parameter in the form of random-effects. JM links the two submodels only through the structural association as is considered in this setup.
For both the set up we consider 100 datasets with m = 200 individuals each having varying number of longitudinal outcomes. This is generated through ). A single continuous covariate is used for simulation. We simulate covariate values for the patient at the time point, i.e. from a distribution. The random-effects are simulated from , where is of the form .
Longitudinal observations are generated from where the linear trajectory and , where denotes an identity matrix of order and denotes vector representing the time points. Time-to-event observations, i.e. are generated from
We draw the censoring times from an exponential distribution with mean 0.5 and . The censoring percentage was observed to be around 40%. The whole setup was also simulated with a censoring percentage of around 20% and comparable results were obtained. The parameter values were chosen as 1.85, 1.3, 1, 0.3, 0.99, 0.8, 0.3, 0.9, 0.9 and 1.11. The data ( is generated from these parametric assumptions where is the censoring indicator.
The Bayesian framework in Section 2.2 is implemented for obtaining the inference. We examined the Bayesian trace plots and the autocorrelation plots from the summary to ensure the desired number of burn-in iterations to be 5000 and have examined the MCMC convergence and the mixing of the chains. We also examined three parallel MCMC chains with overdispersed initial values and the inference has been based on 5000 iteration values for each chain. The chains exhibit rapid convergence, all achieving convergence within 5000 iterations with all the parameters displaying (the between-chain and within-chain standard deviation ratio) to be around 1.01.
The estimation results are reported in Table 4, displaying the bias (the deviation of the estimated value from the true parameter value), standard deviation (SD) of the estimated posterior means) standard error (SE), i.e. the square root of the average of the posterior variance, and the coverage probabilities (CP) of equal-tailed credible intervals (CI). Table 4 supports the conclusion that the proposed joint model framework provides estimates of all the model parameters with negligible bias where SE is pretty close to SD and the credible interval coverage probabilities exceed the nominal level (95%) for both the joint models. We can see that the Joint model with local independence (JM ) slightly outperforms the proposed joint model (JM ) in terms of smaller bias and standard deviation for most of the parameters. This highlights the fact that even if the two processes do not share common subject-specific random-effects, association can be successfully captured by the structural dependency.
Table 4.
Simulation study.
Joint Model-JM | JM with local independence-JM | ||||||||
---|---|---|---|---|---|---|---|---|---|
Parameter | True | BIAS | SD | SE | CP | BIAS | SD | SE | CP |
For longitudinal outcomes | |||||||||
α | 1.850 | −0.072 | 0.183 | 0.175 | 1.000 | −0.003 | 0.194 | 0.187 | 1.000 |
β | 1.300 | 0.004 | 0.062 | 0.053 | 0.990 | 0.002 | 0.055 | 0.057 | 0.990 |
0.900 | −0.036 | 0.038 | 0.042 | 0.990 | −0.039 | 0.038 | 0.077 | 0.990 | |
For both processes | |||||||||
0.300 | −0.006 | 0.174 | 0.143 | 1.000 | 0.003 | 0.173 | 0.145 | 1.000 | |
0.800 | 0.023 | 0.088 | 0.073 | 0.990 | 0.019 | 0.085 | 0.077 | 0.990 | |
ρ | 0.300 | 0.025 | 0.090 | 0.085 | 1.000 | 0.004 | 0.093 | 0.086 | 1.000 |
For time-to-event | |||||||||
1.000 | 0.001 | 0.099 | 0.103 | 1.000 | 0.003 | 0.102 | 0.097 | 1.000 | |
1.010 | −0.012 | 0.098 | 0.091 | 0.990 | −0.008 | 0.099 | 0.093 | 0.990 | |
0.900 | 0.097 | 0.101 | 0.097 | 0.990 | |||||
0.900 | 0.096 | 0.101 | 0.096 | 0.990 |
Note: Operational characteristics for simulation studies.
Here, in our theoretical framework, we have modelled the time-to-event process by a parametric AFT model, so it is essential that we verify whether our proposed joint model is sufficiently robust under model misspecification. Table 5 displays the results of fitting our proposed joint model and the joint model with local independence when time-to-event data are being simulated from a log-logistic distribution. Here, in most of the cases bias increased to some extent as apprehended, but still within acceptance limit. For example, in longitudinal submodel, it may be seen that bias for β has increased to 0.089 from 0.004. SD and SE also increased as expected. The MCMC convergence and the mixing of the chains has been examined. All chains exhibit rapid convergence with convergence being achieved within 5000 iterations with good mixing of the chains with value within 1.02 for all the parameters.
Table 5.
Simulation study.
Joint Model-JM | JM with local independence-JM | ||||||||
---|---|---|---|---|---|---|---|---|---|
Parameter | True | BIAS | SD | SE | CP | BIAS | SD | SE | CP |
For longitudinal | |||||||||
α | 1.850 | −0.149 | 0.259 | 0.242 | 1.000 | −0.144 | 0.254 | 0.239 | 1.000 |
β | 1.300 | 0.089 | 0.078 | 0.080 | 0.990 | 0.085 | 0.077 | 0.079 | 0.990 |
0.900 | −0.087 | 0.037 | 0.038 | 0.970 | 0.089 | 0.039 | 0.084 | 0.970 | |
For both processes | |||||||||
0.300 | 0.005 | 0.174 | 0.292 | 1.000 | −0.008 | 0.172 | 0.141 | 1.000 | |
0.800 | −0.024 | 0.087 | 0.083 | 0.990 | −0.044 | 0.085 | 0.075 | 0.990 | |
ρ | 0.300 | −0.038 | 0.124 | 0.114 | 1.000 | −0.030 | 0.113 | 0.115 | 1.000 |
For time-to-event | |||||||||
1.000 | −0.004 | 0.102 | 0.102 | 1.000 | −0.005 | 0.103 | 0.106 | 1.000 | |
1.010 | −0.010 | 0.103 | 0.196 | 0.990 | −0.008 | 0.100 | 0.094 | 0.990 | |
0.900 | 0.101 | 0.095 | 0.185 | 0.990 | |||||
0.900 | 0.104 | 0.098 | 0.098 | 0.990 |
Note: Robustness check under model misspecification.
5. Discussion
In this manuscript, efforts have been undertaken towards introducing a new class of models based on a conditional distribution. This approach may further be extended to other cases where the conditional distribution of one process carries information about the other process. This proposed model can further be generalized. In the longitudinal sub-model, instead of using the linear mixed effects model, one may use some nonlinear or partial linear mixed effects model (PLMM) to capture nonlinear disease progression rate. In the accelerated failure time model (AFT) one can introduce some smoothing terms for covariate modelling. A joint model having submodels like PLMM and AFT model with splines will pose computational challenges.
Bayesian framework gives an additional advantage in terms of computational complexity. Frequentist approaches are mostly based on EM algorithm which is highly dependent on the choice of the parameter values at the initial stage. Using OpenBUGS the posterior computation becomes easier. A blend of OpenBUGS and R has been used for computation. The OpenBUGS code used for this model is provided as Supplementary information. Sensitivity of prior selection has also been studied with different standard priors. Two different data sets have been analysed in this work. The AIDS data set is available in R and the DMD data set is also available subject to the approval of National Neurosciences Centre.
It may also be noted that under the proposed set up, it may not be possible to test the dependency of the simultaneous processes which may be done for traditional set up. This shortcoming may be considered as a future research problem. Another challenge will be to incorporate the missing data (non-ignorable) in this model. We are currently working in this direction.
Supplementary Material
Acknowledgments
We like to thank National Neurosciences Centre (NNC) for helping us with the data. We also thank the anonymous reviewers for providing us with valuable suggestions which enhanced the quality of work.
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
- 1.Abrams D.I., Goldman A.I., Launer C., Korvick J.A., Neaton J.D., Crane L.R., Grodesky M., Wakefield S., Muth K., Kornegay S., Cohn D.L., Harris A., Luskin-Hawk R., Markowitz N., Sampson J.H., Thompson M., and Deyton L., A comparative trial of didanosine or zalcitabine after treatment with zidovudine in patients with human immunodeficiency virus infection, New Engl. J. Med. 330 (1994), pp. 657–662. [DOI] [PubMed] [Google Scholar]
- 2.Andrinopoulou E.R., Eilers P.H., Takkenberg J.J., and Rizopoulos D., Improved dynamic predictions from joint models of longitudinal and survival data with time-varying effects using p-splines, Biometrics 74 (2018), pp. 685–693. [DOI] [PubMed] [Google Scholar]
- 3.Armero C., Cabras S., Castellanos M., Perra S., Quirós A., Oruezábal M., and Sánchez-Rubio J., Bayesian analysis of a disability model for lung cancer survival, Stat. Methods Med. Res. 25 (2016), pp. 336–351. [DOI] [PubMed] [Google Scholar]
- 4.Armero C., Forte A., Perpiñán H., Sanahuja M.J., and Agustí S., Bayesian joint modeling for assessing the progression of chronic kidney disease in children, Stat. Methods Med. Res. 27 (2018), pp. 298–311. [DOI] [PubMed] [Google Scholar]
- 5.Baghfalaki T., Ganjali M., and Hashemi R., Bayesian joint modeling of longitudinal measurements and time-to-event data using robust distributions, J. Biopharm. Stat. 24 (2014), pp. 834–855. [DOI] [PubMed] [Google Scholar]
- 6.Brown E.R. and Ibrahim J.G., A Bayesian semiparametric joint hierarchical model for longitudinal and survival data, Biometrics 59 (2003), pp. 221–228. [DOI] [PubMed] [Google Scholar]
- 7.Buta E., O'Malley S.S., and Gueorguieva R., Bayesian joint modelling of longitudinal data on abstinence, frequency and intensity of drinking in alcoholism trials, J. R. Stat. Soc. Ser. A (Statist. Soc.) 181 (2018), pp. 869–888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Carlin B.P. and Louis T.A., Bayesian Methods for Data Analysis, CRC Press, London, 2008. [Google Scholar]
- 9.Carpenter B., Gelman A., Hoffman M.D., Lee D., Goodrich B., Betancourt M., Brubaker M., Guo J., Li P., and Riddell A., Stan: A probabilistic programming language, J. Stat. Softw. 76 (2017), pp. 1–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Catalano P.J. and Ryan L.M., Bivariate latent variable models for clustered discrete and continuous outcomes, J. Amer. Statist. Assoc. 87 (1992), pp. 651–658. [Google Scholar]
- 11.Chakraborty A., Bounded influence function based inference in joint modelling of ordinal partial linear model and accelerated failure time model, Stat. Methods Med. Res. 25 (2016), pp. 2714–2732. [DOI] [PubMed] [Google Scholar]
- 12.Chakraborty A. and Das K., Inferences for joint modelling of repeated ordinal scores and time to event data, Comput. Math. Methods Med. 11 (2010), pp. 281–95. [DOI] [PubMed] [Google Scholar]
- 13.Cox D.R. and Wermuth N., Response models for mixed binary and quantitative variables, Biometrika 79 (1992), pp. 441–461. [Google Scholar]
- 14.Daniels M. and Kass K., Shrinkage estimators for covariance matrices, Biometrics 4 (2001), pp. 1173–1184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.De Gruttola V. and Tu X.M., Modelling progression of CD4-lymphocyte count and its relationship to survival time, Biometrics 50 (1994), pp. 1003–1014. [PubMed] [Google Scholar]
- 16.Faucett C.L. and Thomas D.C., Simultaneously modelling censored survival data and repeatedly measured covariates: A Gibbs sampling approach, Stat. Med. 15 (1996), pp. 1663–1685. [DOI] [PubMed] [Google Scholar]
- 17.Fieuws S., Verbeke G., and Molenberghs G., Random-effects models for multivariate repeated measures, Stat. Methods Med. Res. 16 (2007), pp. 387–397. [DOI] [PubMed] [Google Scholar]
- 18.Fitzmaurice G.M. and Laird N.M., Regression models for a bivariate discrete and continuous outcome with clustering, J. Amer. Statist. Assoc. 90 (1995), pp. 845–852. [Google Scholar]
- 19.Gelman A., Carlin J., Stern H., and Savitz D., Bayesian Data Analysis, London, 2004. [Google Scholar]
- 20.Goldman A.I., Carlin B.P., Crane L.R., Launer C., Korvick J.A., Deyton L., and Abrams D.I., Response of cd4 lymphocytes and clinical consequences of treatment using ddi or ddc in patients with advanced hiv infection, JAIDS J. Acquir. Immune Defic. Syndr. 11 (1996), pp. 161–169. [DOI] [PubMed] [Google Scholar]
- 21.Gueorguieva R.V. and Agresti A., A correlated probit model for joint modeling of clustered binary and continuous responses, J. Amer. Statist. Assoc. 96 (2001), pp. 1102–1112. [Google Scholar]
- 22.Guo X. and Carlin B.P., Separate and joint modeling of longitudinal and event time data using standard computer packages, Am. Stat. 58 (2004), pp. 16–24. [Google Scholar]
- 23.Henderson R., Diggle P., and Dobson A., Joint modelling of longitudinal measurements and event time data, Biostatistics 1 (2000), pp. 465–480. [DOI] [PubMed] [Google Scholar]
- 24.Hu W., Li G., and Li N., A Bayesian approach to joint analysis of longitudinal measurements and competing risks failure time data, Stat. Med. 28 (2009), pp. 1601–1619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ibrahim J.G., Chen M.H., and Sinha D., Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine trials, Statist. Sinica 14 (2004), pp. 863–883. [Google Scholar]
- 26.Ibrahim J.G., Chen M.H., and Sinha D., Bayesian Survival Analysis, Wiley Online Library, New York, 2005. [Google Scholar]
- 27.Li K. and Luo S., Bayesian functional joint models for multivariate longitudinal and time-to-event data, Comput. Statist. Data Anal. 129 (2019), pp. 14–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Li K. and Luo S., Dynamic predictions in Bayesian functional joint models for longitudinal and time-to-event data: An application to alzheimer's disease, Stat. Methods Med. Res. 28 (2019), pp. 327–342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Liu X., Daniels M., and Marcus B., Regression models for a bivariate discrete and continuous outcome with clustering, J. Amer. Statist. Assoc. 90 (2001), pp. 429–438. [Google Scholar]
- 30.Lunn D.J., Thomas A., Best N., and Spiegelhalter D., Winbugs-a Bayesian modelling framework: Concepts, structure, and extensibility, Stat. Comput. 10 (2000), pp. 325–337. [Google Scholar]
- 31.Luo S., A Bayesian approach to joint analysis of multivariate longitudinal data and parametric accelerated failure time, Stat. Med. 33 (2014), pp. 580–594. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Molenberghs G. and Lesaffre E., Marginal modeling of correlated ordinal data using a multivariate plackett distribution, J. Amer. Statist. Assoc. 89 (1994), pp. 633–644. [Google Scholar]
- 33.Molenberghs G., Thijs H., Kenward M.G., and Verbeke G., Sensitivity analysis of continuous incomplete longitudinal outcomes, Stat. Neerl. 57 (2003), pp. 112–135. [Google Scholar]
- 34.G. Molenberghs and G. Verbeke, Models for Discrete Longitudinal Data, Springer, New York, 2005, pp. 481–488.
- 35.M. Plummer, JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling, Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) March 20–22, Vienna, Austria, 2003. [Google Scholar]
- 36.Pourahmadi M. and Daniels M., Dynamic conditionally linear mixed models for longitudinal data, Biometrics 58 (2002), pp. 225–231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Regan M.M. and Catalano P.J., Bivariate dose-response modeling and risk estimation in developmental toxicology, J. Agric. Biol. Environ. Stat. 4 (1999), pp. 217–237. [Google Scholar]
- 38.Rizopoulos D., Dynamic predictions and prospective accuracy in joint models for longitudinal and time-to-event data, Biometrics 67 (2011), pp. 819–829. [DOI] [PubMed] [Google Scholar]
- 39.Rizopoulos D., Joint Models for Longitudinal and Time-to-Event Data: With Applications in R, CRC press, New York, 2012. [Google Scholar]
- 40.Rizopoulos D., The R package jmbayes for fitting joint models for longitudinal and time-to-event data using mcmc, preprint (2014). Available at arXiv, 1404.7625.
- 41.Rizopoulos D. and Ghosh P., A Bayesian semiparametric multivariate joint model for multiple longitudinal outcomes and a time-to-event, Stat. Med. 30 (2011), pp. 1366–1380. [DOI] [PubMed] [Google Scholar]
- 42.Rizopoulos D., Molenberghs G., and Lesaffre E.M., Dynamic predictions with time-dependent covariates in survival analysis using joint modeling and landmarking, Biom. J. 59 (2017), pp. 1261–1276. [DOI] [PubMed] [Google Scholar]
- 43.Rizopoulos D., Taylor J.M., Van Rosmalen J., Steyerberg E.W., and Takkenberg J.J., Personalized screening intervals for biomarkers using joint models for longitudinal and survival data, Biostatistics 17 (2015), pp. 149–164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Rizopoulos D., Verbeke G., and Lesaffre E., Fully exponential laplace approximations for the joint modelling of survival and longitudinal data, J. R. Stat. Soc. Ser. B (Statist. Methodol.) 71 (2009), pp. 637–654. [Google Scholar]
- 45.Rizopoulos D., Verbeke G., and Molenberghs G., Multiple-imputation-based residuals and diagnostic plots for joint models of longitudinal and survival outcomes, Biometrics 66 (2010), pp. 20–29. [DOI] [PubMed] [Google Scholar]
- 46.Roy J. and Lin X., Latent variable models for longitudinal data with multiple continuous outcomes, Biometrics 56 (2000), pp. 1047–1054. [DOI] [PubMed] [Google Scholar]
- 47.Rue H., Martino S., Lindgren F., Simpson D., Riebler A., and Krainski E., INLA: Functions which allow to perform a full Bayesian analysis of structured additive models using integrated nested laplace approximation, 2009, R package version 0.0.
- 48.Rue M., Andrinopoulou E.R., Alvares D., Armero C., Forte A., and Blanch L., Bayesian joint modeling of bivariate longitudinal and competing risks data: An application to study patient-ventilator asynchronies in critical care patients, Biom. J. 59 (2017), pp. 1184–1203. [DOI] [PubMed] [Google Scholar]
- 49.Self S. and Pawitan Y., Modeling a Marker of Disease Progression and Onset of Disease, Springer, Boston, MA, 1992. [Google Scholar]
- 50.Spiegelhalter D.J., Best N.G., Carlin B.P., and Van Der Linde A., Bayesian measures of model complexity and fit, J. R. Stat. Soc. Ser. B (Statist. Methodol.) 64 (2002), pp. 583–639. [Google Scholar]
- 51.G. Verbeke, G. Molenberghs, and D. Rizopoulos, Longitudinal Research with Latent Variables, Springer, Berlin, Heidelberg, 2010, pp. 37–96.
- 52.Wu J., Chen M.H., Schifano E.D., Ibrahim J.G., and Fisher J.D., A new Bayesian joint model for longitudinal count data with many zeros, intermittent missingness, and dropout with applications to hiv prevention trials, Stat. Med. 38 (2019), pp. 5565–5586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Wulfsohn M.S. and Tsiatis A.A., A joint model for survival and longitudinal data measured with error, Biometrics 53 (1997), pp. 330–339. [PubMed] [Google Scholar]
- 54.Yang J., Li R., Xiang P., Hu J., Lu W., Ni Z., and Cai G., On the predictive performance of two Bayesian joint models: A simulation study, Comm. Statist. Simul. Comput. (2020), pp. 1–10. doi: 10.1080/03610918.2020.1804579. [DOI] [Google Scholar]
- 55.Zhu H., DeSantis S.M., and Luo S., Joint modeling of longitudinal zero-inflated count and time-to-event data: A Bayesian perspective, Stat. Methods Med. Res. 27 (2018), pp. 1258–1270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zhu H., Ibrahim J.G., Chi Y.Y., and Tang N., Bayesian influence measures for joint models for longitudinal and survival data, Biometrics 68 (2012), pp. 954–964. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.