Abstract
Joint modeling of associated mixed biomarkers in longitudinal studies leads to a better clinical decision by improving the efficiency of parameter estimates. In many clinical studies, the observed time for two biomarkers may not be equivalent and one of the longitudinal responses may have recorded in a longer time than the other one. In addition, the response variables may have different missing patterns. In this paper, we propose a new joint model of associated continuous and binary responses by accounting different missing patterns for two longitudinal outcomes. A conditional model for joint modeling of the two responses is used and two shared random effects models are considered for intermittent missingness of two responses. A Bayesian approach using Markov Chain Monte Carlo (MCMC) is adopted for parameter estimation and model implementation. The validation and performance of the proposed model are investigated using some simulation studies. The proposed model is also applied for analyzing a real data set of bariatric surgery.
Keywords: Conditional model, intermittent missingness, joint modeling, longitudinal data, MCMC methods, mixed-effects model
1. Introduction
In many medical studies, more than one biomarker, with different types of outcomes, may be collected for the same patient. In most of the applications, these biomarkers are highly associated. They may also show different patterns for the effect of treatment. Separate modeling of these biomarkers may lead to misleading results, since the associations among biomarkers are not taken into account.
Simultaneous or joint modeling of mixed biomarkers is an effective approach for accounting associations among different biomarkers and more likely to provide more valid results and conclusions.
The motivation for our study is based on the data collected from people with morbid obesity undergoing a weight loss surgery recorded in an Iranian hospital. This data set contains one longitudinal binary response and one continuous longitudinal outcome. The continuous longitudinal outcome (BMI in our study) was collected from the first visit of the study while the longitudinal binary outcome (having complication due to surgery) was collected after the surgery. The existence of missing values is an inevitable part of each longitudinal study. In our study, the pattern of missingness of two longitudinal outcomes is different. Therefore, we consider two missingness mechanisms for our two longitudinal outcomes.
Rubin [21] distinguished among three important missingness mechanisms. When missingness mechanism is unrelated to the missing or observed values, it is called missing completely at random (MCAR). When missingness mechanism depends only on the observed data, that is, given the observed data, missingness does not depend on unobserved values, missing mechanism is called missing at random mechanism (MAR). A mechanism wherein missingness depends on the unobserved data, perhaps in addition to the observed data, is called missing not at random (MNAR). In the likelihood and Bayesian paradigms and when mild regularity conditions are satisfied, the MCAR and MAR mechanisms are ignorable, in the sense that inferences can proceed by analyzing the observed data only, without explicitly addressing a parametric form for the missing mechanism. But, MNAR mechanism is non-ignorable that is for analyzing the data set with MNAR mechanism modeling missingness mechanism in addition to longitudinal measurements modeling is essential. In this status, ignoring the missingness mechanism leads to biased parameter estimates. Therefore, we need some methods that can be jointly modeled missingness mechanisms and longitudinal measurements.
Joint modeling of longitudinal mixed outcomes has been studied by many authors. Usually, a random effects model, a marginal model or a conditional model is used for accounting association between the response variables (vide [6–11,14,22,24]). Li et al. [16] proposed the use of a random effects model for joint modeling of longitudinal continuous, binary and ordinal variables in a Bayesian paradigm. Kurum et al. [15] propose a semivarying joint model for longitudinal binary and continuous outcome in a frequentist paradigm. Liu et al. [17] proposed a joint model for modeling longitudinal binary and continuous data in a Bayesian paradigm and they used a data augmentation approach for handling missing values considering an ignorable missingness mechanism. Also, Gaskins et al. [12] propose a non-ignorable drop-out model via using a pattern mixture model for joint modeling of two longitudinal outcomes.
In this paper, we propose a joint modeling of binary and longitudinal continuous data using a conditional model. As the patterns of misingness for two longitudinal outcomes are different, we use two shared parameter models for missingness mechanisms. Also, dependence of two missingness mechanisms is considered by a conditional model. We develop the Bayesian paradigm for parametric inference. To adopt the Bayesian approach, Markov Chain Monte Carlo (MCMC) methods are used. The iterative simulation of the conditional posterior distribution for each parameter is done using the Gibbs sampler, where the OpenBUGS software is used for model implementation. We use the proposed approach to analyze a real data with morbid obesity recorded in Rasul-e-Akram hospital of Tehran in Iran. In this study, all the patients who have a weight loss surgery between July 2008 and April 2017 are considered. The surgery is categorized on three levels: Mini-gastric bypass, RY gastric bypass and some other laparoscopic surgeries. The simultaneous effect of surgery on BMI and the indicator of complications due to surgery, as two associated mixed longitudinal responses, is investigated.
The rest of the paper is organized as follows. In Section 2, we describe the real data set that would be analyzed in this paper. In Section 3, the formulation of joint model for analyzing of longitudinal mixed continuous and binary measurements is discussed. Section 4 includes a Bayesian implementation of the proposed method. Some simulation studies are implemented for investigating the performance of the proposed method in Section 5. Section 6 includes our application where the proposed model is considered for analyzing the data set. Some conclusions are also given in the last section.
2. Motivation: applied example
The motivation of our study is based on the data collected from people with morbid obesity in Rasul-e-Akram hospital of Tehran which is a referral hospital for bariatric surgery (weight loss surgery) in Iran. The data are recorded between July 2008 and April 2017 for people who undergo Mini-gastric bypass, RY gastric bypass and some other laparoscopic surgery. These people were enrolled and followed up for 9 months. The data on these patients were drawn from Iran national obesity surgery data base. The evaluation of the patients was conducted at three phases: preoperation, peri-operation and postoperation follow-up at 1-, 3-, 6- and 9-month after surgery.
In the pre-operative phase, the basic information includes demographic characteristics (age, sex) and medical history [fatty liver, type 2 Diabetes, dyslipidemia, hypothyroidism (also called underactive or low thyroid)]. These are recorded and considered as predictors in our study. Table 1 summarizes these basic characteristics for each type of surgery. In this table, the mean of age and the proportion of having each characteristic are reported. Also, p-values are given for testing whether samples are from the same distribution for binary outcomes and for testing the equality of means for three types of surgeries. In the peri-operation phase which includes after surgery, after 10 days from surgery and at each follow-up period, the patients were examined for complications due to surgery such as vomiting, constipation, diarrhea, and dumping and the results were recorded. Also, BMI at each visit was recorded. This cohort study was conducted on 554 male patients and 2336 female patients. In this cohort study, 1558 out of 2890 patients have Mini-gastric bypass surgery, 1031 out of 2890 patients have RY gastric bypass surgery and the 301 remaining patients have other laparoscopic surgeries. The mean±sd of BMI for entry, before surgery, after surgery, 10th days after surgery and at 1-, 3-, 6- and 9-month after surgery are given by , , , , , , and , respectively. The behavior of BMI for three types of surgeries during time are plotted in Figure 1. Also, patterns of missingness for BMI and indicator for complications due to surgery are plotted in Figure 2. In each panel of this figure, each row represents an existing pattern of missingness. The red color represents observed values and the gray color represents missingness for that time.
Table 1. Basic characteristics for each type of surgery and total patients ( ).
Mini-gastric bypass | RY gastric bypass | Other laparoscopic | p-value | Total | |
---|---|---|---|---|---|
Age | 0.063 | ||||
Gender | 0.055 | ||||
Fatty liver | <0.001 | ||||
Helicobacter | 0.102 | ||||
Type 2 Diabetes | <0.001 | ||||
Dyslipidemia | <0.001 | ||||
Hypothyroidism | 0.176 |
Figure 1.
Profiles of BMI measurements over time for different types of laparoscopic surgery superimposed by mean over all observed individuals at each time shown by black line. (a) Mini-gastric bypass surgery, (b) RY gastric bypass surgery, (c) other laparoscopic surgery.
Figure 2.
Pattern of missingness for BMI (left panel) and complication due to surgery (right panel). Each row represents a pattern of missingness at each time point. Red color represents observed values and gray color represents missingness for the occasion time.
3. Model specification
3.1. Joint model
The data structure can be described by four components. The first component contains continuous repeated measurements that are assumed to follow a linear mixed-effects model. The second component contains repeated measurements in a binary scale that are assumed to follow a mixed-effects logistic regression model. The last two components contain the missingness mechanisms for the first two components.
Let be the longitudinal continuous response for the ith, subject at time and be the vector of longitudinal measurements for the ith subject at time . Assume that follows a linear mixed-effects model as
(1) |
where is a vector of unknown fixed-effects parameters, is a -vector of the explanatory variables, is a vector of unobservable random effects, is an -dimensional design vector for random effects and is the random error variable. We assume that s are independent and identically normally distributed with zero-mean vector and covariance matrix . Typically, the error terms, 's, are assumed to be normal with zero mean and variance .
Also, assume that in addition to -dimensional vector of longitudinal continuous measurements , longitudinal binary measurements are also observed for the ith subject. Let be the longitudinal binary response for the ith subject, at time . Also, let be the vector of longitudinal binary measurements at time . Assume that follows the logistic mixed-effects regression model as
(2) |
where is a vector of unknown fixed-effects parameters, is a -vector of the explanatory variables, is a vector of unobservable random effects, is a -dimensional design vector for random effects and is the effect of the first response at time (i.e. ) on the second response on the time (i.e. ), when . This parameter can take into account the association between two responses at the same time. We assume that s are independent and identically normally distributed with zero-mean vector and covariance matrix . Note that, for simplicity and our scientific question of interest in application, we assume that the longitudinal continuous response at a time point may affect the longitudinal binary response at the same time point.
Let and be the missingness indicators for the continuous and the binary response at time and , respectively, such that if is missed and if is missed. We consider a shared parameter model [3] for considering non-ignorable mechanisms for and . Assume that follows the logistic mixed-effects regression model as
(3) |
where is a vector of unknown fixed-effects parameters, is a -vector of the explanatory variables, is the vector of unobservable random effects in model (1), is an -dimensional vector of parameters that relate the missingness to the continuous outcome data by the shared random effects. Also, we assume that follows the logistic mixed-effects regression model as
(4) |
where is a vector of unknown fixed-effects parameters, is a -vector of the explanatory variables, is the vector of unobservable random effects in model (2), is an -dimensional vector of the parameters that relates the missingness to the binary outcome data by the shared random effects and is the effect of the first missingness mechanism at time on the second missingness mechanism at time , when . Let and .
The joint model, given the random effects (for simplicity conditioning on the explanatory variables is dropped throughout) is given by
(5) |
Also, given the random effects, we have
(6) |
(7) |
(8) |
(9) |
Therefore, the joint distribution function of is given by
(10) |
Note that and can be partitioned to and , respectively. Therefore,
In this paper, we propose Bayesian approach via MCMC to estimate the parameters of the model. The approach is described in the next section.
4. Estimation via MCMC
The model of the previous section can be written as the following hierarchical model:
(11) |
where , and are given in (2), (3) and (4), respectively. We now consider the following prior distributions for the unknown parameters:
(12) |
where denotes the inverse gamma distribution with shape parameter a, scale parameter b, and mean a/b, denotes an inverse Wishart distribution with scale parameter ν and matrix parameter . Also, represents the p-variate normal distribution with mean vector and covariance matrix .
The hyperparameters of (12) are assumed to be known and the priors are proper. In practice, the elicitation of hyperparameters may be difficult, they can be chosen based on strong prior knowledge or diffuse prior information. In this paper, we have no prior information from historical data or previous experience, so we prefer to assign the hyperparameters such that they lead to low-informative prior distribution for the parameters.
Suppose that , , , , , , and . We further assume , for and , to be the vector of unknown parameters. Also, elements of the parameter vector to be independent are assumed. The joint posterior density of all unobservable components is given by
(13) |
where is the joint prior distribution of the parameters i.e. the product of the priors given in (12). This joint posterior distribution is analytically intractable but MCMC methods such as the Gibbs sampler and the Metropolis–Hastings algorithm may be used to draw samples, from which features of the marginal posterior distribution of interest can be inferred. The Gibbs sampler works by drawing samples iteratively from conditional posterior distributions derived from (13). For this purpose, we need full conditional distributions. Let , and ω be one of its components. We define for the above-mentioned vector when ω is omitted from it. Based on this notation, the full conditional distributions for the unknown parameters are given by:
(14) |
where .
(15) |
(16) |
(17) |
(18) |
(19) |
(20) |
(21) |
(22) |
(23) |
(24) |
(25) |
(26) |
(27) |
and
(28) |
5. Simulation studies
5.1. Evaluation of the model and its performance comparison with that of using separate models
To examine the performance of the proposed joint model and to compare it with those obtained by using separate models, we conducted some simulation studies. We simulated N = 50 data sets with different sample sizes. The numbers of repeated measurements for individuals are different in each outcome and they obtained under a non-random missingness and ranged from 1 to for the continuous outcome and from 1 to for binary outcome. We consider the following joint model:
(29) |
(30) |
where , if and if .
(31) |
where .
(32) |
where . In this simulation study, , , , , , , , , , and are generated from a and a , respectively.
5.1.1. The effect of large sample size
We consider simulation studies with sample sizes n = 500, 1000, 2000 and 3000. Also, different association parameters are considered. The first is and , the second set of choices for parameters is and and the third set of choices for parameters is and . Also, the true values of parameters are , . The other true values are considered as , , and . Another simulation study with , , , and and , , is also performed.
The proposed joint model and separate models are fitted to each generated data. The results of these simulation studies are reported in Tables B.1 and B.2 of the Supplementary Materials B for the first set of association parameters, in Tables B.3 and B.4 for zero association parameters, in Tables B.5 and B.6 for the third association parameters and in Tables B.7 and B.8 for the last set of parameters. For more information about computational times of these models see Supplementary Materials E. In these tables, posterior mean, empirical standard errors, root of mean squared errors and relative biases are reported. The two letter criteria are given by and where , , is the estimate of θ for the ith sample. For the MCMC sampling, we ran 2 chains with 30,000 iterations, and 10,000 iterations of each chain are considered as burn-in period. Gelman-Rubin diagnosis test [13] are checked to ensure convergence of the MCMC chains. We consider the following prior distributions: , j = 1, 2, k = 0, 1, 2, 3, , k = 1, 2, 3, 4, 5, , k = 1, 2, 3, 4, 5, , , k = 1, 2, , j = 1, 2, k = 0, 1 and . The ‘R2OpenBUGS’ package is used for the implementation of these models.
Based on the results of these tables, the performance of the proposed joint model is well known for analyzing generated data under either zero or non-zero association parameters. The results of the simulation study for the non-zero association parameters model show that the increase in sample size is an effective way of decreasing biases and standard deviations of parameters. As shown in tables B.1, B.3, B.5 and B.7, relative biases and RMSEs are reduced by increasing sample sizes. This suggests that the proposed model is consistent. Based on the results of Tables B.2, B.6 and B.8, the performance of the separate models when data are generated under non-zero association parameters are not well, specially, the values of RMSEs and relative biases for the components of the variances are large. Also, based on the results of Tables B.3 and B.4, the performance of the proposed joint model when data are generated under the separate models is as good as that of the separate models. In general, these tables show that the proposed method performs well.
5.1.2. The effect of small sample size
For checking the performance of the model using small sample size, three different small sample sizes n = 50, 80, 100 are also considered. We use the above-mentioned model with and , , . The other true values are considered as , , and . The Bayesian implementation including prior distributions and the number of iterations, etc. are the same as those of the previous simulation. The results of this simulation study for the joint and separate models are given in Tables B.9 and B.10, respectively. Note that, the number of parameters for joint model is 27 and N = 50 sample size seems to be too small for having accurate parameter estimations. Also, increasing sample size to n = 100 leads to more accurate estimations. The results of fitting separate models for small sample sizes do not show acceptable results and the value of RMSE and relative biases for some parameters are too large.
5.2. Evaluation of distributional assumption for random effects
In this section, another simulation study is performed to investigate the performance of the proposed model with respect to different distributional assumption for random effects. For this purpose, the mentioned model of Section 5.1.1 with n = 500 and three different distributional assumptions for and , are considered and the proposed joint modeling is used for analyzing the data. These distributional assumptions are given as follows:
Student's t distribution: , that is, standard Student's t distribution with 2 degrees of freedom.
Skew-normal distribution: , that is, zero-mean skew-normal distribution with scale parameter 1 and skewness parameter , such that, .
Mixture of two normal distributions: , that is a mixture of two normal distributions with mean zero.
The other true values are considered as , , , , , and . The Bayesian implementation including prior distributions and the number of iterations, etc. are the same as those of the previous simulations.
The results of this simulation study are given in Table B.11. In comparison to the results of the normal distributional assumption, results of this simulation (see Table B.1) show that the regression coefficients of continuous longitudinal model are not very sensitive with respect to different distributional assumptions, but the regression coefficients of binary longitudinal model are sensitive, specially for Student's t and skew-normal distributions. Also, the regression coefficients of missingness mechanism for binary outcome of Student's t and skew-normal distributions are far from the real values.
6. Analysis of morbid obesity data
The main objective of the analysis of the morbid obesity data is to find the simultaneous effect of type of surgery on having any complication due to surgery and on BMI. Also, the association between BMI and complication due to surgery is in particular interest. The data were collected from n = 2890 patients who had at least 2 measurements of both BMI and complication due to surgery and other measurements are ranged from 1 to 8 for BMI and from 1 to 6 for complication due to surgery. Let be the BMI for patient i at time . Also, be complication due to surgery for patient i at time . Also, let , , , , , , , , , denote the age, gender and having Helicobacter, Type 2 Diabetes, Fatty liver, Dyslipidemia, Hypothyroidism, Mini-gastric bypass and RY gastric bypass, respectively. We consider the following joint model:
(33) |
where and . Also, the following models are used for missingness mechanisms: , where
and , where
where and .
In the Bayesian method, two parallel MCMC chains are run with different initial values for 80,000 iterations each. Then, we have discarded the first 40,000 iterations as pre-convergence burn-in and retained 40,000 for the posterior inference. For checking the convergence of the MCMC chains, we have used the Gelman–Rubin diagnostic test. Also, trace plots of unknown parameters are shown in Figures D.1–D.6 of Supplementary Materials D. The prior distributions for the unknown parameters are the same as those of the simulation study section and are given by: , , , , , and . Table 2 shows the results of the Bayesian joint modeling and separate models on the data set. These two models are compared using EAIC, EBIC, DIC and LPML. These criteria are given by Baghfalaki et al. [4,5]. Unlike EAIC, EBIC and DIC, larger values of LPML indicate a better fitting model. These comparison criteria show that the joint model has a better fit to the data.
Table 2. Parameter estimates (Est.), standard deviation (S.D.) and 95 credible interval for analyzing morbid obesity data.
Joint model | Separate model | |||||||
---|---|---|---|---|---|---|---|---|
Parameters | Est. | S.D. | CI 2.5 | CI 97.5 | Est. | S.D. | CI 2.5 | CI 97.5 |
Continuous submodel (BMI) | ||||||||
(Intercept) | 50.850 | 3.027 | 45.270 | 55.150 | 52.430 | 1.691 | 48.870 | 55.100 |
(Time) | −2.092 | 0.013 | −2.116 | −2.067 | −2.092 | 0.012 | −2.117 | −2.068 |
(Age) | 0.001 | 0.012 | −0.024 | 0.027 | −0.001 | 0.012 | −0.025 | 0.021 |
(Gender; female) | −1.988 | 0.344 | −2.623 | −1.308 | −2.143 | 0.298 | −2.691 | −1.519 |
(Mini-gastric bypass) | 0.704 | 0.353 | 0.051 | 1.450 | 0.693 | 0.357 | −0.049 | 1.353 |
(RY gastric bypass) | 0.118 | 0.362 | −0.544 | 0.877 | 0.127 | 0.363 | −0.603 | 0.805 |
(Fatty liver) | 1.060 | 0.264 | 0.572 | 1.588 | 1.065 | 0.245 | 0.572 | 1.530 |
(Helicobacter) | 0.049 | 0.204 | −0.356 | 0.441 | 0.036 | 0.208 | −0.377 | 0.454 |
(Type 2 Diabetes) | −0.895 | 0.288 | −1.474 | −0.333 | −0.880 | 0.289 | −1.442 | −0.308 |
(Dyslipidemia) | −0.545 | 0.222 | −0.976 | −0.108 | −0.566 | 0.218 | −0.996 | −0.140 |
(Hypothyroidism) | 0.281 | 0.262 | −0.237 | 0.788 | 0.281 | 0.259 | −0.230 | 0.781 |
(Intercept) | 0.775 | 0.044 | 0.689 | 0.862 | 0.774 | 0.044 | 0.689 | 0.861 |
(Mini-gastric bypass) | 0.597 | 0.049 | 0.500 | 0.694 | 0.598 | 0.049 | 0.502 | 0.695 |
(RY gastric bypass) | 0.047 | 0.050 | −0.052 | 0.146 | 0.048 | 0.050 | −0.051 | 0.146 |
27.840 | 0.789 | 26.320 | 29.410 | 27.820 | 0.785 | 26.310 | 29.391 | |
11.000 | 0.129 | 10.750 | 11.250 | 11.000 | 0.130 | 10.740 | 11.250 | |
−0.007 | 0.003 | −0.013 | 0.000 | −0.007 | 0.003 | −0.013 | 0.000 | |
Binary submodel (Complication due to surgery) | ||||||||
(Intercept) | 1.554 | 0.938 | −0.080 | 3.696 | 2.376 | 0.528 | 1.397 | 3.411 |
(Time) | −0.136 | 0.098 | −0.324 | 0.054 | −0.277 | 0.013 | −0.303 | −0.250 |
(Age) | −0.001 | 0.003 | −0.007 | 0.004 | −0.002 | 0.002 | −0.006 | 0.002 |
(Gender; female) | −0.373 | 0.086 | −0.549 | −0.211 | −0.326 | 0.067 | −0.461 | −0.195 |
(Mini-gastric bypass) | 0.244 | 0.095 | 0.057 | 0.428 | 0.095 | 0.074 | −0.051 | 0.242 |
(RY gastric bypass) | 0.100 | 0.099 | −0.097 | 0.290 | 0.090 | 0.078 | −0.064 | 0.243 |
(Fatty liver) | −0.182 | 0.060 | −0.299 | −0.065 | −0.224 | 0.050 | −0.324 | −0.125 |
(Helicobacter) | −0.079 | 0.049 | −0.175 | 0.016 | −0.058 | 0.041 | −0.140 | 0.022 |
(Type 2 Diabetes) | −0.077 | 0.068 | −0.211 | 0.057 | −0.060 | 0.058 | −0.173 | 0.054 |
(Dyslipidemia) | 0.145 | 0.051 | 0.045 | 0.246 | 0.134 | 0.043 | 0.050 | 0.220 |
(Hypothyroidism) | 0.008 | 0.061 | −0.112 | 0.129 | −0.045 | 0.052 | −0.146 | 0.056 |
(Intercept) | −0.594 | 0.118 | −0.826 | −0.361 | 0.571 | 0.099 | 0.375 | 0.766 |
(Mini-gastric bypass) | 0.887 | 0.127 | 0.634 | 1.136 | 0.936 | 0.110 | 0.723 | 1.154 |
(RY gastric bypass) | −0.092 | 0.130 | −0.346 | 0.164 | −0.059 | 0.112 | −0.278 | 0.163 |
0.506 | 0.068 | 0.382 | 0.647 | 0.093 | 0.027 | 0.042 | 0.146 | |
2.258 | 0.177 | 1.940 | 2.629 | 4.679 | 0.826 | 3.569 | 6.718 | |
Shared parameters | ||||||||
0.070 | 0.008 | 0.056 | 0.085 | – | – | – | – | |
−0.015 | 0.006 | −0.026 | −0.004 | – | – | – | – | |
−0.021 | 0.004 | −0.030 | −0.012 | – | – | – | – | |
−0.012 | 0.005 | −0.021 | −0.003 | – | – | – | – | |
0.001 | 0.006 | −0.011 | 0.013 | – | – | – | – | |
0.005 | 0.009 | −0.012 | 0.022 | – | – | – | – | |
4.059 | 0.103 | 3.859 | 4.263 | – | – | – | – | |
2.373 | 0.081 | 2.215 | 2.533 | – | – | – | – | |
1.803 | 0.072 | 1.664 | 1.943 | – | – | – | – | |
1.449 | 0.076 | 1.302 | 1.599 | – | – | – | – | |
1.083 | 0.081 | 0.922 | 1.241 | – | – | – | – | |
0.854 | 0.096 | 0.666 | 1.043 | – | – | – | – | |
EAIC | 142,190 | 147966 | ||||||
EBIC | 142,458.6 | 148,163 | ||||||
DIC | 4604 | 4717 | ||||||
LPML | −59,119.75 | −61,951.82 |
This table shows that time, gender, type of surgery, fatty liver, type 2 diabetes and dyslipidemia are significant predictors in the continuous model (BMI), also, gender, type of surgery (mini-gastric bypass), fatty liver and dyslipidemia are significant predictors in the binary model (indicator of complication due to surgery). Based on the results, the increase in time reduces BMI index, females have smaller BMI and smaller probability to have complication due to surgery. The persons with fatty liver have larger values of BMI and smaller probability to have complication due to surgery. In addition, the persons with Dyslipidemia have smaller BMI but have the larger probability to have complication due to surgery. The conditional model shows a statistically significant relationship between missing time points and complication due to surgery ( is significantly different from zero), but not for BMI because, this dependence was not detected with the shared parameter model, where was not significantly different from zero. Therefore, a significant positive relationship between complication due to surgery and intermittent missingness (estimate was highly significant), suggesting that the missing-data mechanism is non-ignorable. Also, parameters and , , show that the continuous and binary longitudinal measurements are dependent, also, two missingness mechanisms are dependent. Figures C.1–C.6 of supplementary material C show the posterior densities for the unknown parameters in the models. Figures C.1–C.4 compare the posterior densities of parameters using separate and joint models. Figure C.1 shows that the behavior of the predictors on modeling BMI in the two competing models is similar. Also, Figure C.2 shows that the performance of two models in estimating parameters of the binary model is a little different. Based on Figure C.3, except , the regression coefficients of missingness mechanisms in the two approaches are equivalent. Also, Figure C.4 shows that posterior densities of components of variance in the two approaches are different. Figures C.5 and C.6 show the posterior densities for the association parameters of the joint model. These figures show that most of the association parameters have non-zero means.
As other criteria for comparing two models, we use true positive rate (TPR), false positive rate (FPR) and true discovery rate (TDR) of the observed values of indicators of complication due to surgery and their predicted values by two approaches. Let if th component of the indicator of complication due to surgery are predicted to be one. Also, for the observed values of situation if th component of the indicator of complication due to surgery is in fact one. The TPR, the FPR and the TDR of the method can be calculated as follows:
A receiver operating characteristic (ROC) curve plots FPR versus TPR for the possible cutoffs. A common method of comparing the methods is the area under the ROC curve (often for simplicity referred to as the AUC). AUC is a portion of the area of the unit square; its value is always between 0 and 1. The larger the value of AUC, the better is the performance of the approach. The values of TPR, FPR and TDR for cutoff point equal to 0.5 for two competing models are reported in Table 3. TPR for the joint model is a bit smaller than that of separate models, but the difference of FPR between two models is a large value. Other criteria of this table also show the better performance of the joint model in comparison with that of using separate models. Furthermore, the values of AUC for two competing models are reported in this table and the larger value belongs to the joint model. Figure 3 shows ROC curve for indicator of complication due to surgery for two competing models which shows the AUC for joint model is larger than that of separate model. A sensitivity analysis with respect to the prior distributions of regression coefficients and association parameters is also performed. For this purpose, the sensitivity of the posterior mean to different values of the variance of the prior distribution is investigated. The results show that the parameter estimates are robust with respect to the normal priors when variances are considered to be larger than 100.
Table 3. The values of TPR, FPR and TDR for cutoff point equal to 0.5 and AUC for two competing models in analyzing morbid obesity data.
Joint model | Separate models | |
---|---|---|
TPR | 0.875 | 0.909 |
FPR | 0.469 | 0.864 |
TDR | 0.697 | 0.527 |
AUC | 0.817 | 0.660 |
Figure 3.
ROC curves of observed and predicted values by two competing approaches of complication due to surgery for the morbid obesity data.
7. Conclusions
In this paper, we propose a joint model for a mixed continuous and binary outcomes with different missing patterns. We use an MCMC procedure via the Gibbs sampler and Metropolis-Hastings algorithm to draw samples for parameters estimation. This process is facilitated using the freely available OpenBUGS software [23].
Two shared parameter models are considered for missingness mechanisms of two responses which jointly modeled using a conditional model. The usefulness of shared-parameter models for analyzing longitudinal data subject to non-random missing data is shown by many authors [1–3]. It should be recognized that, although these models account for NMAR data, it is impossible to distinguish between MAR and NMAR data within this class of models [3]. Also, one can use pattern mixture model and selection model frameworks for modeling missing mechanism [18–20].
In this paper, a normality assumption is used for the random effects and errors. One can use more flexible distributional assumption such as t, skew-normal, skew-t or other distributional assumptions for random effects. Also, a logistic mixed-effects model is used for analyzing binary data including binary longitudinal data and missingness mechanisms. As a future work, one can use the probit mixed-effects model for modeling the binary outcomes.
Also, we consider non-informative prior distributions for the unknown parameters, one can use informative priors or empirical Bayes approach for choosing hyper parameters in the Bayesian paradigm.
Supplementary Material
Acknowledgments
This work has been supported by the grant number 97014144 from Iranian National Science Foundation (INSF). The authors would like to thank the INSF. The authors also acknowledge the valuable suggestions from the referees.
Funding Statement
This work has been supported by Iranian National Science Foundation (INSF) [grant number 97014144].
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
- 1.Albert P.S. and Follmann D.A., Modeling repeated count data subject to informative dropout, Biometrics 56 (2000), pp. 667–677. doi: 10.1111/j.0006-341X.2000.00667.x [DOI] [PubMed] [Google Scholar]
- 2.Albert P.S. and Follmann D.A., A random effects transition model for longitudinal binary data with informative missingness, Statistica Neerlandica 57(2003), pp. 100–111. doi: 10.1111/1467-9574.00223 [DOI] [Google Scholar]
- 3.Albert P.S. and Follmann D., Shared-parameter models, in Longitudinal Data Analysis, Chapman & Hall/CRC, Boca Raton, 2009, pp. 433–452.
- 4.Baghfalaki T., Ganjali M., and Berridge D., Joint modeling of multivariate longitudinal mixed measurements and time to event data using a Bayesian approach, J. Appl. Stat. 41 (2014), pp. 1934–1955. doi: 10.1080/02664763.2014.898132 [DOI] [Google Scholar]
- 5.Baghfalaki T., Ganjali M., and Hashemi R., Bayesian joint modeling of longitudinal measurements and time-to-event data using robust distributions, J. Biopharm. Stat. 24 (2014), pp. 834–855. doi: 10.1080/10543406.2014.903657 [DOI] [PubMed] [Google Scholar]
- 6.Bahrami Samani E. and Ganjali M., Bayesian latent variable model for mixed continuous and ordinal responses with possibility of missing responses, J. Appl. Stat. 38 (2011), pp. 1103–1116. doi: 10.1080/02664763.2010.484485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Buhule O.D., Wahed A.S., and Youk A.O., Bayesian hierarchical joint modeling of repeatedly measured continuous and ordinal markers of disease severity: Application to Ugandan diabetes data, Stat. Med. 36 (2017), pp. 4677–4691. doi: 10.1002/sim.7444 [DOI] [PubMed] [Google Scholar]
- 8.Catalano P.J., Bivariate modelling of clustered continuous and ordered categorical outcomes, Stat. Med. 16 (1997), pp. 883–900. doi: [DOI] [PubMed] [Google Scholar]
- 9.Dunson D.B. and Herring A.H., Bayesian latent variable models for mixed discrete outcomes, Biostatistics 6 (2005), pp. 11–25. doi: 10.1093/biostatistics/kxh025 [DOI] [PubMed] [Google Scholar]
- 10.Ganjali M., A model for mixed continuous and discrete responses with possibility of missing responses, J. Sci. Islam. Repub. Iran 14 (2003), pp. 53–60. [Google Scholar]
- 11.Ganjali M. and Shafie K., A transition model for an ordered cluster of mixed continuous and discrete responses with non-monotone missingness, J. Appl. Stat. Sci. 15 (2006), pp. 331–343. [Google Scholar]
- 12.Gaskins J.T., Daniels M.J., and Marcus B.H., Bayesian methods for nonignorable dropout in joint models in smoking cessation studies, J. Am. Stat. Assoc. 111 (2016), pp. 1454–1465. doi: 10.1080/01621459.2016.1167693 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gelman A. and Rubin D.B., Inference from iterative simulation using multiple sequences, Stat. Sci. 7 (1992), pp. 457–472. doi: 10.1214/ss/1177011136 [DOI] [Google Scholar]
- 14.Gueorguieva R.V. and Sanacora G., Joint analysis of repeatedly observed continuous and ordinal measures of disease severity, Stat. Med. 25 (2006), pp. 1307–1322. doi: 10.1002/sim.2270 [DOI] [PubMed] [Google Scholar]
- 15.Kurum E., Hughes J., and Li R., A semivarying joint model for longitudinal binary and continuous outcomes, Canadian Journal of Statistics 44 (2016), pp. 44–57. doi: 10.1002/cjs.11273 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Li Q., Pan J., and Belcher J., Bayesian inference for joint modelling of longitudinal continuous, binary and ordinal events, Stat. Methods Med. Res. 25 (2016), pp. 2521–2540. doi: 10.1177/0962280214526199 [DOI] [PubMed] [Google Scholar]
- 17.Liu X., Daniels M.J., and Marcus B., Joint models for the association of longitudinal binary and continuous processes with application to a smoking cessation trial, J. Am. Stat. Assoc. 104 (2009), pp. 429–438. doi: 10.1198/016214508000000904 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Michiels B., Molenberghs G., Bijnens L., Vangeneugden T., and Thijs H., Selection models and pattern-mixture models to analyse longitudinal quality of life data subject to drop-out, Stat. Med. 21 (2002), pp. 1023–1041. doi: 10.1002/sim.1064 [DOI] [PubMed] [Google Scholar]
- 19.Michiels B., Molenberghs G., and Lipsitz S.R., Selection models and pattern-mixture models for incomplete categorical data with covariates, Biometrics 55 (1999), pp. 978–983. doi: 10.1111/j.0006-341X.1999.00978.x [DOI] [PubMed] [Google Scholar]
- 20.Molenberghs G., Michiels B., and Kenward M.G, Pseudo-likelihood for combined selection and pattern-mixture models for missing data problems, Biom. J. 40 (1998), pp. 557–572. doi: [DOI] [Google Scholar]
- 21.Rubin D.B., Inference and missing data, Biometrika 63 (1976), pp. 581–592. doi: 10.1093/biomet/63.3.581 [DOI] [Google Scholar]
- 22.Samani E.B., Ganjali M., and Eftekhari S., A latent variable model for mixed continuous and ordinal responses with nonignorable missing responses: Assessing the local influence via covariance structure, Sankhya B 72 (2010), pp. 38–57. doi: 10.1007/s13571-010-0003-7 [DOI] [Google Scholar]
- 23.Spiegelhalter D.J., Best N.G., Carlin B.P., and van der Linde A., Bayesian measures of model complexity and fit, J. R. Stat. Soc. B (Stat. Methodol.), 64(2002), pp. 583–639. doi: 10.1111/1467-9868.00353 [DOI] [Google Scholar]
- 24.Teimourian M., Baghfalaki T., Ganjali M., and Berridge D., Joint modeling of mixed skewed continuous and ordinal longitudinal responses: A Bayesian approach, J. Appl. Stat. 42(2015), pp. 2233–2256. doi: 10.1080/02664763.2015.1023557 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.