Skip to main content
Journal of Applied Statistics logoLink to Journal of Applied Statistics
. 2020 Sep 18;49(3):638–655. doi: 10.1080/02664763.2020.1822303

A Bayesian shared parameter model for joint modeling of longitudinal continuous and binary outcomes

T Baghfalaki a,CONTACT,, M Ganjali b, A Kabir c, A Pazouki c,d,e
PMCID: PMC9042025  PMID: 35706768

Abstract

Joint modeling of associated mixed biomarkers in longitudinal studies leads to a better clinical decision by improving the efficiency of parameter estimates. In many clinical studies, the observed time for two biomarkers may not be equivalent and one of the longitudinal responses may have recorded in a longer time than the other one. In addition, the response variables may have different missing patterns. In this paper, we propose a new joint model of associated continuous and binary responses by accounting different missing patterns for two longitudinal outcomes. A conditional model for joint modeling of the two responses is used and two shared random effects models are considered for intermittent missingness of two responses. A Bayesian approach using Markov Chain Monte Carlo (MCMC) is adopted for parameter estimation and model implementation. The validation and performance of the proposed model are investigated using some simulation studies. The proposed model is also applied for analyzing a real data set of bariatric surgery.

Keywords: Conditional model, intermittent missingness, joint modeling, longitudinal data, MCMC methods, mixed-effects model

1. Introduction

In many medical studies, more than one biomarker, with different types of outcomes, may be collected for the same patient. In most of the applications, these biomarkers are highly associated. They may also show different patterns for the effect of treatment. Separate modeling of these biomarkers may lead to misleading results, since the associations among biomarkers are not taken into account.

Simultaneous or joint modeling of mixed biomarkers is an effective approach for accounting associations among different biomarkers and more likely to provide more valid results and conclusions.

The motivation for our study is based on the data collected from people with morbid obesity undergoing a weight loss surgery recorded in an Iranian hospital. This data set contains one longitudinal binary response and one continuous longitudinal outcome. The continuous longitudinal outcome (BMI in our study) was collected from the first visit of the study while the longitudinal binary outcome (having complication due to surgery) was collected after the surgery. The existence of missing values is an inevitable part of each longitudinal study. In our study, the pattern of missingness of two longitudinal outcomes is different. Therefore, we consider two missingness mechanisms for our two longitudinal outcomes.

Rubin [21] distinguished among three important missingness mechanisms. When missingness mechanism is unrelated to the missing or observed values, it is called missing completely at random (MCAR). When missingness mechanism depends only on the observed data, that is, given the observed data, missingness does not depend on unobserved values, missing mechanism is called missing at random mechanism (MAR). A mechanism wherein missingness depends on the unobserved data, perhaps in addition to the observed data, is called missing not at random (MNAR). In the likelihood and Bayesian paradigms and when mild regularity conditions are satisfied, the MCAR and MAR mechanisms are ignorable, in the sense that inferences can proceed by analyzing the observed data only, without explicitly addressing a parametric form for the missing mechanism. But, MNAR mechanism is non-ignorable that is for analyzing the data set with MNAR mechanism modeling missingness mechanism in addition to longitudinal measurements modeling is essential. In this status, ignoring the missingness mechanism leads to biased parameter estimates. Therefore, we need some methods that can be jointly modeled missingness mechanisms and longitudinal measurements.

Joint modeling of longitudinal mixed outcomes has been studied by many authors. Usually, a random effects model, a marginal model or a conditional model is used for accounting association between the response variables (vide [6–11,14,22,24]). Li et al. [16] proposed the use of a random effects model for joint modeling of longitudinal continuous, binary and ordinal variables in a Bayesian paradigm. Kurum et al. [15] propose a semivarying joint model for longitudinal binary and continuous outcome in a frequentist paradigm. Liu et al. [17] proposed a joint model for modeling longitudinal binary and continuous data in a Bayesian paradigm and they used a data augmentation approach for handling missing values considering an ignorable missingness mechanism. Also, Gaskins et al. [12] propose a non-ignorable drop-out model via using a pattern mixture model for joint modeling of two longitudinal outcomes.

In this paper, we propose a joint modeling of binary and longitudinal continuous data using a conditional model. As the patterns of misingness for two longitudinal outcomes are different, we use two shared parameter models for missingness mechanisms. Also, dependence of two missingness mechanisms is considered by a conditional model. We develop the Bayesian paradigm for parametric inference. To adopt the Bayesian approach, Markov Chain Monte Carlo (MCMC) methods are used. The iterative simulation of the conditional posterior distribution for each parameter is done using the Gibbs sampler, where the OpenBUGS software is used for model implementation. We use the proposed approach to analyze a real data with morbid obesity recorded in Rasul-e-Akram hospital of Tehran in Iran. In this study, all the patients who have a weight loss surgery between July 2008 and April 2017 are considered. The surgery is categorized on three levels: Mini-gastric bypass, RY gastric bypass and some other laparoscopic surgeries. The simultaneous effect of surgery on BMI and the indicator of complications due to surgery, as two associated mixed longitudinal responses, is investigated.

The rest of the paper is organized as follows. In Section 2, we describe the real data set that would be analyzed in this paper. In Section 3, the formulation of joint model for analyzing of longitudinal mixed continuous and binary measurements is discussed. Section 4 includes a Bayesian implementation of the proposed method. Some simulation studies are implemented for investigating the performance of the proposed method in Section 5. Section 6 includes our application where the proposed model is considered for analyzing the data set. Some conclusions are also given in the last section.

2. Motivation: applied example

The motivation of our study is based on the data collected from people with morbid obesity in Rasul-e-Akram hospital of Tehran which is a referral hospital for bariatric surgery (weight loss surgery) in Iran. The data are recorded between July 2008 and April 2017 for people who undergo Mini-gastric bypass, RY gastric bypass and some other laparoscopic surgery. These people were enrolled and followed up for 9 months. The data on these patients were drawn from Iran national obesity surgery data base. The evaluation of the patients was conducted at three phases: preoperation, peri-operation and postoperation follow-up at 1-, 3-, 6- and 9-month after surgery.

In the pre-operative phase, the basic information includes demographic characteristics (age, sex) and medical history [fatty liver, type 2 Diabetes, dyslipidemia, hypothyroidism (also called underactive or low thyroid)]. These are recorded and considered as predictors in our study. Table 1 summarizes these basic characteristics for each type of surgery. In this table, the mean of age and the proportion of having each characteristic are reported. Also, p-values are given for testing whether samples are from the same distribution for binary outcomes and for testing the equality of means for three types of surgeries. In the peri-operation phase which includes after surgery, after 10 days from surgery and at each follow-up period, the patients were examined for complications due to surgery such as vomiting, constipation, diarrhea, and dumping and the results were recorded. Also, BMI at each visit was recorded. This cohort study was conducted on 554 male patients and 2336 female patients. In this cohort study, 1558 out of 2890 patients have Mini-gastric bypass surgery, 1031 out of 2890 patients have RY gastric bypass surgery and the 301 remaining patients have other laparoscopic surgeries. The mean±sd of BMI for entry, before surgery, after surgery, 10th days after surgery and at 1-, 3-, 6- and 9-month after surgery are given by 44.958±6.562, 44.857±6.236, 45.362±6.338, 42.302±6.002, 40.581±5.999, 36.130±5.570, 32.561±5.241 and 30.559±5.105, respectively. The behavior of BMI for three types of surgeries during time are plotted in Figure 1. Also, patterns of missingness for BMI and indicator for complications due to surgery are plotted in Figure 2. In each panel of this figure, each row represents an existing pattern of missingness. The red color represents observed values and the gray color represents missingness for that time.

Table 1. Basic characteristics for each type of surgery and total patients ( mean±sd).

  Mini-gastric bypass RY gastric bypass Other laparoscopic p-value Total
Age 38.132±10.538 38.488±10.065 36.827±10.010 0.063 38.123±10.324
Gender 0.799±0.010 0.830±0.011 0.777±0.024 0.055 0.808±0.007
Fatty liver 0.817±0.009 0.729±0.013 0.724±0.025 <0.001 0.776±0.007
Helicobacter 0.438±0.012 0.424±0.015 0.372±0.027 0.102 0.426±0.009
Type 2 Diabetes 0.212±0.010 0.147±0.011 0.089±0.016 <0.001 0.176±0.007
Dyslipidemia 0.384±0.010 0.441±0.011 0.335±0.016 <0.001 0.399±0.007
Hypothyroidism 0.198±0.010 0.192±0.012 0.153±0.021 0.176 0.191±0.007

Figure 1.

Figure 1.

Profiles of BMI measurements over time for different types of laparoscopic surgery superimposed by mean over all observed individuals at each time shown by black line. (a) Mini-gastric bypass surgery, (b) RY gastric bypass surgery, (c) other laparoscopic surgery.

Figure 2.

Figure 2.

Pattern of missingness for BMI (left panel) and complication due to surgery (right panel). Each row represents a pattern of missingness at each time point. Red color represents observed values and gray color represents missingness for the occasion time.

3. Model specification

3.1. Joint model

The data structure can be described by four components. The first component contains continuous repeated measurements that are assumed to follow a linear mixed-effects model. The second component contains repeated measurements in a binary scale that are assumed to follow a mixed-effects logistic regression model. The last two components contain the missingness mechanisms for the first two components.

Let Yij be the longitudinal continuous response for the ith, i=1,2,,n, subject at time tij,j=1,2,,my and Yi=(Yi1,,Yimy) be the vector of longitudinal measurements for the ith subject at time ti=(ti1,,timy). Assume that Yij follows a linear mixed-effects model as

Yij=xijyβ1+wijyb1i+εij, (1)

where β1 is a vector of py unknown fixed-effects parameters, xijy is a py-vector of the explanatory variables, b1i is a vector of qy unobservable random effects, wijy is an qy-dimensional design vector for random effects and εij is the random error variable. We assume that b1is are independent and identically normally distributed with zero-mean vector and covariance matrix Dy. Typically, the error terms, εij's, are assumed to be normal with zero mean and variance σε2.

Also, assume that in addition to my-dimensional vector of longitudinal continuous measurements Yi, longitudinal binary measurements are also observed for the ith subject. Let Zij be the longitudinal binary response for the ith subject, i=1,2,,n at time sij,j=1,2,,mz. Also, let Zi=(Zi1,,Zimz) be the vector of longitudinal binary measurements at time si=(si1,,simz). Assume that Zij follows the logistic mixed-effects regression model as

Zij|b2i,yijBer(πij),logit(πij)=xijzβ2+wijzb2i+γjyij, (2)

where β2 is a vector of pz unknown fixed-effects parameters, xijz is a pz-vector of the explanatory variables, b2i is a vector of qz unobservable random effects, wijz is a qz-dimensional design vector for random effects and γj is the effect of the first response at time tij (i.e. Yij) on the second response on the time sij (i.e. Zij), when tij=sij. This parameter can take into account the association between two responses at the same time. We assume that b2is are independent and identically normally distributed with zero-mean vector and covariance matrix Dz. Note that, for simplicity and our scientific question of interest in application, we assume that the longitudinal continuous response at a time point may affect the longitudinal binary response at the same time point.

Let Rijy and Rijz be the missingness indicators for the continuous and the binary response at time tij and sij, respectively, such that Rijy=1 if Yij is missed and Rijz=1 if Zij is missed. We consider a shared parameter model [3] for considering non-ignorable mechanisms for Yi and Zi. Assume that Rijy follows the logistic mixed-effects regression model as

Rijy|b1iBer(πijRy),logit(πijRy)=uijyα1+ψyb1i, (3)

where α1 is a vector of pry unknown fixed-effects parameters, uijy is a pry-vector of the explanatory variables, b1i is the vector of unobservable random effects in model (1), ψy is an qy-dimensional vector of parameters that relate the missingness to the continuous outcome data by the shared random effects. Also, we assume that Rijz follows the logistic mixed-effects regression model as

Rijz|b2i,rijyBer(πijRz),logit(πijRz)=uijzα2+ψzb2i+κjrijy, (4)

where α2 is a vector of prz unknown fixed-effects parameters, uijz is a prz-vector of the explanatory variables, b2i is the vector of unobservable random effects in model (2), ψz is an qz-dimensional vector of the parameters that relates the missingness to the binary outcome data by the shared random effects and κj is the effect of the first missingness mechanism at time tij (Rijy) on the second missingness mechanism at time sij (Rijz), when tij=sij. Let Riy=(Ri1y,R2iy,,Rimyy) and Riz=(Ri1z,R2iz,,Rimzz).

The joint model, given the random effects (for simplicity conditioning on the explanatory variables is dropped throughout) is given by

f(yi,zi,riy,riz|b1i,b2i)=f(zi|yi,riy,riz,b1i,b2i)×f(yi|riy,riz,b1i)×f(riz|riy,b2i,b2i)×f(riy|b1i,b2i)=f(zi|yi,riz,b2i)×f(yi|riy,b1i)×f(riz|riy,b2i)×f(riy|b1i)=f(zi|yi,b2i)×f(yi|b1i)×f(riz|riy,b2i)×f(riy|b1i). (5)

Also, given the random effects, we have

f(yi|b1i)=j=1myf(yij|b1i), (6)
f(zi|b2i,yi)=j=1mzf(zij|b1i,yij), (7)
f(riy|b1i)=j=1myf(rijy|b1i), (8)
f(riz|b2i,riy)=j=1mzf(rijz|b2i,rijy). (9)

Therefore, the joint distribution function of (yi,zi,riy,riz) is given by

f(yi,zi,riy,riz)=b1ib2if(yi,zi,riy,riz|b1i,b2i)g(b1i,b2i)db1idb2i=b1ib2ij=1myf(yij|b1i)j=1mzf(zij|b2i,yij)×j=1myf(rijy|b1i)j=1mzf(rijz|b2i,rijy)g(b1i,b2i)db1idb2i. (10)

Note that yi and zi can be partitioned to yi=(yiobs,yimis) and zi=(ziobs,zimis), respectively. Therefore,

f(yiobs,ziobs,riy,riz)=yimiszimisb1ib2if(yi,zi,riy,riz|b1i,b2i)g(b1i,b2i)db1idb2idyimis=b1ib2if(yiobs,ziobs,riy,riz|b1i,b2i)g(b1i,b2i)db1idb2i.

In this paper, we propose Bayesian approach via MCMC to estimate the parameters of the model. The approach is described in the next section.

4. Estimation via MCMC

The model of the previous section can be written as the following hierarchical model:

Yij|b1iN(xijyβ1+wijyb1i,σ2),Zij|b2i,yijBer(πij),Rijy|b1iBer(πijRy),Rijz|b2i,rijyBer(πijRz), (11)

where πij, πijRy and πijRz are given in (2), (3) and (4), respectively. We now consider the following prior distributions for the unknown parameters:

β1Npy(μβ1,Σβ1),σ2IΓ(aσ2,bσ2),β2Npz(μβ2,Σβ2),D1IWishart(ΨD1,νD1),D2IWishart(ΨD2,νD2),α1Npry(μα1,Σα1),α2Nprz(μα2,Σα2),ψyNqy(μψy,Σψy),ψzNqz(μψz,Σψz),κjN(μκj,σκj2),j=1,2,,my,γjN(μγj,σγj2),j=1,2,,mz, (12)

where IΓ(a,b) denotes the inverse gamma distribution with shape parameter a, scale parameter b, and mean a/b, Iwishart(ψ,ν) denotes an inverse Wishart distribution with scale parameter ν and matrix parameter ψ. Also, Np(μ,Σ) represents the p-variate normal distribution with mean vector μ and covariance matrix Σ.

The hyperparameters of (12) are assumed to be known and the priors are proper. In practice, the elicitation of hyperparameters may be difficult, they can be chosen based on strong prior knowledge or diffuse prior information. In this paper, we have no prior information from historical data or previous experience, so we prefer to assign the hyperparameters such that they lead to low-informative prior distribution for the parameters.

Suppose that yobs=(y1obs,,ynobs), ymis=(y1mis,,ynmis), zobs=(z1obs,,znobs), zmis=(z1mis,,znmis), ry=(r1y,,rny), rz=(r1z,,rnz), b1=(b11,,b1n) and b2=(b21,,b2n). We further assume θ={β1,β2,σ2,D1,D2,α1,α2,ψy,ψz,κk,γl}, for k=1,2,,,my and l=1,2,,,mz, to be the vector of unknown parameters. Also, elements of the parameter vector to be independent are assumed. The joint posterior density of all unobservable components is given by

π(θ,b1,b2,ymis,zmis|yobs,zobs,ry,rz)i=1nj=1myϕ(yij;xijyβ1+wijyb1i,σε2)×(πijRy)rijy(1πijRy)rijy×i=1nk=1mzπikzik(1πik)zik×(πikRz)rikz(1πikRz)rikz×π(θ), (13)

where π(θ) is the joint prior distribution of the parameters i.e. the product of the priors given in (12). This joint posterior distribution is analytically intractable but MCMC methods such as the Gibbs sampler and the Metropolis–Hastings algorithm may be used to draw samples, from which features of the marginal posterior distribution of interest can be inferred. The Gibbs sampler works by drawing samples iteratively from conditional posterior distributions derived from (13). For this purpose, we need full conditional distributions. Let ϑ=(θ,b1,b2,ymis,zmis), and ω be one of its components. We define ϑ(ω) for the above-mentioned vector when ω is omitted from it. Based on this notation, the full conditional distributions for the unknown parameters are given by:

β|ϑ(β),yobs,zobs,ry,rzNpy(Σβ|(i=1nxiyyiwiyb1iσε2),Σβ|) (14)

where Σβ|=(i=1nxiyxiyσε2+Σβ11)1.

σε2|ϑ(σε2),yobs,zobs,ry,rzIΓ(i=1n12myn2+aσ2,bσ2+12i=1nj=1my(yijxijyβ1+wijyb1i)2) (15)
π(β2|ϑ(β2),yobs,zobs,ry,rz)i=1nj=1mzexp(xijzβ2+wijzb2i+γjyij)zij1+exp(xijzβ2+wijzb2i+γjyij)×ϕpz(β2;μβ2,Σβ2), (16)
π(α1|ϑ(α1),yobs,zobs,ry,rz)i=1nj=1mzexp(uijyα1+ψyb1i)rijy1+exp(uijyα1+ψyb1i)×ϕpry(α1;μα1,Σα1), (17)
π(α2|ϑ(α2),yobs,zobs,ry,rz)i=1nj=1mzexp(uijzα2+ψzb2i+κjrijy)rijz1+exp(uijzα2+ψzb2i+κjrijy)×ϕprz(α2;μα2,Σα2), (18)
D1|ϑ(D1),yobs,zobs,ry,rzIwishart(ψy+i=1nb1ib1i,ν+qy+n) (19)
D2|ϑ(D2),yobs,zobs,ry,rzIwishart(ψz+i=1nb2ib2i,ν+qz+n) (20)
π(ψy|ϑ(ψy),yobs,zobs,ry,rz)i=1nj=1mzexp(uijyα1+ψyb1i)rijy1+exp(uijyα1+ψyb1i)×ϕqy(ψy;μψy,Σψy), (21)
π(ψz|ϑ(ψz),yobs,zobs,ry,rz)i=1nj=1mzexp(uijzα2+ψzb2i+κjrijy)rijz1+exp(uijzα2+ψzb2i+κjrijy)×ϕqy(ψz;μψz,Σψz), (22)
π(κj|ϑ(κj),yobs,zobs,ry,rz)i=1nexp(uijzα2+ψzb2i+κjrijy)rijz1+exp(uijzα2+ψzb2i+κjrijy)ϕ(κj;μκj,σκj2), (23)
π(γj|ϑ(γj),yobs,zobs,ry,rz)i=1nexp(xijzβ2+wijzb2i+γjyij)zij1+exp(xijzβ2+wijzb2i+γjyij)×ϕ(γj;μγj,σγj2), (24)
π(b1i|ϑ(b1i),yobs,zobs,ry,rz)j=1myϕ(yij;xijyβ1+wijyb1i,σε2)×j=1mzexp(uijyα1+ψyb1i)rijy1+exp(uijyα1+ψyb1i)×ϕqy(b1i;0,D1), (25)
π(b2i|ϑ(b2i),yobs,zobs,ry,rz)j=1mzexp(xijzβ2+wijzb2i+γjyij)zij1+exp(xijzβ2+wijzb2i+γjyij)×j=1mzexp(uijzα2+ψzb2i+κjrijy)rijz1+exp(uijzα2+ψzb2i+κjrijy)×ϕqz(b2i;0,D2), (26)
Yijmis|ϑ(Yijmis),yobs,zobs,ry,rzN(xijyβ1+wijyb1i,σε2) (27)

and

Zijmis|ϑ(Zijmis),yobs,zobs,ry,rzBer(πij). (28)

5. Simulation studies

5.1. Evaluation of the model and its performance comparison with that of using separate models

To examine the performance of the proposed joint model and to compare it with those obtained by using separate models, we conducted some simulation studies. We simulated N = 50 data sets with different sample sizes. The numbers of repeated measurements for individuals are different in each outcome and they obtained under a non-random missingness and ranged from 1 to my=7 for the continuous outcome and from 1 to mz=5 for binary outcome. We consider the following joint model:

Yij=xijyβ1+b1i+εij,i=1,2,,n,j=1,,my, (29)
Yij=xijzβ2+b2i+γjyij+εij,i=1,2,,n,j=1,,mz, (30)

where εijlogistic(0,1), Zij=1 if Yij0 and Zij=0 if Yij<0.

Rijy|b1iBer(πijRy), (31)

where logit(πijRy)=uiyα1+ψyb1i.

Rijz|b2iBer(πijRz), (32)

where logit(πijRz)=uizα2+ψzb2i+κjrijy. In this simulation study, xijyβ1=β10+β11x1i+β12x2i+β13tj, xijzβ2=β20+β21x1i+β22x2i+β23sj, uiyα1=α01+α11x1i, uizα2=α02+α12x2i, εijN(0,σε2), b1iN(0,σb12), b2iN(0,σb22), tj=j,j=0,0.5,1,2,,my, sj=1,2,,mz, x1 and x2 are generated from a Ber(0.3) and a Ber(0.4), respectively.

5.1.1. The effect of large sample size

We consider simulation studies with sample sizes n = 500, 1000, 2000 and 3000. Also, different association parameters are considered. The first is γj=1.000,j=1,,5 and κj=1.000,j=1,,5, the second set of choices for parameters is γj=0.000,j=1,,5 and κj=0.000,j=1,,5 and the third set of choices for parameters is γj=0.500,j=1,,5 and κj=0.500,j=1,,5. Also, the true values of parameters are σε2=4, σb12=σb22=1. The other true values are considered as β1=(2,1,1,2), β2=(0,2,1,1), α1=α2=(2,2) and (ψy,ψz)=(2,2). Another simulation study with β1=(8,2,1,2), β2=(0,1,1,1), α1=(4,0),α2=(0,2), γj=0.500,j=1,,5 and κj=2.000,j=1,,5 and (ψy,ψz)=(1,2), σε2=9, σb12=4,σb22=1 is also performed.

The proposed joint model and separate models are fitted to each generated data. The results of these simulation studies are reported in Tables B.1 and B.2 of the Supplementary Materials B for the first set of association parameters, in Tables B.3 and B.4 for zero association parameters, in Tables B.5 and B.6 for the third association parameters and in Tables B.7 and B.8 for the last set of parameters. For more information about computational times of these models see Supplementary Materials E. In these tables, posterior mean, empirical standard errors, root of mean squared errors and relative biases are reported. The two letter criteria are given by RMSE(θ)=1Ni=1N(θiθ¯)2 and Rel.bias(θ)=θ¯θ1 where θi, i=1,2,,N, is the estimate of θ for the ith sample. For the MCMC sampling, we ran 2 chains with 30,000 iterations, and 10,000 iterations of each chain are considered as burn-in period. Gelman-Rubin diagnosis test [13] are checked to ensure convergence of the MCMC chains. We consider the following prior distributions: βjkN(0,1000), j = 1, 2, k = 0, 1, 2, 3, γkN(0,1000), k = 1, 2, 3, 4, 5, κkN(0,1000), k = 1, 2, 3, 4, 5, σ2IΓ(0.01,0.01), σbk2IΓ(0.01,0.01), k = 1, 2, αjkN(0,1000), j = 1, 2, k = 0, 1 and ψy,ψzN(0,1000). The ‘R2OpenBUGS’ package is used for the implementation of these models.

Based on the results of these tables, the performance of the proposed joint model is well known for analyzing generated data under either zero or non-zero association parameters. The results of the simulation study for the non-zero association parameters model show that the increase in sample size is an effective way of decreasing biases and standard deviations of parameters. As shown in tables B.1, B.3, B.5 and B.7, relative biases and RMSEs are reduced by increasing sample sizes. This suggests that the proposed model is consistent. Based on the results of Tables B.2, B.6 and B.8, the performance of the separate models when data are generated under non-zero association parameters are not well, specially, the values of RMSEs and relative biases for the components of the variances are large. Also, based on the results of Tables B.3 and B.4, the performance of the proposed joint model when data are generated under the separate models is as good as that of the separate models. In general, these tables show that the proposed method performs well.

5.1.2. The effect of small sample size

For checking the performance of the model using small sample size, three different small sample sizes n = 50, 80, 100 are also considered. We use the above-mentioned model with γj=1.000,j=1,,5 and κj=1.000,j=1,,5, σε2=4, σb12=σb22=1. The other true values are considered as β1=(2,1,1,2), β2=(0,2,1,1), α1=α2=(2,2) and (ψy,ψz)=(2,2). The Bayesian implementation including prior distributions and the number of iterations, etc. are the same as those of the previous simulation. The results of this simulation study for the joint and separate models are given in Tables B.9 and B.10, respectively. Note that, the number of parameters for joint model is 27 and N = 50 sample size seems to be too small for having accurate parameter estimations. Also, increasing sample size to n = 100 leads to more accurate estimations. The results of fitting separate models for small sample sizes do not show acceptable results and the value of RMSE and relative biases for some parameters are too large.

5.2. Evaluation of distributional assumption for random effects

In this section, another simulation study is performed to investigate the performance of the proposed model with respect to different distributional assumption for random effects. For this purpose, the mentioned model of Section 5.1.1 with n = 500 and three different distributional assumptions for b1i and b2i,i=1,2,,n, are considered and the proposed joint modeling is used for analyzing the data. These distributional assumptions are given as follows:

  • Student's t distribution: b1i,b2iiidt(2), that is, standard Student's t distribution with 2 degrees of freedom.

  • Skew-normal distribution: b1i,b2iiidSN(2piδ,1,λ), that is, zero-mean skew-normal distribution with scale parameter 1 and skewness parameter λ=2, such that, δ=λ/(1+λ2).

  • Mixture of two normal distributions: b1i,b2iiid0.5N(2,1)+0.5N(2,1), that is a mixture of two normal distributions with mean zero.

The other true values are considered as γj=1.000,j=1,,5, κj=1.000,j=1,,5, σε2=4, β1=(2,1,1,2), β2=(0,2,1,1), α1=α2=(2,2) and (ψy,ψz)=(2,2). The Bayesian implementation including prior distributions and the number of iterations, etc. are the same as those of the previous simulations.

The results of this simulation study are given in Table B.11. In comparison to the results of the normal distributional assumption, results of this simulation (see Table B.1) show that the regression coefficients of continuous longitudinal model are not very sensitive with respect to different distributional assumptions, but the regression coefficients of binary longitudinal model are sensitive, specially for Student's t and skew-normal distributions. Also, the regression coefficients of missingness mechanism for binary outcome of Student's t and skew-normal distributions are far from the real values.

6. Analysis of morbid obesity data

The main objective of the analysis of the morbid obesity data is to find the simultaneous effect of type of surgery on having any complication due to surgery and on BMI. Also, the association between BMI and complication due to surgery is in particular interest. The data were collected from n = 2890 patients who had at least 2 measurements of both BMI and complication due to surgery and other measurements are ranged from 1 to 8 for BMI and from 1 to 6 for complication due to surgery. Let BMIij,i=1,2,,n,j=1,,8 be the BMI for patient i at time tj. Also, Complicatik,i=1,2,,n,k=1,,6 be complication due to surgery for patient i at time sk. Also, let Agei, Genderi, Helicobi, T2Diabi, Fliveri, Dyslipi, Hypothi, Minii, Rygi, denote the age, gender and having Helicobacter, Type 2 Diabetes, Fatty liver, Dyslipidemia, Hypothyroidism, Mini-gastric bypass and RY gastric bypass, respectively. We consider the following joint model:

BMIij|b1iN(μij,σ2),Complicatik|b2i,BMIikBer(πik), (33)

where μij=β10+β11tj+β12Agei+β13Genderi+β14Minii+β15RYGi+β16Fliveri+β17Helicobi+β18T2Diabi+β19Dyslipi+β1,10Hypothi+b1i and logit(πik)=β20+β21sk+β22Agei+β23Genderi+β24Minii+β25RYGi+β26Fliveri+β27Helicobi+β28T2Diabi+β29Dyslipi+β2,10Hypothi+γkBMIisk+b2i. Also, the following models are used for missingness mechanisms: RijBMIBer(πijBMI), where

logit(πijBMI)=α10+α11Minii+α12RYGi+ψBMIb1i

and RikComplicatBer(πikComplicat), where

logit(πikComplicat)=α20+α21Minii+α22RYGi+κkriskBMI+ψComplicatb2i,

where b1iN(0,σb12) and b2iN(0,σb22).

In the Bayesian method, two parallel MCMC chains are run with different initial values for 80,000 iterations each. Then, we have discarded the first 40,000 iterations as pre-convergence burn-in and retained 40,000 for the posterior inference. For checking the convergence of the MCMC chains, we have used the Gelman–Rubin diagnostic test. Also, trace plots of unknown parameters are shown in Figures D.1–D.6 of Supplementary Materials D. The prior distributions for the unknown parameters are the same as those of the simulation study section and are given by: βjkN(0,1000),j=1,2,k=0,1,2,,11, γkN(0,1000),k=1,2,3,,6, κkN(0,1000),k=1,2,3,,6, σ2IΓ(0.01,0.01), ψBMI,ψComplicatN(0,1000), σbk2IΓ(0.01,0.01),k=1,2 and αjkN(0,1000),j=1,2,k=0,1,2. Table 2 shows the results of the Bayesian joint modeling and separate models on the data set. These two models are compared using EAIC, EBIC, DIC and LPML. These criteria are given by Baghfalaki et al. [4,5]. Unlike EAIC, EBIC and DIC, larger values of LPML indicate a better fitting model. These comparison criteria show that the joint model has a better fit to the data.

Table 2. Parameter estimates (Est.), standard deviation (S.D.) and 95 % credible interval for analyzing morbid obesity data.

  Joint model Separate model
Parameters Est. S.D. CI 2.5 % CI 97.5 % Est. S.D. CI 2.5 % CI 97.5 %
  Continuous submodel (BMI)
β10 (Intercept) 50.850 3.027 45.270 55.150 52.430 1.691 48.870 55.100
β11 (Time) −2.092 0.013 −2.116 −2.067 −2.092 0.012 −2.117 −2.068
β12 (Age) 0.001 0.012 −0.024 0.027 −0.001 0.012 −0.025 0.021
β13 (Gender; female) −1.988 0.344 −2.623 −1.308 −2.143 0.298 −2.691 −1.519
β14 (Mini-gastric bypass) 0.704 0.353 0.051 1.450 0.693 0.357 −0.049 1.353
β15 (RY gastric bypass) 0.118 0.362 −0.544 0.877 0.127 0.363 −0.603 0.805
β16 (Fatty liver) 1.060 0.264 0.572 1.588 1.065 0.245 0.572 1.530
β17 (Helicobacter) 0.049 0.204 −0.356 0.441 0.036 0.208 −0.377 0.454
β18 (Type 2 Diabetes) −0.895 0.288 −1.474 −0.333 −0.880 0.289 −1.442 −0.308
β19 (Dyslipidemia) −0.545 0.222 −0.976 −0.108 −0.566 0.218 −0.996 −0.140
β110 (Hypothyroidism) 0.281 0.262 −0.237 0.788 0.281 0.259 −0.230 0.781
α10 (Intercept) 0.775 0.044 0.689 0.862 0.774 0.044 0.689 0.861
α11 (Mini-gastric bypass) 0.597 0.049 0.500 0.694 0.598 0.049 0.502 0.695
α12 (RY gastric bypass) 0.047 0.050 −0.052 0.146 0.048 0.050 −0.051 0.146
σb12 27.840 0.789 26.320 29.410 27.820 0.785 26.310 29.391
σ2 11.000 0.129 10.750 11.250 11.000 0.130 10.740 11.250
ψBMI −0.007 0.003 −0.013 0.000 −0.007 0.003 −0.013 0.000
  Binary submodel (Complication due to surgery)
β20 (Intercept) 1.554 0.938 −0.080 3.696 2.376 0.528 1.397 3.411
β21 (Time) −0.136 0.098 −0.324 0.054 −0.277 0.013 −0.303 −0.250
β22 (Age) −0.001 0.003 −0.007 0.004 −0.002 0.002 −0.006 0.002
β23 (Gender; female) −0.373 0.086 −0.549 −0.211 −0.326 0.067 −0.461 −0.195
β24 (Mini-gastric bypass) 0.244 0.095 0.057 0.428 0.095 0.074 −0.051 0.242
β25 (RY gastric bypass) 0.100 0.099 −0.097 0.290 0.090 0.078 −0.064 0.243
β26 (Fatty liver) −0.182 0.060 −0.299 −0.065 −0.224 0.050 −0.324 −0.125
β27 (Helicobacter) −0.079 0.049 −0.175 0.016 −0.058 0.041 −0.140 0.022
β28 (Type 2 Diabetes) −0.077 0.068 −0.211 0.057 −0.060 0.058 −0.173 0.054
β29 (Dyslipidemia) 0.145 0.051 0.045 0.246 0.134 0.043 0.050 0.220
β210 (Hypothyroidism) 0.008 0.061 −0.112 0.129 −0.045 0.052 −0.146 0.056
α20 (Intercept) −0.594 0.118 −0.826 −0.361 0.571 0.099 0.375 0.766
α21 (Mini-gastric bypass) 0.887 0.127 0.634 1.136 0.936 0.110 0.723 1.154
α22 (RY gastric bypass) −0.092 0.130 −0.346 0.164 −0.059 0.112 −0.278 0.163
σb22 0.506 0.068 0.382 0.647 0.093 0.027 0.042 0.146
ψComplicat 2.258 0.177 1.940 2.629 4.679 0.826 3.569 6.718
  Shared parameters
γ1 0.070 0.008 0.056 0.085
γ2 −0.015 0.006 −0.026 −0.004
γ3 −0.021 0.004 −0.030 −0.012
γ4 −0.012 0.005 −0.021 −0.003
γ5 0.001 0.006 −0.011 0.013
γ6 0.005 0.009 −0.012 0.022
κ1 4.059 0.103 3.859 4.263
κ2 2.373 0.081 2.215 2.533
κ3 1.803 0.072 1.664 1.943
κ4 1.449 0.076 1.302 1.599
κ5 1.083 0.081 0.922 1.241
κ6 0.854 0.096 0.666 1.043
EAIC 142,190 147966
EBIC 142,458.6 148,163
DIC 4604 4717
LPML −59,119.75 −61,951.82

This table shows that time, gender, type of surgery, fatty liver, type 2 diabetes and dyslipidemia are significant predictors in the continuous model (BMI), also, gender, type of surgery (mini-gastric bypass), fatty liver and dyslipidemia are significant predictors in the binary model (indicator of complication due to surgery). Based on the results, the increase in time reduces BMI index, females have smaller BMI and smaller probability to have complication due to surgery. The persons with fatty liver have larger values of BMI and smaller probability to have complication due to surgery. In addition, the persons with Dyslipidemia have smaller BMI but have the larger probability to have complication due to surgery. The conditional model shows a statistically significant relationship between missing time points and complication due to surgery ( ω2 is significantly different from zero), but not for BMI because, this dependence was not detected with the shared parameter model, where ω1 was not significantly different from zero. Therefore, a significant positive relationship between complication due to surgery and intermittent missingness (estimate ω2 was highly significant), suggesting that the missing-data mechanism is non-ignorable. Also, parameters γk and κk, k=1,2,,6, show that the continuous and binary longitudinal measurements are dependent, also, two missingness mechanisms are dependent. Figures C.1–C.6 of supplementary material C show the posterior densities for the unknown parameters in the models. Figures C.1–C.4 compare the posterior densities of parameters using separate and joint models. Figure C.1 shows that the behavior of the predictors on modeling BMI in the two competing models is similar. Also, Figure C.2 shows that the performance of two models in estimating parameters of the binary model is a little different. Based on Figure C.3, except α20, the regression coefficients of missingness mechanisms in the two approaches are equivalent. Also, Figure C.4 shows that posterior densities of components of variance in the two approaches are different. Figures C.5 and C.6 show the posterior densities for the association parameters of the joint model. These figures show that most of the association parameters have non-zero means.

As other criteria for comparing two models, we use true positive rate (TPR), false positive rate (FPR) and true discovery rate (TDR) of the observed values of indicators of complication due to surgery and their predicted values by two approaches. Let ComplicatikPred=1 if (i,k)th component of the indicator of complication due to surgery are predicted to be one. Also, for the observed values of situation Complicatik=1 if (i,k)th component of the indicator of complication due to surgery is in fact one. The TPR, the FPR and the TDR of the method can be calculated as follows:

TPR=i=1nkComplicatikComplicatikPredi=1nkComplicatik,FPR=i=1nk(1Complicatik)ComplicatikPredi=1nk(1Complicatik),TDR=i=1nkComplicatikComplicatikPredi=1nkComplicatikPred.

A receiver operating characteristic (ROC) curve plots FPR versus TPR for the possible cutoffs. A common method of comparing the methods is the area under the ROC curve (often for simplicity referred to as the AUC). AUC is a portion of the area of the unit square; its value is always between 0 and 1. The larger the value of AUC, the better is the performance of the approach. The values of TPR, FPR and TDR for cutoff point equal to 0.5 for two competing models are reported in Table 3. TPR for the joint model is a bit smaller than that of separate models, but the difference of FPR between two models is a large value. Other criteria of this table also show the better performance of the joint model in comparison with that of using separate models. Furthermore, the values of AUC for two competing models are reported in this table and the larger value belongs to the joint model. Figure 3 shows ROC curve for indicator of complication due to surgery for two competing models which shows the AUC for joint model is larger than that of separate model. A sensitivity analysis with respect to the prior distributions of regression coefficients and association parameters is also performed. For this purpose, the sensitivity of the posterior mean to different values of the variance of the prior distribution is investigated. The results show that the parameter estimates are robust with respect to the normal priors when variances are considered to be larger than 100.

Table 3. The values of TPR, FPR and TDR for cutoff point equal to 0.5 and AUC for two competing models in analyzing morbid obesity data.

  Joint model Separate models
TPR 0.875 0.909
FPR 0.469 0.864
TDR 0.697 0.527
AUC 0.817 0.660

Figure 3.

Figure 3.

ROC curves of observed and predicted values by two competing approaches of complication due to surgery for the morbid obesity data.

7. Conclusions

In this paper, we propose a joint model for a mixed continuous and binary outcomes with different missing patterns. We use an MCMC procedure via the Gibbs sampler and Metropolis-Hastings algorithm to draw samples for parameters estimation. This process is facilitated using the freely available OpenBUGS software [23].

Two shared parameter models are considered for missingness mechanisms of two responses which jointly modeled using a conditional model. The usefulness of shared-parameter models for analyzing longitudinal data subject to non-random missing data is shown by many authors [1–3]. It should be recognized that, although these models account for NMAR data, it is impossible to distinguish between MAR and NMAR data within this class of models [3]. Also, one can use pattern mixture model and selection model frameworks for modeling missing mechanism [18–20].

In this paper, a normality assumption is used for the random effects and errors. One can use more flexible distributional assumption such as t, skew-normal, skew-t or other distributional assumptions for random effects. Also, a logistic mixed-effects model is used for analyzing binary data including binary longitudinal data and missingness mechanisms. As a future work, one can use the probit mixed-effects model for modeling the binary outcomes.

Also, we consider non-informative prior distributions for the unknown parameters, one can use informative priors or empirical Bayes approach for choosing hyper parameters in the Bayesian paradigm.

Supplementary Material

Supplementary_Data

Acknowledgments

This work has been supported by the grant number 97014144 from Iranian National Science Foundation (INSF). The authors would like to thank the INSF. The authors also acknowledge the valuable suggestions from the referees.

Funding Statement

This work has been supported by Iranian National Science Foundation (INSF) [grant number 97014144].

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

  • 1.Albert P.S. and Follmann D.A., Modeling repeated count data subject to informative dropout, Biometrics 56 (2000), pp. 667–677. doi: 10.1111/j.0006-341X.2000.00667.x [DOI] [PubMed] [Google Scholar]
  • 2.Albert P.S. and Follmann D.A., A random effects transition model for longitudinal binary data with informative missingness, Statistica Neerlandica 57(2003), pp. 100–111. doi: 10.1111/1467-9574.00223 [DOI] [Google Scholar]
  • 3.Albert P.S. and Follmann D., Shared-parameter models, in Longitudinal Data Analysis, Chapman & Hall/CRC, Boca Raton, 2009, pp. 433–452.
  • 4.Baghfalaki T., Ganjali M., and Berridge D., Joint modeling of multivariate longitudinal mixed measurements and time to event data using a Bayesian approach, J. Appl. Stat. 41 (2014), pp. 1934–1955. doi: 10.1080/02664763.2014.898132 [DOI] [Google Scholar]
  • 5.Baghfalaki T., Ganjali M., and Hashemi R., Bayesian joint modeling of longitudinal measurements and time-to-event data using robust distributions, J. Biopharm. Stat. 24 (2014), pp. 834–855. doi: 10.1080/10543406.2014.903657 [DOI] [PubMed] [Google Scholar]
  • 6.Bahrami Samani E. and Ganjali M., Bayesian latent variable model for mixed continuous and ordinal responses with possibility of missing responses, J. Appl. Stat. 38 (2011), pp. 1103–1116. doi: 10.1080/02664763.2010.484485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Buhule O.D., Wahed A.S., and Youk A.O., Bayesian hierarchical joint modeling of repeatedly measured continuous and ordinal markers of disease severity: Application to Ugandan diabetes data, Stat. Med. 36 (2017), pp. 4677–4691. doi: 10.1002/sim.7444 [DOI] [PubMed] [Google Scholar]
  • 8.Catalano P.J., Bivariate modelling of clustered continuous and ordered categorical outcomes, Stat. Med. 16 (1997), pp. 883–900. doi: [DOI] [PubMed] [Google Scholar]
  • 9.Dunson D.B. and Herring A.H., Bayesian latent variable models for mixed discrete outcomes, Biostatistics 6 (2005), pp. 11–25. doi: 10.1093/biostatistics/kxh025 [DOI] [PubMed] [Google Scholar]
  • 10.Ganjali M., A model for mixed continuous and discrete responses with possibility of missing responses, J. Sci. Islam. Repub. Iran 14 (2003), pp. 53–60. [Google Scholar]
  • 11.Ganjali M. and Shafie K., A transition model for an ordered cluster of mixed continuous and discrete responses with non-monotone missingness, J. Appl. Stat. Sci. 15 (2006), pp. 331–343. [Google Scholar]
  • 12.Gaskins J.T., Daniels M.J., and Marcus B.H., Bayesian methods for nonignorable dropout in joint models in smoking cessation studies, J. Am. Stat. Assoc. 111 (2016), pp. 1454–1465. doi: 10.1080/01621459.2016.1167693 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Gelman A. and Rubin D.B., Inference from iterative simulation using multiple sequences, Stat. Sci. 7 (1992), pp. 457–472. doi: 10.1214/ss/1177011136 [DOI] [Google Scholar]
  • 14.Gueorguieva R.V. and Sanacora G., Joint analysis of repeatedly observed continuous and ordinal measures of disease severity, Stat. Med. 25 (2006), pp. 1307–1322. doi: 10.1002/sim.2270 [DOI] [PubMed] [Google Scholar]
  • 15.Kurum E., Hughes J., and Li R., A semivarying joint model for longitudinal binary and continuous outcomes, Canadian Journal of Statistics 44 (2016), pp. 44–57. doi: 10.1002/cjs.11273 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Li Q., Pan J., and Belcher J., Bayesian inference for joint modelling of longitudinal continuous, binary and ordinal events, Stat. Methods Med. Res. 25 (2016), pp. 2521–2540. doi: 10.1177/0962280214526199 [DOI] [PubMed] [Google Scholar]
  • 17.Liu X., Daniels M.J., and Marcus B., Joint models for the association of longitudinal binary and continuous processes with application to a smoking cessation trial, J. Am. Stat. Assoc. 104 (2009), pp. 429–438. doi: 10.1198/016214508000000904 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Michiels B., Molenberghs G., Bijnens L., Vangeneugden T., and Thijs H., Selection models and pattern-mixture models to analyse longitudinal quality of life data subject to drop-out, Stat. Med. 21 (2002), pp. 1023–1041. doi: 10.1002/sim.1064 [DOI] [PubMed] [Google Scholar]
  • 19.Michiels B., Molenberghs G., and Lipsitz S.R., Selection models and pattern-mixture models for incomplete categorical data with covariates, Biometrics 55 (1999), pp. 978–983. doi: 10.1111/j.0006-341X.1999.00978.x [DOI] [PubMed] [Google Scholar]
  • 20.Molenberghs G., Michiels B., and Kenward M.G, Pseudo-likelihood for combined selection and pattern-mixture models for missing data problems, Biom. J. 40 (1998), pp. 557–572. doi: [DOI] [Google Scholar]
  • 21.Rubin D.B., Inference and missing data, Biometrika 63 (1976), pp. 581–592. doi: 10.1093/biomet/63.3.581 [DOI] [Google Scholar]
  • 22.Samani E.B., Ganjali M., and Eftekhari S., A latent variable model for mixed continuous and ordinal responses with nonignorable missing responses: Assessing the local influence via covariance structure, Sankhya B 72 (2010), pp. 38–57. doi: 10.1007/s13571-010-0003-7 [DOI] [Google Scholar]
  • 23.Spiegelhalter D.J., Best N.G., Carlin B.P., and van der Linde A., Bayesian measures of model complexity and fit, J. R. Stat. Soc. B (Stat. Methodol.), 64(2002), pp. 583–639. doi: 10.1111/1467-9868.00353 [DOI] [Google Scholar]
  • 24.Teimourian M., Baghfalaki T., Ganjali M., and Berridge D., Joint modeling of mixed skewed continuous and ordinal longitudinal responses: A Bayesian approach, J. Appl. Stat. 42(2015), pp. 2233–2256. doi: 10.1080/02664763.2015.1023557 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary_Data

Articles from Journal of Applied Statistics are provided here courtesy of Taylor & Francis

RESOURCES