Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Dec 16.
Published in final edited form as: J R Stat Soc Ser A Stat Soc. 2012 Dec 12;176(3):10.1111/j.1467-985X.2012.12002.x. doi: 10.1111/j.1467-985X.2012.12002.x

Bayesian Hierarchical Semiparametric Modelling of Longitudinal Post-treatment Outcomes from Open Enrolment Therapy Groups

Susan M Paddock *, Terrance D Savitsky *
PMCID: PMC3864894  NIHMSID: NIHMS436949  PMID: 24353375

Abstract

There are several challenges to testing the effectiveness of group therapy-based interventions in alcohol and other drug use (AOD) treatment settings. Enrollment into AOD therapy groups typically occurs on an open (rolling) basis. Changes in therapy group membership induce a complex correlation structure among client outcomes, with relatively small numbers of clients attending each therapy group session. Primary outcomes are measured post-treatment, so each datum reflects the effect of all sessions attended by a client. The number of post-treatment outcomes assessments is typically very limited. The first feature of our modeling approach relaxes the assumption of independent random effects in the standard multiple membership model by employing conditional autoregression (CAR) to model correlation in random therapy group session effects associated with clients’ attendance of common group therapy sessions. A second feature specifies a longitudinal growth model under which the posterior distribution of client-specific random effects, or growth parameters, is modeled non-parametrically. The Dirichlet process prior helps to overcome limitations of standard parametric growth models given limited numbers of longitudinal assessments. We motivate and illustrate our approach with a data set from a study of group cognitive behavioral therapy to reduce depressive symptoms among residential AOD treatment clients.

Keywords: Bayesian hierarchical models, Conditional autoregressive prior, Dirichlet process, Group therapy, Growth curve, Mental health, Substance abuse treatment

1 Introduction

Group therapy is an important treatment modality in mental health care and substance abuse treatment. A primary goal of group therapy research studies is to determine the comparative effectiveness of one intervention versus another. A central challenge posed by group therapy data is to account for the correlation among client outcomes, as clients who participate jointly in group therapy are likely to have similar experiences and outcomes as compared to other clients. Possible analytic approaches to address this issue when enrollment of clients into the therapy group is closed – that is, therapy group membership is not expected to change session-to-session – include modeling client outcomes using a multilevel (hierarchical) model with random intercept terms to reflect client participation in a single closed group (Roberts & Roberts 2005, Bauer et al. 2008).

In contrast, enrollment into a therapy group on an open basis allows for clients to enter the therapy group as space permits, which induces a more complex correlation structure among client outcomes. To illustrate, suppose Clients X and Y attend a therapy group session at time t. Client Y leaves the group immediately afterward. Client Z joins the group at time t + 1. Even if Clients Y and Z never meet, their outcomes might be correlated due to the fact they both attend group therapy sessions that overlap with attendance by others, such as Client X, who attend at times t and t + 1. Therapy groups with open enrollment policies – also called rolling groups – are widespread in practice, given their ability to help maximize the number of persons participating in therapy at any one time; shorten client wait time for entrance into treatment; and maintain a clinically necessary minimum number of members to maintain the group dynamic (Monti et al. 2002). An illustration of their practical importance is that most therapy groups offered in outpatient treatment settings for treating alcohol and other drug use (AOD) disorders use open enrollment (Morgan-Lopez & Fals-Stewart 2006).

Despite the ubiquity of open-enrollment groups in practice, there is a lack of guidance on how to properly analyze such data. The analytic approaches developed thus far are limited to the special case of outcomes data collected from clients as they undergo treatment, such that each datum yis reflects an outcome for individual i observed at therapy group session s. Morgan-Lopez & Fals-Stewart (2007) focus on modeling informatively missing data due to clients failing to complete all sessions, but their approach does not generalize to post-treatment outcomes, which are collected regardless of adherence to the assigned intervention. Paddock et al. (2011) explicitly model correlation at the therapy group session level using conditional autoregression (CAR) (Besag, York & Mollié 1991). Its strengths include allowing one to specify ‘neighboring’ sessions and closeness (distance) of sessions in a sufficiently general way to cover several clinically relevant notions of defining session closeness, such as by overlapping client attendance across sessions or by time at which the session was offered (Morgan-Lopez & Fals-Stewart 2006).

Though group therapy intervention studies are frequently longitudinal, the number of post-treatment assessments is typically quite limited (e.g., 1–3) (e.g., Granholm et al. 2005, Rohsenow et al. 2001). Parametric growth curve modeling provides an efficient way to model longitudinal data for examining variation in client-specific growth curves and accounting for correlated outcomes within individuals (Muthén et al. 1997). However, a limited number of longitudinal assessments constrains how generally one might specify a parametric growth model for modeling the range of growth curves across individuals without adding prior constraints to identify parameters (Kenny & Milan 2012).

We present an approach for modeling post-treatment outcomes from open-enrollment therapy group studies that is a novel combination of multiple membership (MM) modeling (Hill & Goldstein 1998) with the CAR approach of Paddock et al. (2011). This combines the strengths of MM models for multilevel non-nested data – which is appropriate since each post-treatment outcome reflects the contribution of all sessions attended by a client – but relaxes the assumption of the standard MM model in which session random effects are assumed to be independently drawn from a random effects distribution, thus better reflecting the inter-relatedness of client attendance across sessions.

We employ a hierarchical Bayesian modeling perspective under a formulation that permits the borrowing of strength across multiple therapy group sessions for clients who attend open-enrollment therapy groups. Borrowing strength reduces the effective number of parameters of the model (Spiegelhalter et al. 2002), facilitating parsimonious modeling of how session effects are related to post-treatment outcomes. We similarly borrow strength in non-parametrically modeling client-level growth curve parameters to allow our model to capture a broad range of client growth curves, despite having relatively few longitudinal post-treatment outcome assessments per client. We motivate and illustrate our approach with a real data set from a study that is representative of group therapy-based interventions for AOD treatment.

2 Case Study: Post-Treatment Depressive Symptom Scores for BRIGHT versus Usual Care clients

The Building Recovery by Improving Goals, Habits and Thoughts (BRIGHT) study (Watkins et al. 2011) was a community-based effectiveness trial of a group cognitive behavioral therapy (CBT) intervention for treating residential substance abuse treatment clients having depressive symptoms. An aim of the study was to test whether clients receiving the BRIGHT intervention would have sustained improvements in depressive symptoms following treatment. The BRIGHT study employed a quasiexperimental design in which cohorts of clients at each of four study sites received either residential treatment as usual (UC) or residential treatment enhanced with the BRIGHT intervention (CBT) provided by trained substance abuse counselors over the study enrollment period. Clients were assigned to receive either CBT or UC according to which intervention was offered at their study sites at the time of entry into residential substance abuse treatment. CBT and UC were offered at each study site on an alternating basis over time. In order to participate in the study, clients had to screen positive for persistent depressive symptoms and not have bipolar disorder, schizophrenia, or cognitive impairment. Overall, 299 clients enrolled into the study, with 140 assigned to CBT and 159 to UC.

The BRIGHT group therapy consisted of 16 sessions of group CBT offered over an eight-week period that were organized into four modules of four sessions each. The four modules over 16 sessions were offered on a rotating basis. Thus, the CBT group had a semi-open admissions policy, as client admission to the CBT group could change every four sessions. Though clients were able to initiate treatment at the first session of any module, clients often did not complete each module as intended. In all, S = 245 group CBT sessions were offered to clients from all treatment sites. This included 14 offerings of the 16-session sequence (224 sessions); 20 additional sessions from two of the modules to increase exposure to group CBT for those who joined the therapy group late for three particular 16-week sequences; and one additional session that followed a long holiday weekend to make up for poor attendance at the regularly scheduled session. These 245 sessions were divided into G = 4 CBT open-enrollment therapy groups having distinct clients. The number of sessions for each of these four groups was 36, 40, 40, and 129 and number of clients enrolled in each open-enrollment group was 17, 21, 19, and 83, respectively. Each client attended sessions of only one of the four open-enrollment groups. Data were collected from clients at a baseline survey administered when clients enrolled in the BRIGHT study and at two post-treatment follow-up assessments at 3 and 6 months, having response rates of 86% and 87%, respectively. The primary outcome we use to illustrate the methodology is the extent of client depressive symptoms as measured by the Beck Depression Inventory-II (BDI-II) (Beck et al. 1996). The BDI-II score is a sum across 21 four-level items (scored 0 – 3), with a higher score indicating a greater level of depressive symptoms.

3 Modeling longitudinal outcomes without accounting for session attendance

It is typical in post-treatment outcomes studies of group therapy interventions to conduct an analysis without accounting for the potential correlation of outcomes for those clients who attend the same therapy group, regardless of whether enrollment is open (Morgan-Lopez & Fals-Stewart 2006) or closed (Bauer et al. 2008). Further, these analyses are typically fully parametric and conducted using routines available in standard statistical software packages. The standard linear longitudinal growth model (LGM) serves as an example of a typical approach one might choose to model the BDI-II outcomes collected during the BRIGHT study. The LGM can be expressed as:

yij=α+xijβ+zijbi+εij, (1)

where yij is the depressive symptom score for client i = (1, …, n) at repeated measurement event j = (1, …, mi), such that mi reflects the fact that different numbers of repeated measures may be observed across clients. On the right-hand side of the equation, α is a fixed intercept term; xij is a p × 1 set of fixed effects predictors, with the effects of predictors on the outcome expressed as β; zij is a set of q predictors that multiply the q client random effects, or growth parameters, bi; and εij is an observational error term. The standard linear LGM involves assuming the εij’s are independent and identically distributed as Gaussian with mean 0 and variance σε2. The bi’s are modeled as exchangeable and follows a common Gaussian distribution. To allow for non-linear data patterns, a reasonable choice for xij would be to set it equal to a (p = 5) × 1 set of fixed effects predictors with Ti an indicator of treatment arm assigned to client i (Ti = 1 for CBT, Ti = 0 for UC), such that xij=(Ti,tij,tij2,Titij,Titij2), where tij denotes the continuously-valued time at which outcome yij was observed and β are parameters representing the effects of CBT assignment, time trend and the differential effect of CBT on the average time trend in y. We model time, tij, as continuous to express the notion that the underlying process of depression symptom progression is a continuous one on which we make discrete observations.

Under Equation 1, the choices for specifying client deviations about the average trend via zij and bi are limited since there is a maximum of J = 3 observations per client in BRIGHT. Specifications for the parametric LGM we examine thus include:

  • LGM(1RE) Equation 1 under zi′ = (1) and bi = (b0i).

  • LGM(2RE) Equation 1 under zi′ = (1, tij) and bi = (b0i, b1i)

To improve the borrowing of strength across common trends in depressive symptoms expression between clients, we employ the Dirichlet Process (DP) (Ferguson 1973, Escobar & West 1995) to nonparametrically model the distribution for the random growth parameters (Brown & Ibrahim 2003, Müller et al. 2007). Our application of DP to post-treatment outcomes serves the novel purpose of reducing the dimension of the number of growth parameters to allow for capturing the continuous trend without creating ad-hoc constraints to facilitate model-fitting, such as those specified for LGM(1RE) and LGM(2RE) (e.g., implicitly setting random quadratic effects equal to zero), or by identifying parameters strictly through the prior. The standard parametric distribution on the random growth parameters, B={bi} would be replaced by:

b1,,bn|F~F,  F~DP(c,F0), (2)

where b1, …, bn are conditionally independent given F, and are generated from the unknown distribution F. The DP prior is typically specified with two hyperparameters. The first is a base measure, F0, which represents the ‘best guess’ about the form of F prior to observing data; the expected value of F is F0. The second parameter of the DP prior, a precision parameter, c > 0, expresses the degree of confidence that F0 is the correct generating distribution for B; the higher the value of c, the more F is expected to conform to F0.

The alternative stick-breaking construction of the DP prior highlights that F may be expressed as a mixture with an infinite number of components (Sethuraman 1994). Given the difficulty of working with an infinite mixture, a common practice is to work with an equivalent specification in which F is marginalized out of the distribution on B and to employ the required exchangeability property of the {bi}; see (Blackwell & MacQueen 1973):

bi|Bi~1n1+cj=1,jinδbj(bi)+cn1+cF0, (3)

where Bi = (b1, …, bi−1, bi+1, …, bn) and δbj (bi) is a function that equals 1 if bi = bj and 0 otherwise. The DP provides a compromise between assuming that all clients have the same growth trajectories – in which case the mean trend would apply equally well to everyone – versus assuming each client has a distinct trajectory arising from a known, parametric distribution, F0.

Equation 3 reveals a positive probability for ties to occur among the values of the bis. The set of unique values among the {bi} are called the ‘locations,’ or clusters, and are denoted by bk*, where k = 1, …, M and M is the number of locations such that Mn. It is this feature of the DP prior that we exploit in order to gain traction in fitting quadratic growth curves to all n clients. Specifically, clients with similar growth curves will have higher probabilities of sharing locations. The right-hand mixture component in Equation 3 highlights that higher values of c increase the probability mass to select this component which would increase the number, M, of unique locations, bk* ’s, which are generated from F0. This would result in the stronger agreement of F with F0. We will refer to this model as:

  • LGM(DP): Equation 1 under zi=(1,tij,tij2) and bi = (b0i, b1i, b2i) where {bi} are modeled using Equation 3. We choose F0Nq (0, τb Iq=3), τb = (τb,1, …, τb,q=3), to discover how greatly the estimated model differs from an analogous parametric approach.

4 Modeling longitudinal outcomes while accounting for session attendance

4.1 Adding Random Session Intercepts to the Model

We build upon LGM(DP) by augmenting (1) to now include random session effects to model correlation among post-treatment outcomes for CBT-attending clients. First, we augment Equation 1 as follows:

yij=α+xijβ+zijbi+wiγ+εij. (4)

The set of random session effect parameters is γ = (γ1, …, γS), where S denotes the number of therapy group sessions. Since outcomes, yij, are not linked to specific therapy group sessions but rather to all sessions attended by client i, a multiple membership (MM) model can be specified to map each element of γ to the y’s (Hill & Goldstein 1998). Under the MM model, the components of γ are mapped to the yij’s by multiplying γ by an S × 1 weight vector, wi, that is normalized to sum to 1; in particular, Si equals the number of sessions attended by client i; wis = 1/Si if client i attended session s and wis = 0 otherwise. Denote by an n × S multiple membership matrix, W = {wi}, to encode session attendances for client i. Given the design of the BRIGHT study where clients assigned to usual care do not receive the group therapy treatment, the nUC rows of W corresponding to observations for clients who are assigned to usual care will be filled with zeros, indicating they receive no allocation of session effects. The session effects parameters, γ, will allow for correlations among client observations to the extent that they attend common sessions. We now describe two approaches, one that restricts the γ ’s to be conditionally independent as well as one that relaxes that assumption.

An important variation on Equation (4) replaces the single term, wiγ, with an additive set of terms to reflect the disjoint sub-grouping of clients whose sessions attended do not overlap with one another, as each client attends sessions belonging to only one of the G = 4 open-enrollment therapy groups in BRIGHT:

yij=α+xijβ+zijbi+(wi,gi)γgi+εij, (5)

where gi ∈ {1, …, G} denotes which of the G therapy groups client i is a member, wi,gi has length Sgi, where Sgi is the number of sessions in therapy group gi, and γgi is an Sgi − length vector of session effects for the therapy group attended by client i. In matrix form, Wg, g = 1, …, G, is an ng × Sg matrix for therapy group g that maps Sg sessions (where Σg=1GSg=S) to ng clients who attend therapy group g. Similarly, γg represents an Sg × 1 vector of random effects for sessions in group g. This specification allows us to use distinct hyperparameters in the prior specifications for each γg, g = 1, …, G, as discussed below.

4.1.1 Model Session Effects as Independent

The simplest formulation of a model for session effects is the standard MM model in which session effects are assumed to be independent and identically distributed and following a Gaussian distribution:

  • MM(I): Equation (4) under independent prior formulation for sessions effects, γ:γ1,,γS|τγ~Ns(0,τγ1IS),

where the parameter, τγ, represents the precision across session effects. The shorthand ‘MM(I)’ is such that ‘MM’ denotes that the multiple membership weight matrix, W, links the (typically) multiple sessions attended by a CBT client to their post-treatment outcomes.

We also examine a variant that relaxes the assumption of a single precision term, τγ, applying to all sessions in all open-enrollment groups:

  • MM-D(I): Equation (5) with γg|τγg~NSg(0,τγg1ISg), where τγg is the precision term for open-enrollment group g.

4.1.2 Model Session Effects as Correlated

In order to model session effects as correlated, we first specify the distance, or closeness, of sessions, which in turn provides a way to designate which sessions are neighbors of one another. For example, sessions offered at adjacent opportunities in time could be assumed to be closer than sessions offered at non-adjacent opportunities, such that neighboring sessions are those that are offered at adjacent time opportunities. Alternatively, one might assume the degree of closeness between any two sessions is proportional to the overlap of clients attending those two sessions, with the neighbors of a given session defined as those other sessions with which there is at least one client in common (Paddock et al. 2011).

The CAR prior permits modeling correlation among the set of random effects such that, conditional on its neighboring session’s random effects, the random effect for session s is independent of the effects for non-neighboring sessions; however, all the γs’s within an open-enrollment therapy group are unconditionally dependent. We follow Besag, Mollie, York & Mollié (1991), who enumerate a two-part form for the covariance matrix. Firstly, define an S × S adjacency matrix, Ω, to encode dependence among neighboring sessions where we set ωss′ ≥ 0 if session s is a neighbor of session s′ (i.e.,, s ~ s′), and 0, otherwise, and ωss is defined to be 0. For the analyses we discuss here, we employ a canonical choice for ωss when s ~ s′ of ωss′ = 1, which reflects whether sessions s and s′ are offered at adjacent opportunities in time and belong to the same open-enrollment group (Paddock et al. 2011). Secondly, construct D = Diag(ωs+), where ωs+ = Σj ωsj equals the number of (1st order or adjacent) neighbors of session s that may be viewed as capturing the relative influence of session s. Then compose covariance matrix, Q = (DΩ), the Moore-Penrose pseudo-inverse, as Q is not of full rank, and specify the joint distribution of random session effects,

γ|τγ,Ω~N(0,[τγ(DΩ)]), (6)

where scalar precision parameter, τγ, controls the overall strength of variation. The rank of (DΩ) is SG, where G represents the number of distinct open-enrollment therapy groups (Hodges et al. 2003). Despite the improper joint distribution, this construction specifies a set of proper Gaussian full conditional distributions,

γs|γs,τγ,Ω~N(γ̄s,(τγωs+)1), (7)

where γ̄s = Σj=s ωsjγjs+ is the average of the session effects for the neighbors of session s. Besag, Mollie, York & Mollié (1991). We label our resulting models as:

  • MM(CAR): Employ additive model of Equation (4) under the joint prior construction of Equation (6) and the associated full conditionals given by Equation (7).

  • MM-D(CAR): Employ additive model of Equation (5) with Equations (6) and (7) modified to allow separate precision parameters, τγg, for each open-enrollment group, g = 1, …, G.

5 Prior Distributions

5.1 Models without Multiple Membership and Session Effects

These models are differentiated from one another based on the priors selected for the sets of client effects, B = {bi}. For the LGM models, we jointly specify an independent Gaussian prior, b1, …, bn ~ Nq (0, Diag (τb)), where we recall that n are the total number of clients and q the number of random effects for each client: q = 2 for LGM(2RE). Under this prior construction, the q × 1 vector of precision parameters, τb, allows each component of bi to separately vary. We move to the LGM(DP) model by now using this independent Gaussian prior for the DP base distribution, F0Nq (0, Diag (τb)). We specify non-informative, flat priors for both the p × 1 fixed effects, β, and for the global intercept, α. Independent Ga (0.1, 0.1) are defined for (τb,1, …, τb,q).

Specification for the hyperparameters used for the prior on DP concentration parameter, c, is important due to the influence of c on the number of clusters formed and thus on the adaptability of F to the data (Escobar & West 1995). We see in (3) that higher values for c induce formation of more clusters, resulting in greater adherence of F to its parametric base distribution, F0. The typical formulation employs c ~ Ga (ac, 1), with shape parameter ac and rate parameter 1. The shape parameter value for ac may be chose to encode a prior expectation for the number of client effect clusters. We choose ac = 3, which produces a prior mean of M = 14 clusters of the n = 299 clients. Since we have no strong reason for this prior belief, we choose a low value to which the posterior for c is relatively insensitive. Sensitivity analysis reveals little change in number of posterior clusters discovered under our varied model formulations that employ the DP construction for a range of values for ac from 1 – 5.

5.2 Models with Multiple Membership and Session Effects

As before, we specify non-informative, flat priors for both β and α. The hyperparameter, τγ, serves different roles for each of the three models incorporating session effects. Under MM(I), τγ plays the usual role that controls the variation and we allow for uncertainty in this choice by employing τγ ~ Ga(0.1, 0.1). We experimented with a range of shape and rate hyperparameter settings (∈ {1, 0.1, 0.01}) and discovered little sensitivity of posterior results as similarly noted by Banerjee et al. (2003), and so utilize the same value as above-noted for MM(I).

6 Analysis of BRIGHT Post-treatment Outcomes

6.1 Model Comparison

Comparisons of goodness of fit and model complexity are often done using the deviance information criterion (DIC) of Spiegelhalter et al. (2002), which penalizes the model deviance for parameter complexity to prevent over-fitting. In particular, the effective dimension statistic used to penalize the average deviance, , is pD = D(θ̃), where θ̃ is typically taken to be the posterior mode or mean, θ̄ = E(θ|y). However, Spiegelhalter et al. (2002) also point out that there is not clear choice for θ̃ under the (countably infinite) mixture induced with our DP prior. The DIC3 criterion (Celeux et al. 2006) was developed in response to this limitation and focuses on the marginal (predictive) density f(y) to estimate f(y|θ). Celeux et al. (2006) note that DIC3 tends to under-estimate pD as the number of components in a mixture increase. Given no single model comparison statistic comprehensively assesses all aspects the performances of our models, we also estimate the log-pseudo marginal likelihood that employs “leave-one-out” cross-validation (Gelfand & Dey 1994). We estimate f(y) using ΠNi=1f (yi|yi, Mk), the log of which is the log pseudo marginal likelihood (LPML). We utilize in our LPML estimation the harmonic mean estimator for f (yi|yi, Mk) outlined in Congdon (2005). This method may display numerical sensitivity for outlying data points that express low likelihood values, in which case a law of large numbers result would not be achieved (Held et al. 2010). We thus also explored a weighted re-sampling of Θ from existing posterior draws in a fashion that provides model parameter samples from f (Θ|yi, Mk) (Stern & Cressie 2000). The importance weights are composed as reciprocals of likelihood values, and so may suffer from similar numerical instability as harmonic averaging. Congdon (2005) note that both approaches for leave-one-out estimation are generally numerically robust for log likelihood values above − 10 which is met in our application. Both methods produce similar estimates in our application and so we report harmonic mean results for − LP ML. LPML is expected to provide a relatively high complexity penalty as the computation relies on cross-validation to compose the required posterior predictive distribution (Plummer 2008).

6.2 Computational Approach

The joint posterior distribution of all parameters, Θ = (α, β, B, γ, τ), was simulated at a each iteration of a Gibbs sampler by sequentially sampling from each full conditional posterior distribution. Output from L Gibbs sampling iterations were retained following a burn-in phase of 30, 000 iterations and used to draw posterior inferences. We chose L = 120, 000, run multiple times with the Gibbs samplers initialized from randomly selected starting points. Convergence of the sampler was assessed by employing a fixed width estimator with Monte Carlo standard errors (MCSE) computed using the consistent batch means (CBM) method (Jones et al. 2006). Computational software for the posterior distribution simulations is available in our package for the R statistical software (R Development Core Team 2011) package called growcurves (Savitsky & Paddock 2012).

6.3 Results: Models without Multiple Membership Session Effects

Model fit statistics, , − LP ML, and DIC3, are presented in Table 1. One observes a marked drop (improvement) in all three model fit statistics for LGM(DP) versus the LGM models due to the greater flexibility in modeling growth curve shapes under the DP prior that borrows strength across clients. The improvement is not solely due to allowing for a quadratic random effect term in LGM(DP), as an LGM that includes a quadratic effect has worse fit (− LP ML = 3040). The DP prior enables better capture of within-client growth curves by borrowing estimation strength among a subset of clients whose curves express a convex (or bell-shaped) profile.

Table 1.

Model Fit Comparisons: , − LP ML and DIC3 scores for model alternatives. Lower values imply better performance.

Model LP M L DIC3
LGM(1RE) 5916 3064 6100
LGM(2RE) 5698 3028 5982
DP(LGM) 5520 2989 5691
MM(I) 5502 2987 5676
MM(CAR) 5501 2982 5675
MM-D(I) 5493 2976 5670
MM-D(CAR) 5485 2975 5665

6.4 Results: Models with Multiple Membership Session Effects

Modeling random session effects as independent (model MM(I)) improves model fit statistics. Allowing for correlated session effects using MM(CAR) results in further improvements (Table 1). Allowing each therapy group to have its own precision parameter (models MM-D(I) and MM-D(CAR)) further improves fit statistics. The value of making the MM models more flexible to therapy group-specific variation is demonstrated by noting how the range of session effects varies across groups, going from 0.6 points in Group 1 to 5 points in Group 2 (Figure 1). The MM-D(CAR) session effects trend for Group 4 highlight the session effects adapting to the data, for example oscillating between sessions 50 – 100. The property of the CAR prior to borrow strength among adjacent sessions allows for discovering a stronger signal in group 3 and accommodating shrinkage of relatively extreme session effects in group 4 under MM-D(CAR) versus MM-D(I). Exploratory analyses of BDI-II scores centered about the mean BDI-II at each measurement time show a similar pattern to what is shown in Figure 1.

Figure 1.

Figure 1

Posterior means for session effects from MM(CAR) and MM(I) models

The posterior mean for session s represents its contribution to BDI-II scores for clients attending session s, given treatment assignment and client random effects. Lower session effects indicate lower levels of depressive symptoms. For example, clients who attend only the first session of group 3 have a BDI-II score expected to be 2.3 points lower than the average, while those attending only the last session of group 3 have an average relative increase of 1 point. Though fit statistics (Table 1) support including session effects into the model, the practical importance of doing so is highlighted by noting the range of session effects shown in Figure 1 of 5 points, which is a clinically meaningful change in BDI-II scores (Furukawa 2010). Additionally, the variation in session effects suggests further attention in trying to explain whether session effects vary according to session-level factors, such as session leader or session content, might be warranted.

6.5 Results: Change over Time in BDI-II Scores

To provide a greater understanding of the total effect of modeling both session effects and using the DP to model client growth parameters, Figure 2 displays growth curves for a randomly selected subset of clients within the CBT and UC conditions comparing MM-D(CAR) to the base comparison LGM(1RE) and LGM(2RE) models. MM-D(CAR)-based growth curves better adapt in both shape and orientation to the data. For example, client “CBT, 26” highlights how the MM-D(CAR) better captures an augmentation of depressive symptoms between months 3 and 6 while retaining good fidelity to the attenuation pattern between 0 and 3 months. The curves for client “Usual Care, 283” displays enhanced flexibility for MM-D(CAR) to model a convex growth pattern. The LGM models generally detect the common concave or U-shaped growth curves but fit less well for those clients whose depressive symptoms decrease in more convex trajectories.

Figure 2.

Figure 2

Comparison of Posterior Means of Client Growth Curves under LGM(1RE), LGM(2RE) and MM(CAR) for Randomly-selected Clients.

To facilitate interpretation of model results, we display treatment effects estimated using the predictive margins technique (Lane & Nelder 1982) (Figure 3). For each MCMC sample, we predicted outcomes as if all clients were in group CBT, then predicted outcomes as if all clients were in UC, and subsequently summarized the client-level differences. The treatment effects at 3 months are provided in the left-hand panel and those for 6 months in the right-hand panel. As indicated by the 95% posterior probability intervals (indicated by the vertical line segments of the box plots) not crossing the y = 0 line, declines in mean BDI-II scores for CBT versus usual care are found under each model. The distributions of the treatment effect under the MM models are more concentrated than the LGM models; however, the LGM models express larger treatments effect than the remaining models, all of which include DP modeling of client random effects. This difference is due to the LGM being able to only discover concave or U-shaped trajectory of BDI-II scores, as illustrated in Figure 2. A strictly concave curve lies below a straight line connecting 0-to-6 months and produces a more negative BDI-II score and fixed effects estimates. In contrast, the DP is more sensitive to the 20% of clients whose BDI-II score imply convex growth curves associated with less change in depressive symptoms.

Figure 3.

Figure 3

Predictive margins for the treatment effect of CBT versus usual care at 3 (left panel) and 6 (right panel) months. Segments reflect the 95% credible intervals and boxes represent the interquartile range of the marginal posterior distribution.

7 Discussion

Our innovative combination of multiple membership and CAR modeling confronts a previously-unaddressed difficulty of analyzing post-treatment outcomes data from open-enrollment group therapy studies. While the multiple membership modeling-based approaches examined here can be readily applied to non-longitudinal post-treatment outcomes data, we illustrate our approach using longitudinal post-treatment data for several reasons. Since studies like BRIGHT typically aim to compare change over time between multiple treatment conditions, it is important to demonstrate that our approach generalizes to longitudinal data analyses. Modeling random session effects combined with nonparametric modeling of client growth parameters further demonstrates the flexibility of our novel approach for handling a range of application-specific data scenarios, such as accommodating the reality of having few post-treatment observations per client in substance abuse treatment group therapy studies. For applications in which more follow-up assessment times are available, one might consider employing an intrinsic Gaussian Markov field (Rue & Held 2005), penalized spline, or Gaussian process (Savitsky & Vannucci 2010) on a set of group-time means instead of the DP prior. In the context of the BRIGHT study, we show that MM modeling yields clinically meaningful variation in session effects across and within open-enrollment groups, emphasizing the importance of our modeling approach for the behavioral health research community. We further promote the use of our methodological advancements with a freely available software package called growcurves for the R statistical software platform (R Development Core Team 2011).

Next steps in our investigations include examining data from BRIGHT and other open-enrollment group therapy studies to compile estimates of key input parameters one would prospectively consider when designing open-enrollment group therapy studies to examine how statistical power to detect treatment effects varies based on these key parameters. Other important issues include how to examine the effects of session-specific features – such as session theme, session leader, or fidelity to which the intervention was delivered – on outcomes while accounting for their correlation with session effects. Our approach should prove useful over a variety of other behavioral data settings in which observations are likely to be correlated due to complex interactions among individuals that mirror those of the open-enrollment group therapy setting. Examples include measuring school effectiveness when a student’s achievement is linked to multiple schools (e.g. Hill & Goldstein 1998) or examining outcomes among persons participating in a common online forum, discussion board, or online expert panel (Dalal et al. 2011). Our software package is written in a general way to readily accommodate such applications.

Acknowlegdements

This research was supported by the National Institute on Alcohol Abuse and Alcoholism grant to Susan Paddock (grant number R01AA019663). Data collection was supported by National Institute on Alcohol Abuse and Alcoholism grant to Katherine Watkins (grant number R01AA014699). The authors would like to thank the Associate Editor and two referees for their valuable comments and suggestions.

The order of author names is strictly alphabetical.

References

  1. Banerjee S, Wall MM, Carlin BP. Frailty modeling for spatially correlated survival data, with application to infant mortality in Minnesota. Biostatistics. 2003;4(1):123–142. doi: 10.1093/biostatistics/4.1.123. [DOI] [PubMed] [Google Scholar]
  2. Bauer D, Sterba S, Hallfors D. Evaluating group-based interventions when control participants are ungrouped. Multivariate Behavioral Research. 2008;43(2):210–236. doi: 10.1080/00273170802034810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Beck A, Steer R, Brown G. Manual for the Beck Depression Inventory-II. Psychological Corporation; 1996. [Google Scholar]
  4. Besag J, Mollie A, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics. 1991;43:1–20. [Google Scholar]
  5. Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics. 1991;43:1–59. [Google Scholar]
  6. Blackwell D, MacQueen JB. Ferguson distributions via pólya urn schemes. The Annals of Statistics. 1973;1:353–355. [Google Scholar]
  7. Brown ER, Ibrahim JG. A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. Biometrics. 2003;59(2):221–228. doi: 10.1111/1541-0420.00028. [DOI] [PubMed] [Google Scholar]
  8. Celeux G, Forbes F, Robert CP, Titterington DM. Reply to comments on“Deviance information criteria for missing data models” (Pkg: P651–706) Bayesian Analysis. 2006;1(4):701–706. [Google Scholar]
  9. Congdon P. Bayesian models for categorical data. Chichester, England: John Wiley and Sons, Ltd.; 2005. [Google Scholar]
  10. Dalal S, Khodyakov D, Srinivasan R, Strauss S, Adams J. ExpertLens: A system for eliciting opinions from a large pool of non-collocated experts with diverse knowledge. Technological Forecasting and Social Change. 2011;78(8):1426–1444. [Google Scholar]
  11. Escobar MD, West M. Bayesian density estimation and inference using mixtures. Journal of American Statistical Association. 1995;90:577–588. [Google Scholar]
  12. Ferguson TS. A Bayesian analysis of some nonparametric problems. The Annals of Statistics. 1973;1:209–230. [Google Scholar]
  13. Furukawa T. Assessment of mood: Guides for clinicians. Journal of Psychosomatic Reseach. 2010;68:581–589. doi: 10.1016/j.jpsychores.2009.05.003. [DOI] [PubMed] [Google Scholar]
  14. Gelfand AE, Dey DK. Bayesian model choice: Asymptotics and exact calculations. Journal of the Royal Statistical Society, Series B: Methodological. 1994;56:501–514. [Google Scholar]
  15. Granholm E, McQuaid J, McClure F, Auslander L, Perivoliotis D, Pedrelli P, Patterson T, Jeste D. A randomized, controlled trial of cognitive behavioral social skills training for middle-aged and older outpatients with chronic schizophrenia. American Journal of Psychiatry. 2005;162:520–529. doi: 10.1176/appi.ajp.162.3.520. [DOI] [PubMed] [Google Scholar]
  16. Held L, Schrödle B, Rue H. Posterior and cross-validatory predictive checks: A comparison of MCMC and INLA. In: Kneib T, Tutz G, editors. Statistical Modelling and Regression Structures: Festschrift in Honour of Ludwig Fahrmeir. Physica-Verlag Ges.m.b.H; 2010. pp. 91–110. [Google Scholar]
  17. Hill PW, Goldstein H. Multilevel modeling of educational data with cross-classification and missing identification for units. Journal of Educational and Behavioral Statistics. 1998;23:117–128. [Google Scholar]
  18. Hodges JS, Carlin BP, Fan Q. On the precision of the conditionally autoregressive prior in spatial models. Biometrics. 2003;59(2):317–322. doi: 10.1111/1541-0420.00038. [DOI] [PubMed] [Google Scholar]
  19. Jones GL, Haran M, Caffo BS, Neath R. Fixed-width output analysis for Markov Chain Monte Carlo. Journal of the American Statistical Association. 2006;101(476):1537–1547. [Google Scholar]
  20. Kenny D, Milan S. Identification: A non-technical discussion of a technical issue. In: Hoyle R, Kaplan D, Marcoulides G, West S, editors. Handbook of Structural Equation Modeling. New York: Guilford Press; 2012. pp. 145–163. [Google Scholar]
  21. Lane PW, Nelder JA. Analysis of covariance and standardization as instances of prediction. Biometrics. 1982;38:613–621. [PubMed] [Google Scholar]
  22. Monti P, Kadden R, Rohsenow D, Cooney N, Abrams D. Treating Alcohol Dependence: A Coping Skills Training Guide. New York: Guilford Press; 2002. [Google Scholar]
  23. Morgan-Lopez A, Fals-Stewart W. Analytic complexities associated with group therapy in substance abuse treatment research: Problems, recommendations, and future directions. Experimental and Clinical Psychopharmacology. 2006;14(2):265–273. doi: 10.1037/1064-1297.14.2.265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Morgan-Lopez A, Fals-Stewart W. Analytic methods for modeling longitudinal data from rolling therapy groups with membership turnover. Journal of Consulting and Clinical Psychology. 2007;75(4):580–593. doi: 10.1037/0022-006X.75.4.580. [DOI] [PubMed] [Google Scholar]
  25. Müller P, Quintana FA, Rosner GL. Semiparametric Bayesian Inference for Multilevel Repeated Measurement Data. Biometrics. 2007;63(1):280–289. doi: 10.1111/j.1541-0420.2006.00668.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Muthén BO, Muthen BO, Curran PJ. General longitudinal modeling of individual differences in experimental designs: A latent variable framework for analysis and power estimation. Psychological Methods. 1997;2:371–402. [Google Scholar]
  27. Paddock SM, Hunter SB, Watkins KE, McCaffrey DF. Analysis of rolling group therapy data using conditionally autoregressive priors. Annals of Applied Statistics. 2011;5(2A):605–627. doi: 10.1214/10-AOAS434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Plummer M. Penalized loss functions for bayesian model comparison. Biostatistics. 2008;9(3):523–539. doi: 10.1093/biostatistics/kxm049. URL: http://dx.doi.org/10.1093/biostatistics/kxm049. [DOI] [PubMed] [Google Scholar]
  29. R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2011. ISBN 3-900051-07-0. [Google Scholar]
  30. Roberts C, Roberts S. Design and analysis of clinical trials with clustering effects due treatment. Clinical Trials. 2005;2:152–162. doi: 10.1191/1740774505cn076oa. [DOI] [PubMed] [Google Scholar]
  31. Rohsenow D, Monti P, Rubonis A, Gulliver S, Colby S, Binkoff J, Abrams D. Cue exposure with coping skills training and communication skills training for alcohol dependence: 6- and 12-month outcomes. Addiction. 2001;96:1161–1174. doi: 10.1046/j.1360-0443.2001.96811619.x. [DOI] [PubMed] [Google Scholar]
  32. Rue H, Held L. Gaussian Markov Random Fields: Theory and Applications. Chapman & Hall Ltd/CRC; 2005. [Google Scholar]
  33. Savitsky TD, Vannucci M. Spiked dirichlet process priors for generalized Gaussian process models. Journal of Probability and Statistics. 2010;2010(2010):1–14. doi: 10.1155/2010/201489. (Article ID 201489) [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Savitsky T, Paddock S. growcurves: Semiparametric Hierarchical Bayesian Modeling of Longitudinal Outcomes. R package version 2.14. 2012 URL: http://CRAN.R-project.org/package=growcurves. [Google Scholar]
  35. Sethuraman J. A contructive definition of Dirichlet priors. Stastica Sinica. 1994;4(2):639–650. [Google Scholar]
  36. Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society, Series B. 2002;64(4):583–616. [Google Scholar]
  37. Stern HS, Cressie N. Posterplior predictive model checks for disease mapping models. Statistics in Medicine. 2000;19(17–18):2377–2397. doi: 10.1002/1097-0258(20000915/30)19:17/18<2377::aid-sim576>3.0.co;2-1. [DOI] [PubMed] [Google Scholar]
  38. Watkins KE, Hunter SB, Hepner KA, Paddock SM, de la Cruz E, Zhou AJ, Gilmore J. An effectiveness trial of group cognitive behavioral therapy for patients with persistent depressive symptoms in substance abuse treatment. Archives of General Psychiatry. 2011;68(6):1–8. doi: 10.1001/archgenpsychiatry.2011.53. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES