Skip to main content
Sage Choice logoLink to Sage Choice
. 2022 Apr 29;11(1):59–76. doi: 10.1177/21677026221076832

The Development and Internal Evaluation of a Predictive Model to Identify for Whom Mindfulness-Based Cognitive Therapy Offers Superior Relapse Prevention for Recurrent Depression Versus Maintenance Antidepressant Medication

Zachary D Cohen 1, Robert J DeRubeis 2, Rachel Hayes 3, Edward R Watkins 4, Glyn Lewis 5,6, Richard Byng 6,7, Sarah Byford 8, Catherine Crane 9, Willem Kuyken 9, Tim Dalgleish 10,11, Susanne Schweizer 12,13,
PMCID: PMC7614103  EMSID: EMS140968  PMID: 36698442

Abstract

Depression is highly recurrent, even following successful pharmacological and/or psychological intervention. We aimed to develop clinical prediction models to inform adults with recurrent depression choosing between antidepressant medication (ADM) maintenance or switching to mindfulness-based cognitive therapy (MBCT). Using previously published data (N = 424), we constructed prognostic models using elastic-net regression that combined demographic, clinical, and psychological factors to predict relapse at 24 months under ADM or MBCT. Only the ADM model (discrimination performance: area under the curve [AUC] = .68) predicted relapse better than baseline depression severity (AUC = .54; one-tailed DeLong’s test: z = 2.8, p = .003). Individuals with the poorest ADM prognoses who switched to MBCT had better outcomes compared with individuals who maintained ADM (48% vs. 70% relapse, respectively; superior survival times, z = −2.7, p = .008). For individuals with moderate to good ADM prognoses, both treatments resulted in similar likelihood of relapse. If replicated, the results suggest that predictive modeling can inform clinical decision-making around relapse prevention in recurrent depression.

Keywords: antidepressant medication, depression, mindfulness-based cognitive therapy, precision medicine, relapse prevention


Globally, depression is now the leading cause of life years lived with disability (Patel et al., 2016, 2018). In many cases, the course of depression is recurrent over the life span (Kessler & Bromet, 2013), even following successful acute-phase interventions (Cuijpers et al., 2013). Successful prevention of the return of depression is therefore key to alleviating the individual and societal burden of depressive disorders. Antidepressant medication (ADM) following successful treatment is currently the predominant preventive intervention targeted at depressive relapse.1 Multiple agencies, including the UK National Institute for Health and Care Excellence (NICE), the British Association for Pharmacology (Cleare et al., 2015), and the American Psychiatric Association, recommend both prescription of ADM and mindfulness-based cognitive therapy (MBCT) after remission if a person is deemed at high risk of relapse because of multiple previous episodes or high residual symptoms (Gelenberg et al., 2010; NICE, 2009). An international review of 13 sets of ADM guidelines revealed that recommendations for the duration of such continuation or maintenance2 treatment in people deemed at high risk ranged from 1 year to lifelong or indefinite (Piek et al., 2010). Unsurprisingly, therefore, longer term use of ADMs is high and rising (Mojtabai & Olfson, 2014; Organisation for Economic Co-operation and Development, 2013), which accounts for the recorded increase in person-years on ADMs from 0.73 in 1995 to 4.94 in 2012 reported in the United Kingdom (McCrea et al., 2016). The antidepressant benefits of longer term ADM use are tempered by diverse physical and emotional side effects in the majority of patients (Bet et al., 2013; Cartwright et al., 2016), tachyphylaxis and other loss-of-response phenomena (Bosman et al., 2018; Fornaro et al., 2019; Kinrys et al., 2019), and user surveys indicating a desire for evidence-based psychosocial interventions as an alternative to ADMs for all aspects of depression management (Dorow et al., 2018; McHugh et al., 2013; Schweizer et al., 2010).

One such alternative is MBCT, an 8-week, group-based program that has emerged as a leading evidence-based psychological intervention for relapse prevention in recurrent depression (Kuyken et al., 2016). In a multicenter definitive randomized controlled trial (RCT; N = 424)—the PREVENT trial (Kuyken, Byford, et al., 2010; Kuyken et al., 2014)—we evaluated MBCT combined with support to taper or discontinue ADM against maintenance of a clinical dose of ADM for 2 years in patients (age ≥ 18 years) with recurrent depression (at least three previous episodes) who were in partial or full remission on ADM.3 The trial showed no significant differences in relapse over 2 years between the MBCT and ADM groups (hazard ratio [HR] = 0.89, 95% confidence interval [CI] = [0.67, 1.18]; p = .43; relapse rate: 44% MBCT vs. 47% ADM; Kuyken et al., 2015a), a finding corroborated by an individual patient data meta-analysis of 1,258 patients from nine RCTs (Kuyken et al., 2016). Recent meta-analytic work has confirmed and expanded these findings: A network meta-analysis by McCartney et al. (2021) provided additional evidence that MBCT is superior to control conditions in terms of rate of relapse (MBCT vs. treatment as usual) or time to relapse (MBCT vs. treatment as usual or placebo), and a meta-analysis by Breedvelt and colleagues (2021) added evidence of the superiority of combination psychological prevention and continuation ADM over continuation ADM alone.

Given the association between depressive relapse and negative long-term outcomes, helping individuals select the optimal intervention for relapse prevention (from among the available options) is of high importance. Treatment guidelines stipulate that patient preferences should inform treatment selection through a process of shared decision-making (Weston, 2001), and there is some evidence that treatment outcomes are superior for preferred versus nonpreferred treatments (Kwan et al., 2010; Shay & Lafata, 2015; Windle et al., 2019). A critical component of effective shared decision-making is ensuring that comparative evidence for different interventions in the context of the patient’s own clinical profile—what works for whom—is available at the point of care delivery (Winston et al., 2018). This information can come in a variety of different forms, including decision aids (Stacey et al., 2017) or more quantitative outcomes from clinical-prediction models (Bonnett et al., 2019). Recent methodological and empirical advances in “precision medicine” (F. S. Collins & Varmus, 2015) have generated prediction models that provide indices to identify which patients might expect improved clinical outcomes following different acute treatments for depression (Chekroud et al., 2021; Cohen & DeRubeis, 2018; Cohen et al., 2021; Perlis, 2013). A variety of factors are known to predict risk of depressive relapse (Buckman et al., 2018), but clinical prediction models in this area are lacking. Moriarty and colleagues’ (2021) systematic review of prognostic models for predicting depressive relapse identified 10 unique prognostic models, but the studies’ high bias and models’ poor predictive performance suggest that further work is needed.

The participants in PREVENT were assessed at trial baseline on a broad range of psychosocial variables that putatively have a bearing on treatment outcome (Kuyken et al., 2015a). Here, we focus on identifying patient characteristics and constructing prognostic models that could putatively guide the treatment choice between continuing ADM versus MBCT with support to taper or discontinue antidepressant treatment for the prevention of depressive relapse.

Method

For a checklist corresponding to the TRIPOD (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis) guidelines (G. S. Collins et al., 2015), see the Supplemental Material available online.

Data set description

The full PREVENT sample comprised 424 individuals randomly assigned (1:1) to ADM or MBCT. Participants with more than 20% missing data on predictor variables (n = 15), no data beyond baseline (n = 17), or not in receipt of a dose of MBCT deemed sufficient (at least four sessions; following the PREVENT trial protocol: Kuyken et al., 2010, 2015a) for evaluation of MBCT as an intervention alternative (n = 25) were excluded from the primary analyses. This led to a sample of 367 participants for the primary analyses. For the data exclusion pipeline, see Figure S1 in SM1 in the Supplemental Material. Sensitivity analyses were performed to probe the impact of removing the 25 who were excluded because of inadequate MBCT dose. The results for this larger sample (n = 392) are included in SM2 in the Supplemental Material. Descriptive data for the predictor variables at baseline are provided in the SM3 in the Supplemental Material, as are comparisons of the two treatment groups (ADM vs. MBCT; see Table S1 in the Supplemental Material) and the excluded versus included samples (see Table S2 in the Supplemental Material). These comparisons indicated that there was a significantly greater proportion of women in the ADM group and that ADM participants reported more comorbid diagnoses, had a lower probability that their most recent episode of depression was chronic (≥ 24 months in duration), and were younger, at baseline, compared with the MBCT group (see Table S1 in the Supplemental Material). Relative to the analysis sample, excluded participants were, on average, 4 years younger, had 0.3 more comorbid diagnoses, and reported lower scores on the Dispositional Positive Emotions Scale Curiosity subscale, Self-Compassion Scale Isolation subscale, and the Five-Facets Mindfulness Questionnaire Describe subscale (see Table S2 in the Supplemental Material).

Predictor variables

The PREVENT study included a wide range of 53 potential demographic, clinical, and psychological predictor variables (Table 1). The demographic and clinical predictors were selected because they are available in clinical practice, and indeed, many are commonly included as part of routine diagnostic procedures. Psychological predictors included standardized self-report measures of potential mechanisms of treatment efficacy (including mindfulness, self- and other-compassion, and repetitive thinking).

Table 1.

Predictors Included in the Variable-Selection Analysis

Variable Description
Demographic
 Age Age at baseline in years
 Gender Binary variable, reflecting self-identified gender: female or male (variable was made dichotomous because most individuals identified as one or other gender)
 Education Level of education attained, in which 0 = no educational qualification, 1 = O levels or GCSEs, 2 = AS and A levels (UK Advanced Level), 3 = vocational training/qualification, 4 = university bachelor’s degree, 5 = university master’s degree, and 6 = university professional training/PhD
 Relationship status Binary variable: no (single/divorced/widowed) versus yes (married/civil partnership/cohabiting)
 Employment status Binary variable: unemployed versus full- or part-time
Clinical
 Clinician-rated depressive symptoms The total score on the GRID-Hamilton Rating Scale for Depression (GRID-HAMD; Williams et al., 2008) was used as an index of clinician-rated depressive symptoms. The GRID-HAMD is a scale that offers explicit standardized scoring guideline for the clinician-rated assessment of depression. The scale consists of 17 items assessing symptoms of depression that are rated on a scale from 0 (not present) to 4 (severe).
 Self-reported depressive symptoms The total score of the Beck Depression Inventory–II (Beck et al., 1996) was used to assess self-reported symptoms of depression. The 21-item scale requires participants to endorse symptom levels ranging from 0 (not present) to 3 (severe).
 Age of depression onset Age at first depressive episode
 Chronicity Binary variable indicating whether the duration was ≥ 24 months
 Previous psychological treatment Binary variable indicating whether the participant had received a previous psychological treatment
 Previous suicide attempt Binary variable indicating whether the participant had previously attempted suicide or not
 Family history of depression Binary variable indicating whether the participant reported a family history of mood disorders or not
 Comorbidity Number of comorbid diagnoses
Psychological
 Validated questionnairesa
  Five Facet Mindfulness Questionnaire (FFMQ; Baer et al., 2006) The FFMQ measures five facets of mindfulness: (a) Observe – observing internal and external experiences (eight items); (b) Describe – describing internal experiences/states verbally (eight items); (c) Aware – acting with awareness (eight items), (d) Non-Judging – a nonjudgmental stance toward one’s thoughts and feelings (eight items), and (e) Non-Reactivity – allowing thoughts and feelings to come and go (seven items). Individuals rated the extent to which they experienced these states ranging from 1 (never or very rarely true) to 5 (very often or always true).
  Self-Compassion Scale (SCS; Neff, 2003) The SCS consists of six self-compassion subscale factors: Self-Kindness (five items), Self-Judgment (five items), Common Humanity (four items), Isolation (four items), Mindfulness (four items), and Over-Identification (four items). We additionally included a bespoke subscale that assesses compassion for others. Ratings are provided on a scale ranging from 1 (almost never) to 5 (almost always).
  Dispositional Positive Emotion Scale (DPES; Shiota et al., 2006) We included the following DPES subscales: Joy (six items), Contentment (five items), Love (six items), Compassion (five items), and Awe (six items). We additionally included a bespoke subscale that assesses Curiosity for internal and external experiences. Ratings were provided ranging from 1 (strongly disagree) to 5 (strongly agree).
  Cognitive Emotion Regulation Questionnaire (CERQ; Garnefski et al., 2001) The CERQ is a 36-item questionnaire assessing individuals’ propensity to employ four maladaptive (Catastrophizing, Rumination, Other-Blame, and Self-Blame) and five adaptive (Acceptance, Positive Refocusing, Positive Reappraisal, Putting into Perspective, and Refocus on Planning) emotion-regulation strategies when they were confronted with negative events. Item ratings ranged from 1 (almost never) to 5 (almost always).
  Cambridge-Exeter Repetitive Thought Scale (CERTS; Barnard et al., 2007) The CERTS assesses individuals’ dispositional tendency for Brooding (Section 1), the temporal course of their brooding thinking (Section 2), dispositional tendency for repetitive thinking in general (Section 3), difficulties disengaging from repetitive thinking (Section 4), and attitudes toward repetitive thinking (Section 5). For Sections 1 through 4, responses were provided with respect to eight scenarios: (a) feeling sad, (b) feeling happy, (c) feeling angry, (d) feeling anxious, (e) being with others, (f) being alone, (g) experiencing a set-back, and (h) making progress. In Sections 1, three to five items were given ratings ranging from 1 (almost never) to 5 (almost always), and in Section 2, items were rated from only moments to what seems like hours.
  Measure of Parental Style (MOPS; Parker et al., 1997) The MOPS was administered to assess levels of parental abuse experienced as a child. Participants indicate to what extent 15 statements about their mother and father (30 items total) were true for the first 16 years of their lives. Participants rated the statements from 0 (not at all true) to 3 (extremely true). A median split was used to categorize participants as high or low (see Kuyken et al., 2015).
  General Self-Efficacy Scale (GSE; Schwarzer & Jerusalem, 1995) The GSE is a 10-item scale that assessed individuals’ sense of self-efficacy over the past 2-week period. Participants answered the scale on items from 1 (definitely disagree) to 5 (definitely agree).
 Bespoke measures
  Stigmatization and normalization (SN) SN was a bespoke seven-item questionnaire asking individuals to indicate how often they experienced stigmatization because of their depression. Items were rated on a scale from 1 (almost never) to 5 (almost always).
  Warning signs (WS) WS was a bespoke six-item questionnaire assessing individuals’ ability to recognize warning signs of depression. Responses ranged from 1 (almost never) to 5 (almost always).
  Relationship satisfaction (RS) RS was assessed with a bespoke questionnaire that individuals were asked to complete thinking of the most important relationship in their lives. The scale’s seven items assess relationship satisfaction on a scale ranging from 1 (almost never) to 5 (almost always).
  Preference for mindfulness- based cognitive therapy (MBCT) Item assessing participants’ sentiment about being assigned to MBCT (Question: “How do you feel about the possibility of being in an MBCT group”), rated on a Likert scale from 1 (not positive at all) to 5 (extremely positive).
  Preference for antidepressant medication (ADM) Item assessing participants’ sentiment about being assigned to ADM (Question: “How do you feel about remaining on your ADMs”), rated on a Likert scale from 1 (not positive at all) to 5 (extremely positive).
  Preference for therapy type Item assessing participants’ preferred treatment option (Question: “Do you have a preference for a group”), rated on a Likert scale from 1 to 5 (1 = MBCT, 3 = no preference, 5 = continue on ADM).

Note: Individuals were asked to complete all measures with respect to the previous 2 weeks. The scaling was standardized to facilitate interpretation from factor analyses and similar computations planned for the trial. The labels of the original scales were maintained. GCSE = General Certificate of Secondary Education.

a

These scales were scored on a 5-point Likert scale irrespective of their original scoring range.

Missing predictor variable data at baseline were imputed using the full (N = 424) sample via the missForest (Version 1.4; Stekhoven & Bühlmann, 2012) package for the R software environment (Version 3.5.1; R Core Team, 2018), which implements a random-forest-based nonparametric imputation approach. Random-forest-based imputation compared favorably in several evaluations of different imputation approaches (Hong & Lynn, 2020; Shah et al., 2014; Stekhoven & Bühlmann, 2012; Waljee et al., 2013).

For the 53 potential predictors assessed at baseline in the PREVENT data, following imputation, continuous variables were z-scored, and dichotomous variables were set to −0.5 and 0.5. No outcome data were included in the imputation of the missing baseline data. Note that the education variable was imputed as an ordered categorical variable and then was converted into a continuous (numeric) variable for the remainder of the analyses.

Statistical approach to treatment selection

An in-depth discussion of how data can be used to create and evaluate treatment recommendations can be found in a recent review of treatment selection (Cohen & DeRubeis, 2018). The core concept is that statistical models are constructed and used to generate predictions for an individual in two (or more) treatments, and then those predictions are used to determine which treatment to recommend (Cohen et al., 2019). Much of the work in this space (e.g., the Personalized Advantage Index approach; DeRubeis et al., 2014) has been based on the proportional-interaction model. Luedtke and colleagues (2019) highlighted potential problems with the use of this approach in the small RCT samples that are often available, including the fact that implicit estimation and testing of interaction effects (vs. main effects) requires larger samples. Their simulation work suggested that sample sizes of at least 300 per condition are required for adequate statistical power to detect clinically significant improvements in response associated with model-based treatment selection. Other approaches that have been demonstrated rely solely on prognostic models (e.g., Lorenzo-Luaces et al., 2017; Wiltsey Stirman et al., 2021). For a discussion and contrasting of these different approaches, see Cohen et al. (2021). Following the approach proposed by Kessler et al. (2017) and demonstrated by Deisenhofer and colleagues (2018), we constructed separate prognostic algorithms for each of our two treatment conditions (MBCT and ADM). For each patient, a “factual prediction”—how well they were expected do in their actual treatment arm on the basis of their scores on the variables selected for that treatment’s prognostic model—was generated, as was a “counterfactual prediction”—how well they would hypothetically have done in the alternative treatment arm on the basis of their scores on the predictors that were included in the prognostic model for the alternative treatment arm.

In this approach, the predictive performance of each of the two separate treatment arm algorithms could be independently evaluated (see below for information about cross-validation) by comparing the factual predictions with the observed outcomes. If both algorithms yielded inaccurate factual predictions, this would have revealed that the data, or the modeling procedures that were implemented, did not provide a useful signal for prediction purposes. If both models yielded accurate factual predictions, the computed difference between the sets of predictions for MBCT and ADM could have served as an index for each patient that indicated which of the two treatments would be optimal (Cohen & DeRubeis, 2018). Finally, if only one of the models (e.g., Tx-A) yielded accurate factual predictions, that model on its own could be evaluated for its potential utility for guiding treatment decisions. Patients could be arrayed according to their predicted outcome in the condition with the reliable prognostic model (Tx-A). In the absence of reliable information about expected response to the other treatment (Tx-B) and the assumption that the two treatments yielded similar outcomes on average, participants with poor prognoses in Tx-A could be reasonably advised to try Tx-B, whereas a sensible recommendation for those with good prognoses in Tx-A would be Tx-A. Thus, the expectation in this scenario would be that differential response would be observed across the spectrum of Tx-A prognoses.

We applied this approach to the PREVENT data, and below we outline the steps of variable selection, cross-validation, and assessment of model fit involved in building and evaluating the prognostic algorithms for MBCT and ADM. Although analyses revealed the MBCT model to have poor predictive performance (as indicated by low area under the curve [AUC]), the ADM model evidenced good predictive performance and was superior to a benchmark model constructed using only baseline depression severity. Consequently, we generated and evaluated the treatment-selection indices using the ADM prognostic model only. This allowed us to ask the question of whether there were differential outcomes for participants who received MBCT versus ADM in patients predicted to do well, moderately, or poorly if they continued with ADM. The details regarding the model building and evaluation for the poorly fitting MBCT model are described in detail in SM4 in the Supplemental Material.

Cross-validation

When using cross-validation in the context of predictive model evaluation, it is essential to protect against “double-dipping” (Hastie et al., 2009). For example, it is critical that the predictions that are evaluated are generated from models that are constructed (in terms of variable selection, hyperparameter tuning, and weight setting) without the use of data from individuals for whom the predictions are being made.

We performed 10-fold cross-validation (Hastie et al., 2009), which involved splitting both the ADM and MBCT samples into 10 subgroups, balanced on outcomes (Fig. 1, Step 1). Each of the 10 ADM subgroups was then held out (Fig. 1, Step 2), and a prediction model was constructed using the remaining nine ADM subgroups as the training sample (Fig. 1, Steps 3–6). That model was then applied to the 10th ADM group to generate factual predictions of expected response in ADM (Fig. 1, Step 7a) and was also applied to the entire MBCT sample to generate counterfactual predictions of their expected response if they had received ADM (Fig. 1, Step 7b). The protections needed differ when generating factual and counterfactual prediction for each treatment arm. When predicting ADM outcomes for the MBCT sample, no cross-validation is needed because the ADM model was constructed without the MBCT individuals and thus can be applied to these individuals without concern over double-dipping.

Fig. 1.

Fig. 1.

Schematic of cross-validation procedure for producing antidepressant medication (ADM) predictions for the full analysis sample. Ten key steps in the procedure are indicated by circled numbers. Step 1 (10-fold cross-validation [CV]): The main analysis sample was separated into ADM and mindfulness-based cognitive therapy (MBCT) samples, each of which was then split into 10 subgroups, balanced on outcomes. Step 2: The ADM sample was separated into its first train-test samples, and the first of the 10 subgroups was held out as ADM test sample (1); the other nine subgroups constituted ADM training sample (1). Steps 3 and 4: ADM training sample (1) was then itself split into 10 subgroups, and parameter tuning was performed using internal 10-fold CV; this entire process was repeated three times using different random permutations of the internal 10-fold CV of ADM training sample (1). Step 5 (hyperparameter optimization): The optimal alpha (a) and lambda (λ) were selected and used in Step 6 (model specification), in which elastic-net regularized regression (ENRR) was applied to the entire ADM training sample (1) to derive the ADM training sample (1) Model. Step 7a: This model was then used to generate factual predictions for the held-out ADM test sample (1) and to generate counterfactual predictions (Step 7b) for the entire MBCT sample. Step 8: Steps 2 through 7 were then repeated nine more times to complete the 10-fold CV. Step 9a: The resulting set of (protected) factual predictions for the entire ADM sample (likelihood of relapse in ADM) were then evaluated using the area under the receiver operating characteristic curve. Step 9b: The set of 10 (protected) counterfactual predictions for each individual in the MBCT sample (likelihood of relapse if they had received ADM) were averaged, which resulted in a set of averaged “ensemble” counterfactual predictions for the MBCT sample. Step 10: The ADM and MBCT samples and their ADM predictions were then recombined, which resulted in protected prognoses under ADM for the full analysis sample.

This process was repeated nine more times for each of the other nine ADM subgroups (Fig. 1, Step 8), which resulted in the generation of a single “protected” factual prediction for each of the individuals in the ADM condition. The 10 protected counterfactual predictions (one from each of the 10 ADM models) for each of the individuals in the MBCT condition were averaged to create an ensemble counterfactual prediction of how patients who received MBCT would have been expected to fare had they received ADM (Fig. 1, Step 9b). The analogous process was then performed for the MBCT group, which resulted in each individual in MBCT receiving a single factual prediction of their outcomes in MBCT and patients in the ADM condition receiving ensemble counterfactuals for their expected outcomes had they received MBCT (see Fig. S3 in the Supplemental Material).

Finally, to provide a benchmark to help in the evaluation of these multivariable prediction models, we used the same cross-validation strategy, again in both groups, to generate predictions from “severity only” models (constructed using logistic regression), in which the only predictor available to the models was baseline symptom severity on the clinician-assessed Hamilton Rating Scale for Depression (HAMD; Hamilton, 1967; see Fig. S2 in the Supplemental Material), assessed using the 17-item GRID-HAMD (Williams et al., 2008).4 Outcome for all models was relapse, which was assessed retrospectively via the Structured Clinical Interview for DSM-IV (First et al., 1995) at five time points across the 24-month study period (1 month after acute intervention, and then 9, 12, 18, and 24 months after randomization; Kuyken et al., 2015a). See Figure 1 for a schematic summarizing the analytic pipeline.

Modeling via elastic-net regularized regression

Multivariable prognostic models were constructed using elastic-net regularized regression (ENRR; Zou & Hastie, 2005; see Fig. 1, Step 6). ENRR allows for the estimation of the predictive utility of a large number of variables, and its use has been demonstrated and extensively discussed in several previous predictive modeling efforts (Buckman et al., 2021; Chekroud et al., 2016, 2017; Cohen et al., 2020; Iniesta et al., 2016; Kim et al., 2019; Pearson et al., 2018; Webb et al., 2020). ENRR combines the L1 (LASSO penalty) and L2 (ridge regression penalty) penalization, which provides a hybrid of least absolute shrinkage and selection operator (LASSO) and ridge regression that thus addresses issues of correlated predictors and overfitting by shrinking coefficients of correlated predictors toward each other and by removing uninformative predictors from the model (Hastie et al., 2009). ENRR was implemented using the R package glmnet (Version 2.0-16; Friedman et al., 2010). Hyperparameter optimization (Fig. 1, Steps 3–5) was performed within each training sample using nested cross-validation. To reduce a source of potential bias (risk of overfitting because of information leakage from the test cases; Pearson et al., 2018) that can arise when a grid search is performed for hyperparameter setting in the context of cross-validation, we used three tuning loops (as suggested by a reviewer), 10-fold cross-validation (Friedman et al., 2010; Zou & Hastie, 2005, p. 310), and a small set of α values (.01, .5, .99) as implemented in the R package beset (Version 0.0.0.9409; Shumake, 2022) and described in Pearson et al. (2018) and McNamara et al. (2021). These three α values represent heavy weighting of the ridge penalty (α = .01), heavy weighting of the LASSO penalty (α = .99), or equal weighting (α = .5). The λ path of 100 possible values was generated using the glmnet package’s default calculation equation for λ path. In addition, the regularization parameter λ was selected using the one-standard-error rule, which helps to avoid overfitting and elevated Type I error (James et al., 2013; Waldmann et al., 2013). All analyses were performed in the R software environment (R Core Team, 2018); for additional information about packages used, see SM5 in the Supplemental Material.

Evaluating the models

Primary evaluation of model performance was performed via receiver-operating-characteristic (ROC) curves, which delineate the relative sensitivity (true-positive rate) and specificity (false-positive rate) of a model’s predictions at different thresholds. The area under the ROC curve (AUC) was used to quantify each model’s discrimination; AUCs of 0.5 indicate no or “chance” discrimination, and AUCs of 1 indicate perfect discrimination. In this context, because we are evaluating the outcome, “Did a relapse occur?” (yes/no), the AUC is equivalent to the concordance or c-statistic (Steyerberg et al., 2010). Another important aspect of model performance to evaluate is calibration (Van Calster et al., 2019); following recommendations based on sample size, we present only “weak calibration,” assessed via the calibration intercept and slope, with target values of 0 for the intercept (in which negative and positive values suggest overestimation and underestimation, respectively) and 1 for the slope (in which slopes > 1 indicate predictions that are too conservative and slopes < 1 indicate those that are too extreme).

We computed AUCs for the ENRR models’ factual predictions for patients in each treatment arm (ADM and MBCT; see Step 9a of both Fig. 1 and Fig. S3 in the Supplemental Material). We also computed the AUC for each treatment arm for the depression-severity-at-baseline-only logistic regression models (HAMD) as a benchmark to compare against the more complex multivariable models. Within each treatment arm, we then compared these two AUCs using a one-tailed DeLong test for correlated ROC curves (DeLong et al., 1988) under the hypothesis that the multivariable models would outperform the benchmark models.

Evaluating prognostic utility

As noted in the results and described in detail in SM4 in the Supplemental Material, the internally cross-validated evaluation of the MBCT model’s factual predictions found that they were near chance and that they failed to noticeably outperform the HAMD model. We therefore focused our evaluation of prognostic utility on the ADM model alone under the rationale that in the absence of trustworthy information about MBCT prognosis, it would be rational to evaluate whether individuals who are predicted to have a high risk of relapse if they maintain ADM might have a better (relative) predicted outcome with a switch to MBCT. Likewise, we wanted to examine whether patients predicted to have good prognoses with ADM might be better advised to maintain the treatment regimen they are already following (i.e., ADM).

To evaluate the overall utility of the predictions generated by the ADM prognostic model in guiding treatment selection, we used two tertiles to divide the sample into three groups (Altman & Bland, 1994) on the basis of risk of relapse in ADM (good ADM prognosis, moderate ADM prognosis, and poor ADM prognosis). Sample sizes and descriptive statistics for the ADM prognoses (i.e., means, standard deviations, and ranges) for three groups, broken down by treatment received, are available in Table S3 in the Supplemental Material. Predictive utility of the ADM prognostic index was then evaluated by examining the time to relapse (in a survival analysis using Cox regression) and overall relapse rates over the 2-year follow-up. The independent variables were treatment condition (ADM, MBCT), ADM prognosis (both as a continuous variable and in categorical form: good, moderate, poor), and their interaction. For any significant interactions, the effects of treatment group were analyzed within each of the three prognostic categories.

Results

Model predicting relapse in the ADM treatment arm

Using observed depressive relapse (yes/no) over 24 months to evaluate the factual predictions in the ADM model that had been made without the use of each patient’s own data, we found that the AUC for the ADM ENRR model was 0.68 (Fig. 2a), which was significantly better (one-tailed DeLong test: z = 2.80, p = .003) than that of the ADM HAMD comparison model (AUC = 0.54; see Fig. S4 in the Supplemental Material). The ADM ENRR model had a calibration intercept of −0.02 (in the direction of overestimation of relapse) and a calibration slope of 1.49 (which suggests overly conservative predictions at both ends of the risk spectrum). In contrast, the MBCT model (AUC = 0.54) did not outperform the HAMD comparison model (AUC = 0.52; z = 0.37, p = .35), a detailed description of which is available in Figure S5 in the Supplemental Material. Additional information regarding calibration for all models is available in SM4 in the Supplemental Material.

Fig. 2.

Fig. 2.

Probability of relapse in the ADM model. The graph in (a) shows the area under the receiver-operating-characteristic (ROC) curve (AUC), which delineates the relative sensitivity (true-positive rate) and specificity (false-positive rate) of the prognostic multivariable antidepressant medication (ADM) elastic-net model. The AUC (red line) is plotted against the straight gray line, which represents the threshold at which the model has no predictive utility. The gray line indicates the likelihood that someone above and below that threshold on the prognostic index has an equal likelihood of relapse. That is, the larger (farther away from the gray line) the AUC, the greater a model’s predictive utility. The graph in (b) plots the predicted survival curves for time (measured in days) to depressive relapse over the 2-year follow-up period for each ADM-prognosis group (poor, moderate, good) as a function of the treatment they received (mindfulness-based cognitive therapy [MBCT] or ADM). The graph in (c) shows the observed relapse rates over the 2-year follow-up as a function of the ADM relapse risk, separately by treatment received.

The specific variables that were retained and their associated coefficient weightings varied across the 10 ADM ENRR models that were generated. The key results of these models are summarized in Table 2. An expanded version of this table describing all 53 variables that were considered is provided in Table S4 in the Supplemental Material, and the analogous information for the 10 MBCT models is available in Table S5 in the Supplemental Material.

Table 2.

Predictor Weightings for the ADM Prognostic Models Across 10-Fold Cross-Validation

Variable No. of times selected M SD Min Max
MOPS level of parental abuse (low/high)a 10 0.34 0.16 0.03 0.57
Previous depressive episode chronicitya,b 10 −0.33 0.17 −0.60 −0.02
DPES Contentment 10 −0.08 0.06 −0.20 −0.01
DPES Joy 10 −0.05 0.038 −0.12 −0.003
DPES Love 10 −0.07 0.04 −0.14 −0.01
CERTS Negative Rumination 9 0.05 0.03 0 0.10
CERTS Unresolution 9 0.07 0.06 0 0.15
Previous suicide attempta 8 0.10 0.09 0 0.26
Comorbidities 8 0.03 0.03 0 0.07
FFMQ Aware 8 −0.04 0.04 −0.11 0
CERQ Acceptance 8 0.04 0.05 0 0.14
GSE Self-Efficacy 7 −0.03 0.03 −0.08 0
Age of depression onset 6 −0.03 0.03 −0.08 0

Note: The table reports regression coefficients for the predictors in the ADM elastic-net prognostic models that were retained more than 50% of the time across the 10-fold cross-validation. In the model, all continuous variables entered were z-scored (M = 0, SD = 1), and dichotomous variables were set to −0.5 and +0.5. No. of times selected = number of times the variable was selected across the 10 cross-validations; min, max = minimum and maximum for variable’s coefficient value (includes zeros for when variable was not retained); ADM = antidepressant medication; MOPS = Measure of Parental Style (Parker et al., 1997); DPES = Dispositional Positive Emotion Scale (Shiota et al., 2006); CERTS = Cambridge-Exeter Repetitive Thought Scale (Barnard et al., 2007); FFMQ = Five Facet Mindfulness Questionnaire (Baer et al., 2006); CERQ = Cognitive Emotion Regulation Questionnaire (Garnefski et al., 2001); GSE = General Self-Efficacy Scale (Schwarzer & Jerusalem, 1995).

a

Dichotomous variables (set to −0.5 and +0.5).

b

Chronicity (no/yes) based on duration of previous depressive episode of 24 months or more.

Five baseline variables (from our set of 53) were retained as predictors of relapse across all 10 ADM ENRR models generated during the 10-fold CV procedure: level of child abuse, depression chronicity, and three subscales of the Dispositional Positive Emotions Scale (Shiota et al., 2006): Contentment, Joy, and Love. Higher levels of these positive emotions were associated with lower risk of relapse in ADM, whereas a history of child abuse was associated with increased risk of relapse. In the ADM models, having one’s most recent episode of depression be chronic (duration ≥ 24 months) was associated with reduced risk of relapse relative to people whose most recent episode was not. Two subscales of the Cambridge-Exeter Repetitive Thought Scale (Barnard et al., 2007) were retained in nine of the 10 models: Both Negative Rumination and Unresolution were associated with elevated risk of relapse in ADM. History of suicide attempt or attempts and number of comorbidities were both retained in eight of the 10 models and were associated with increased risk of relapse. Additional variables retained in more than 50% of the models are summarized in Table 2 and Table S4 in the Supplemental Material.

Prognostic utility

We first verified that the outcome data for our analysis sample were comparable with that of the total PREVENT sample (Kuyken et al., 2015a). As in the full sample, survival times (z = −1.02; p = .31, HR [MBCT relative to ADM] = 0.86; 95% CI = [0.64, 1.15]) and relapse rates (MBCT = 47.1%, ADM = 50.3%) during the 24-month follow-up period in our analysis sample did not differ significantly between the two treatment conditions. In the survival analysis, in which we examined time to relapse with main effects for treatment and continuous ADM prognosis, there was a significant main effect of continuous ADM prognosis (z = 4.615; p < .001). We next compared observed outcomes across the two treatment conditions for individuals according to their ADM prognoses (i.e., good, moderate, poor; Fig. 2b).

The survival curves did not differ across treatments for individuals with good ADM prognoses (HR reflecting increased risk of relapse for those in MBCT vs. ADM = 1.34; 95% CI = [0.73, 2.45]; p = .35). The same was true for individuals with moderate ADM prognoses (HR = 1.19; 95% CI = [0.73, 1.96]; p = .48). In contrast, individuals with poor ADM prognoses had significantly reduced relapse risk (HR = 0.52; 95% CI = [0.32, 0.84]; p = .008) if they switched to MBCT instead of staying on ADM.

When comparing rates of participants who had actually relapsed by the end of the 2-year follow-up period, the same pattern emerged (Fig. 2c). There was a significant main effect of ADM prognosis on observed relapse rates, χ2(2) = 16.16, p < .001. As expected, the individuals with good ADM prognoses showed the lowest rates of relapse (35%), the group with moderate prognoses showed an intermediate relapse rate (51%), and the group with the poor prognoses showed the highest rate of relapse (60%). Relapse rates were low for individuals with good ADM prognoses regardless of which treatment they received (ADM = 31%, MBCT = 38%). Relapse rates did not differ significantly as a function of treatment assignment for this group, χ2(1) = 0.45, p = .50, or for those with moderate ADM prognoses (ADM = 47%, MBCT = 56%): χ2(1) = 0.71, p = .40. However, for individuals with poor ADM prognoses, relapse rates were significantly worse for participants who received ADM (70%) compared with participants who received MBCT (48%): χ2(1) = 4.86, p = .03. Finally, results from the sensitivity analyses that repeated the above analyses in a sample that included the 25 MBCT participants who had been excluded for not having attended at least four sessions of MBCT aligned with the results from the primary analysis sample (see SM2 in the Supplemental Material).

Discussion

Clinical depression is a heterogeneous condition, which often runs a relapsing-and-remitting course across the life span and for which no single treatment has been shown to be effective for all patients (Fried, 2017; Fried & Nesse, 2015). A precision-medicine approach to depressive relapse prevention has potential utility in facilitating clinical choices between maintenance pharmacotherapy regimens and preventive psychosocial interventions such as MBCT.

We described a prognostic model that was developed using baseline data (demographic, clinical, and readily available psychological measures) from individuals who were randomly assigned to receive maintenance ADM following a successful course of acute treatment with ADM in an RCT comparing maintenance ADM with MBCT for relapse prevention. This ADM model (for a discussion of the predictors included in the model, see SM6 in the Supplemental Material), which predicts depressive relapse across a 24-month follow-up period, performed comparably with algorithms predicting acute remission response to antidepressants (Chekroud et al., 2016, 2017; Iniesta et al., 2016). We then generated ADM prognoses for the entire RCT sample (including participants randomly assigned to receive MBCT) to investigate whether the information from the ADM predictions might be helpful in deciding between staying on antidepressants or switching to preventive psychotherapy (MBCT). We observed a large difference in relapse rates for patients with poor ADM prognoses: 70% relapse in ADM versus 48% relapse in MBCT. In other words, patients with poor prognoses on ADM do not seem to simply be clinical nonresponders, but, rather, they may be individuals for whom MBCT represents a clinically beneficial alternative. Interpreted clinically, the findings suggest that if people present with a history of depression but do not report other risk factors, such as early abuse, anhedonia, rumination, and early onset, then ADM works well. However, our model suggests that when these other risk factors are present, it is worth considering MBCT because outcomes may be enhanced. This is consistent with other articles (Kuyken et al., 2016; Ma & Teasdale, 2004) that have suggested that for such individuals, there is more “grist for the MBCT mill” and possibly more motivation to engage in an active intervention such as MBCT (or indeed cognitive-behavioral therapy).

The survival model’s estimate of a 48% reduction in risk of relapse across the 24-month follow-up period (HR = 0.52) for patients with poor ADM prognoses who received MBCT versus ADM would suggest, if replicated, that such patients should pursue MBCT. The potential impact of the absolute observed difference in relapse rates (22%) for patients in the poor-ADM-prognosis subgroup who received ADM versus MBCT, however, is tempered by the fact that these individuals accounted for only one third of the sample. Yet the potential clinical utility of these findings may not necessarily be limited to this subgroup: Given the low relapse rates and lack of difference between treatments for patients with good ADM prognoses (31% ADM vs. 38% MBCT), such patients could be encouraged to select which relapse prevention strategy to pursue according to other factors. Clinically, our data indicate that treatment selection for depressive relapse prevention in individuals with recurrent depression who have a moderate to good ADM prognoses could be guided by factors such as patient preference, cost, and resource availability. Although resource availability may be a limiting factor, cost–benefit analyses have shown noninferiority of MBCT (Kuyken et al., 2015b), and some have even favored MBCT over ADM (Pahlevan et al., 2020). For individuals with poor prognoses on ADM, however, our data indicate that MBCT alongside tapering or cessation of medication to prevent relapse potentially confers a better clinical outcome and should be offered as an alternative to ADM. Recent systematic reviews and individual-participant meta-analyses suggest that combination relapse prevention, in which both medication continuation and preventive psychotherapy are provided, is superior to monotherapy and thus should also be considered for patients at higher risk for relapse.

Our study has a number of potential limitations. With the present data, we are unable to disaggregate the effects of MBCT from the tapering or discontinuation of ADM because they were both part of the MBCT protocol. We are also unable to comment on whether the effects are specific to MBCT or whether any effective alternative psychosocial intervention would offer potentially similar benefits for individuals with poor prognoses on maintenance ADM.

The utility of any model depends on its ability to generalize. The present algorithm was subjected to internal validation during variable selection and model building. The imputation of missing baseline data was not included in the cross-validation, but given the low number of missing data points, it is unlikely that this was a substantial source of bias. Previous work suggests that penalization and shrinkage methods may not provide as much protection as is assumed and that such methods (including ENRR) can produce unreliable clinical prediction models when sample sizes are small (Riley et al., 2021). Despite the internal cross-validation, we were not able to externally validate the model on a wholly independent sample because comparable sufficiently large trials evaluating the same preventive interventions with the same or a similar set of baseline measures are not currently available. This reflects the current state of precision medicine research (Cohen & DeRubeis, 2018), in which predictive models are too rarely subjected to proper external validation (Salazar de Pablo et al., 2021). Further external validation of the model5 and these results, when suitable data become available, will be an important next step before the translation of the current findings into firm treatment recommendations. Although we were fortunate to receive extensive reviewer feedback that allowed us to enhance our analytic approach, the many researcher degrees of freedom that remain represent potential threats to generalizability that merit caution and are worthy of further study.

Ideally, both the ADM and MBCT models would have been sufficiently robust to actively compare the two predictive indices to elucidate what works best for whom. However, our computed MBCT model did not perform above chance and was no better than a prediction model built solely on baseline depression severity scores. This lack of robust prediction within the MBCT model accords with the replicated finding that very few demographic, clinical, or psychological variables over and above baseline symptom severity appear to predict outcome to MBCT (Kuyken et al., 2016; Kuyken, Watkins, et al., 2010), which testifies to the intervention’s broad suitability. Second, in the present study, MBCT was combined with support for medication tapering or discontinuation, and it may be that the mixture of these two different intervention components (and possible associated effects of medication withdrawal) obscured any clear relations in the MBCT arm with the predictor variables included here.

The current findings represent a significant first step in the application of precision medicine to inform patient and clinician choice around optimal interventions for depressive relapse prevention. Additional work is needed to further validate the model reported here in wholly independent, yet-to-be-collected, large samples. The eventual success of this and similar personalized-medicine approaches to mental-health care will depend on the acquisition and dissemination of large-scale clinical data sets, which will allow for the development and validation of predictive models (Chekroud et al., 2017, 2021). The utility of these models must then be evaluated in prospective clinical trials (Delgadillo & Lutz, 2020), which have begun to emerge with promising results (e.g., Delgadillo et al., 2022; Lutz et al., 2022).

Supplemental Material

sj-pdf-1-cpx-10.1177_21677026221076832 – Supplemental material for The Development and Internal Evaluation of a Predictive Model to Identify for Whom Mindfulness-Based Cognitive Therapy Offers Superior Relapse Prevention for Recurrent Depression Versus Maintenance Antidepressant Medication

Supplemental material, sj-pdf-1-cpx-10.1177_21677026221076832 for The Development and Internal Evaluation of a Predictive Model to Identify for Whom Mindfulness-Based Cognitive Therapy Offers Superior Relapse Prevention for Recurrent Depression Versus Maintenance Antidepressant Medication by Zachary D. Cohen, Robert J. DeRubeis, Rachel Hayes, Edward R. Watkins, Glyn Lewis, Richard Byng, Sarah Byford, Catherine Crane, Willem Kuyken, Tim Dalgleish and Susanne Schweizer in Clinical Psychological Science

Acknowledgments

We are grateful to the participants for their time in taking part in this trial, and we also thank the following colleagues who have contributed to the PREVENT study through recruitment, retention, and treatment of patients or provision of administrative support: Claire Brejcha, Jessica Cardy, Aaron Causley, Suzanne Cowderoy, Alison Evans, Felix Gradinger, Surinder Kaur, Jonathan Richards, Harry Sutton, Rachael Vicary, Alice Weaver, Jenny Wilks, and Matthew Williams. We thank Marjolein Fokkema for her insight on elastic-net regularized regression. Finally, we thank the reviewers and editor of this manuscript for their insightful recommendations that have allowed us to improve the reported analyses.

1.

We note that a recent systematic review of 56 studies and almost 40,000 subjects by De Zwart and colleagues (2019) reexamined the distinction between relapse and recurrence established by Frank et al. (1991) and concluded that “the idea that a recurrence of depressive symptoms shortly after their initial remission constitutes a ‘relapse’ of the previous episode, whereas their later recurrence is the first sign of an entirely new episode, is a model that lacks empirical support” (De Zwart et al., 2019, p. 544). Therefore, in this article, we use the term “relapse” to describe a return of depressive symptoms meeting criteria from the fourth edition of the Diagnostic and Statistical Manual of Mental Disorders (as assessed via Structured Clinical Interview for DSM-IV), regardless of when during the 24-month follow-up it occurred.

2.

Although the terms “continuation” and “maintenance” are often used interchangeably in the literature, they are sometimes used more specifically (e.g., DeRubeis et al., 2019) to differentiate treatment following remission and up to recovery (continuation) from treatment past the point of recovery (maintenance). Here, we align with the trial’s main outcome article (Kuyken et al., 2015a) and use “maintenance” to describe all medication use during the study.

3.

We note that in the PREVENT trial’s main outcome article (Kuyken et al., 2015a), “MBCT-TS” was used as the abbreviation for mindfulness-based cognitive therapy with support to taper or discontinue antidepressant treatment and that “m-ADM” was used as the abbreviation for maintenance antidepressant medication, but here these two conditions are simply abbreviated as MBCT and ADM, respectively.

4.

From here forward, when we reference the “HAMD” (Hamilton Rating Scale for Depression) models (either for ADM or MBCT), we mean the simple “severity only” logistic regression comparison models that include only baseline HAMD, whereas when we reference the “ADM model” or the “MBCT model,” we mean the multivariable prognostic models constructed with elastic-net regularized regression.

5.

See SM7 in the Supplemental Material for further details regarding a model constructed using the full ADM analysis sample that could be subjected to external validation in future efforts.

Footnotes

ORCID iDs: Edward R. Watkins Inline graphic https://orcid.org/0000-0002-2432-5577

Susanne Schweizer Inline graphic https://orcid.org/0000-0001-6153-8291

Transparency

Action Editor: Christopher G. Beevers

Editor: Kenneth J. Sher

Author Contributions

T. Dalgleish and S. Schweizer should be regarded as joint senior authors. Z. D. Cohen, T. Dalgleish, and S. Schweizer conceived of the secondary analyses. Z. D. Cohen analyzed the data. Z. D. Cohen, T. Dalgleish, and S. Schweizer wrote the manuscript. All coauthors commented on the final version of the manuscript. All of the coauthors approved the final manuscript for submission.

The author(s) declared that there were no conflicts of interest with respect to the authorship or the publication of this article.

Funding: This research was funded by the National Institute for Health Research (NIHR) Health Technology Assessment program (Project 08/56/01). T. Dalgleish was partly supported by the National Institute for Health Research Cambridge Biomedical Research Centre and the UK Medical Research Council (Grant SUAG/043 G101400). S. Schweizer was supported by the Wellcome Trust (Grant 209127/Z/17/Z). Z. D. Cohen and R. J. DeRubeis were partly supported by the MQ Foundation (Grant MQ14PM_27). W. Kuyken and C. Crane were supported by the Wellcome Trust (Grant 104908/Z/14/Z and 107496/Z/15/Z). R. Byng was supported by the NIHR Collaboration for Leadership in Applied Health Research and Care South West Peninsula at the Royal Devon and Exeter National Health Service Foundation Trust. The views expressed in this publication are those of the authors and do not necessarily reflect those of the National Institute for Health Research (NIHR) Health Technology Assessment program, NIHR, the National Health Service, the Department of Health and Social Care, the Wellcome Trust, or the MQ Foundation. The opinions and assertions contained in this article should not be construed as reflecting the views of the sponsors.

References

  1. Altman D. G., Bland J. M. (1994). Statistics Notes: Quartiles, quintiles, centiles, and other quantiles. BMJ, 309(6960), 996–996. 10.1136/bmj.309.6960.996 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Baer R. A., Smith G. T., Hopkins J., Krietemeyer J., Toney L. (2006). Using self-report assessment methods to explore facets of mindfulness. Assessment, 13(1), 27–45. 10.1177/1073191105283504 [DOI] [PubMed] [Google Scholar]
  3. Barnard P. J., Watkins E. R., Mackintosh B., Nimmo-Smith I. (2007, September 12–14). Getting stuck in a mental rut: Some process and experiential attributes [Conference session]. 35th Congress of the British Association for Behavioural and Cognitive Psychotherapies, Brighton, England. [Google Scholar]
  4. Beck A. T., Steer R. A., Brown G. K. (1996). Manual for the Beck Depression Inventory-II. Psychological Cooperation. [Google Scholar]
  5. Bet P. M., Hugtenburg J. G., Penninx B. W. J. H., Hoogendijk W. J. G. (2013). Side effects of antidepressants during long-term use in a naturalistic setting. European Neuropsychopharmacology, 23(11), 1443–1451. 10.1016/j.euroneuro.2013.05.001 [DOI] [PubMed] [Google Scholar]
  6. Bonnett L. J., Snell K. I., Collins G. S., Riley R. D. (2019). Guide to presenting clinical prediction models for use in clinical settings. The BMJ, 365, Article 1737. 10.1136/bmj.l737 [DOI] [PubMed] [Google Scholar]
  7. Bosman R. C., Waumans R. C., Jacobs G. E., Oude Voshaar R. C., Muntingh A. D. T., Batelaan N. M., van Balkom A. (2018). Failure to respond after reinstatement of antidepressant medication: A systematic review. Psychotherapy and Psychosomatics, 87(5), 268–275. 10.1159/000491550 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Breedvelt J. J. F., Brouwer M. E., Harrer M., Semkovska M., Ebert D. D., Cuijpers P., Bockting C. L. H. (2021). Psychological interventions as an alternative and add-on to antidepressant medication to prevent depressive relapse: Systematic review and meta-analysis. British Journal of Psychiatry, 219(4), 538–545. 10.1192/bjp.2020.198 [DOI] [PubMed] [Google Scholar]
  9. Buckman J. E. J., Cohen Z. D., O’Driscoll C., Fried E. I., Saunders R., Ambler G., DeRubeis R. J., Gilbody S., Hollon S. D., Kendrick T., Watkins E., Eley T. C., Peel A. J., Rayner C., Kessler D., Wiles N., Lewis G., Pilling S. (2021). Predicting prognosis for adults with depression using individual symptom data: A comparison of modelling approaches. Psychological Medicine. Advance online publication. 10.1017/S0033291721001616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Buckman J. E. J., Underwood A., Clarke K., Saunders R., Hollon S. D., Fearon P., Pilling S. (2018). Risk factors for relapse and recurrence of depression in adults and how they operate: A four-phase systematic review and meta-synthesis. Clinical Psychology Review, 64, 13–38. 10.1016/j.cpr.2018.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cartwright C., Gibson K., Read J., Cowan O., Dehar T. (2016). Long-term antidepressant use: Patient perspectives of benefits and adverse effects. Patient Preference and Adherence, 10, 1401–1407. 10.2147/PPA.S110632 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chekroud A. M., Bondar J., Delgadillo J., Doherty G., Wasil A., Fokkema M., Cohen Z. D., Belgrave D., DeRubeis R. J., Iniesta R., Dwyer D., Choi K. (2021). The promise of machine learning in predicting treatment outcomes in psychiatry. World Psychiatry, 20(2), 154–170. 10.1002/wps.20882 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chekroud A. M., Gueorguieva R., Krumholz H. M., Trivedi M. H., Krystal J. H., McCarthy G. (2017). Reevaluating the efficacy and predictability of antidepressant treatments: A symptom clustering approach. JAMA Psychiatry, 74(4), 370–378. 10.1001/jamapsychiatry.2017.0025 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Chekroud A. M., Zotti R. J., Shehzad Z., Gueorguieva R., Johnson M. K., Trivedi M. H., Cannon T. D., Krystal J. H., Corlett P. R. (2016). Cross-trial prediction of treatment outcome in depression: A machine learning approach. Lancet Psychiatry, 3(3), 243–250. 10.1016/S2215-0366(15)00471-X [DOI] [PubMed] [Google Scholar]
  15. Cleare A., Pariante C. M., Young A. H., Anderson I. M., Christmas D., Cowen P. J., Dickens C., Ferrier I. N., Geddes J., Gilbody S., Haddad P. M., Katona C., Lewis G., Malizia A., McAllister-Williams R. H., Ramchandani P., Scott J., Taylor D., Uher R. (2015). Evidence-based guidelines for treating depressive disorders with antidepressants: A revision of the 2008 British Association for Psychopharmacology guidelines. Journal of Psychopharmacology, 29(5), 459–525. 10.1177/0269881115581093 [DOI] [PubMed] [Google Scholar]
  16. Cohen Z. D., Ashar Y. K., DeRubeis R. J. (2019). Moving beyond “One Size Fits All.” In Dimidjian S. (Ed.), Evidence-based practice in action: Bridging clinical science and intervention (pp. 111–132). The Guilford Press. [Google Scholar]
  17. Cohen Z. D., Delgadillo J., DeRubeis R. J. (2021). Personalized treatment approaches. In Barkham M., Lutz W., Castonguay L. G. (Eds.), Bergin and Garfield’s handbook of psychotherapy and behavior change (7th ed., pp. 673–704). Wiley Blackwell. [Google Scholar]
  18. Cohen Z. D., DeRubeis R. J. (2018). Treatment selection in depression. Annual Review of Clinical Psychology, 14(1), 209–236. 10.1146/annurev-clinpsy-050817-084746 [DOI] [PubMed] [Google Scholar]
  19. Cohen Z. D., Kim T. T., Van H. L., Dekker J. J. M., Driessen E. (2020). A demonstration of a multi-method variable selection approach for treatment selection: Recommending cognitive–behavioral versus psychodynamic therapy for mild to moderate adult depression. Psychotherapy Research, 30(2), 137–150. 10.1080/10503307.2018.1563312 [DOI] [PubMed] [Google Scholar]
  20. Collins F. S., Varmus H. (2015). A new initiative on precision medicine. New England Journal of Medicine, 372, 793–795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Collins G. S., Reitsma J. B., Altman D. G., Moons K. G. M. (2015). Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. BMC Medicine, 13(1), Article g7594. 10.1186/s12916-014-0241-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Cuijpers P., Hollon S. D., van Straten A., Bockting C., Berking M., Andersson G. (2013). Does cognitive behaviour therapy have an enduring effect that is superior to keeping patients on continuation pharmacotherapy? A meta-analysis. BMJ Open, 3(4), Article e002542. 10.1136/bmjopen-2012-002542 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Deisenhofer A.-K., Delgadillo J., Rubel J. A., Böhnke J. R., Zimmermann D., Schwartz B., Lutz W. (2018). Individual treatment selection for patients with posttraumatic stress disorder. Depression and Anxiety, 35(6), 541–550. 10.1002/da.22755 [DOI] [PubMed] [Google Scholar]
  24. Delgadillo J., Ali S., Fleck K., Agnew C., Southgate A., Parkhouse L., Cohen Z. D., DeRubeis R. J., Barkham M. (2022). Stratified care vs stepped care for depression: A cluster randomized clinical trial. JAMA Psychiatry, 79(2), 101–108. 10.1001/jamapsychiatry.2021.3539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Delgadillo J., Lutz W. (2020). A development pathway towards precision mental health care. JAMA Psychiatry, 77(9), 889–890. 10.1001/jamapsychiatry.2020.1048 [DOI] [PubMed] [Google Scholar]
  26. DeLong E. R., DeLong D. M., Clarke-Pearson D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics, 44(3), 837–845. 10.2307/2531595 [DOI] [PubMed] [Google Scholar]
  27. DeRubeis R. J., Cohen Z. D., Forand N. R., Fournier J. C., Gelfand L. A., Lorenzo-Luaces L. (2014). The Personalized Advantage Index: Translating research on prediction into individualized treatment recommendations. A demonstration. PLOS ONE, 9(1), Article e83875. 10.1371/journal.pone.0083875 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. DeRubeis R. J., Zajecka J., Shelton R. C., Amsterdam J. D., Fawcett J., Xu C., Young P. R., Gallop R., Hollon S. D. (2019). Prevention of recurrence after recovery from a major depressive episode with antidepressant medication alone or in combination with cognitive behavioral therapy: Phase 2 of a 2-phase randomized clinical trial. JAMA Psychiatry, 77(3), 237–245. 10.1001/jamapsychiatry.2019.3900 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. De Zwart P., Jeronimus B., De Jonge P. (2019). Empirical evidence for definitions of episode, remission, recovery, relapse and recurrence in depression: A systematic review. Epidemiology and Psychiatric Sciences, 28(5), 544–562. 10.1017/S2045796018000227 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Dorow M., Löbner M., Pabst A., Stein J., Riedel-Heller S. G. (2018). Preferences for depression treatment including internet-based interventions: Results from a large sample of primary care patients. Frontiers in Psychiatry, 9, Article 181. 10.3389/fpsyt.2018.00181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. First M. B., Spitzer R. L., Gibbon M., Williams J. B. W. (1995). The structured clinical interview for DSM-IV Axis I disorder with psychotic screen. New York Psychiatric Institute. [Google Scholar]
  32. Fornaro M., Anastasia A., Novello S., Fusco A., Pariano R., De Berardis D., Solmi M., Veronese N., Stubbs B., Vieta E. (2019). The emergence of loss of efficacy during antidepressant drug treatment for major depressive disorder: An integrative review of evidence, mechanisms, and clinical implications. Pharmacological Research, 139, 494–502. [DOI] [PubMed] [Google Scholar]
  33. Frank E., Prien R. F., Jarrett R. B., Keller M. B., Kupfer D. J., Lavori P. W., Rush J. A., Weissman M. M. (1991). Conceptualization and rationale for consensus definitions of terms in major depressive disorder: Remission, recovery, relapse, and recurrence. Archives of General Psychiatry, 48, 851–855. 10.1001/archpsyc.1991.01810330075011 [DOI] [PubMed] [Google Scholar]
  34. Fried E. I. (2017). Moving forward: How depression heterogeneity hinders progress in treatment and research. Expert Review of Neurotherapeutics, 17(5), 423–425. 10.1080/14737175.2017.1307737 [DOI] [PubMed] [Google Scholar]
  35. Fried E. I., Nesse R. M. (2015). Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. Journal of Affective Disorders, 172, 96–102. 10.1016/j.jad.2014.10.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Friedman J., Hastie T., Tibshirani R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33(1), 1–22. 10.18637/jss.v033.i01 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Garnefski N., Kraaij V., Spinhoven P. (2001). Negative life events, cognitive emotion regulation and emotional problems. Personality and Individual Differences, 30(8), 1311–1327. [Google Scholar]
  38. Gelenberg A. J., Freeman M. P., Markowitz J. C., Rosenbaum J. F., Thase M. E., Trivedi M. H., Van Rhoads R. S. (2010). American Psychiatric Association practice guidelines for the treatment of patients with major depressive disorder. American Journal of Psychiatry, 167(Suppl. 10), 9–118. [Google Scholar]
  39. Hamilton M. (1967). Development of a rating scale for primary depressive illness. British Journal of Social and Clinical Psychology, 6(4), 278–296. [DOI] [PubMed] [Google Scholar]
  40. Hastie T., Tibshirani R., Friedman J. (2009). The elements of statistical learning (2nd ed.). Springer. [Google Scholar]
  41. Hong S., Lynn H. S. (2020). Accuracy of random-forest-based imputation of missing data in the presence of non-normality, non-linearity, and interaction. BMC Medical Research Methodology, 20(1), Article 199. 10.1186/s12874-020-01080-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Iniesta R., Malki K., Maier W., Rietschel M., Mors O., Hauser J., Henigsberg N., Dernovsek M. Z., Souery D., Stahl D., Dobson R., Aitchison K. J., Farmer A., Lewis C. M., McGuffin P., Uher R. (2016). Combining clinical variables to optimize prediction of antidepressant treatment outcomes. Journal of Psychiatric Research, 78, 94–102. 10.1016/j.jpsychires.2016.03.016 [DOI] [PubMed] [Google Scholar]
  43. James G., Witten D., Hastie T., Tibshirani R. (2013). Linear model selection and regularization. In An introduction to statistical learning (pp. 203–264). Springer. [Google Scholar]
  44. Kessler R. C., Bromet E. J. (2013). The epidemiology of depression across cultures. Annual Review of Public Health, 34(1), 119–138. 10.1146/annurev-publhealth-031912-114409 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kessler R. C., van Loo H. M., Wardenaar K. J., Bossarte R. M., Brenner L. A., Ebert D. D., de Jonge P., Nierenberg A. A., Rosellini A. J., Sampson N. A., Schoevers R. A., Wilcox M. A., Zaslavsky A. M. (2017). Using patient self-reports to study heterogeneity of treatment effects in major depressive disorder. Epidemiology and Psychiatric Sciences, 26(1), 22–36. 10.1017/S2045796016000020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kim T. T., Dufour S., Xu C., Cohen Z. D., Sylvia L., Deckersbach T., DeRubeis R. J., Nierenberg A. A. (2019). Predictive modeling for response to lithium and quetiapine in bipolar disorder. Bipolar Disorders, 21(5), 428–436. 10.1111/bdi.12752 [DOI] [PubMed] [Google Scholar]
  47. Kinrys G., Gold A. K., Pisano V. D., Freeman M. P., Papakostas G. I., Mischoulon D., Nierenberg A. A., Fava M. (2019). Tachyphylaxis in major depressive disorder: A review of the current state of research. Journal of Affective Disorders, 245, 488–497. 10.1016/j.jad.2018.10.357 [DOI] [PubMed] [Google Scholar]
  48. Kuyken W., Byford S., Byng R., Dalgleish T., Lewis G., Taylor R., Watkins E. R., Hayes R., Lanham P., Kessler D., Morant N., Evans A. (2010). Study protocol for a randomized controlled trial comparing mindfulness-based cognitive therapy with maintenance anti-depressant treatment in the prevention of depressive relapse/recurrence: The PREVENT trial. Trials, 11, Article 99. 10.1186/1745-6215-11-99 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kuyken W., Byford S., Byng R., Dalgleish T., Lewis G., Taylor R., Watkins E. R., Hayes R., Lanham P., Kessler D., Morant N., Evans A. (2014). Update to the study protocol for a randomized controlled trial comparing mindfulness-based cognitive therapy with maintenance anti-depressant treatment depressive relapse/recurrence: The PREVENT trial. Trials, 15, Article 217. 10.1186/1745-6215-15-217 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kuyken W., Hayes R., Barrett B., Byng R., Dalgleish T., Kessler D., Lewis G., Watkins E., Brejcha C., Cardy J., Causley A., Cowderoy S., Evans A., Gradinger F., Kaur S., Lanham P., Morant N., Richards J., Shah P., . . .Byford S. (2015. a). Effectiveness and cost-effectiveness of mindfulness-based cognitive therapy compared with maintenance antidepressant treatment in the prevention of depressive relapse or recurrence (PREVENT): A randomised controlled trial. Lancet, 386(9988), 63–73. 10.1016/S0140-6736(14)62222-4 [DOI] [PubMed] [Google Scholar]
  51. Kuyken W., Hayes R., Barrett B., Byng R., Dalgleish T., Kessler D., Lewis G., Watkins E., Morant N., Taylor R. S., Byford S. (2015. b). The effectiveness and cost-effectiveness of mindfulness-based cognitive therapy compared with maintenance antidepressant treatment in the prevention of depressive relapse/recurrence: Results of a randomised controlled trial (the PREVENT study). Health Technology Assessment, 19(73), 1–124. 10.3310/hta19730 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Kuyken W., Warren F. C., Taylor R. S., Whalley B., Crane C., Bondolfi G., Hayes R., Huijbers M., Ma H., Schweizer S., Segal Z., Speckens A., Teasdale J. D., van Heeringen K., Williams M., Byford S., Byng R., Dalgleish T. (2016). Efficacy of mindfulness-based cognitive therapy in prevention of depressive relapse: An individual patient data meta-analysis from randomized trials. JAMA Psychiatry, 73(6), 565–574. 10.1001/jamapsychiatry.2016.0076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kuyken W., Watkins E. R., Holden E., White K., Taylor R. S., Byford S., Evans A., Radford S., Teasdale J. D., Dalgleish T. (2010). How does mindfulness-based cognitive therapy work? Behaviour Research and Therapy, 48(11), 1105–1112. 10.1016/j.brat.2010.08.003 [DOI] [PubMed] [Google Scholar]
  54. Kwan B. M., Dimidjian S., Rizvi S. L. (2010). Treatment preference, engagement, and clinical improvement in pharmacotherapy versus psychotherapy for depression. Behaviour Research and Therapy, 48(8), 799–804. 10.1016/j.brat.2010.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Lorenzo-Luaces L., DeRubeis R. J., van Straten A., Tiemens B. (2017). A prognostic index (PI) as a moderator of outcomes in the treatment of depression: A proof of concept combining multiple variables to inform risk-stratified stepped care models. Journal of Affective Disorders, 213, 78–85. 10.1016/j.jad.2017.02.010 [DOI] [PubMed] [Google Scholar]
  56. Luedtke A., Sadikova E., Kessler R. C. (2019). Sample size requirements for multivariate models to predict between-patient differences in best treatments of major depressive disorder. Clinical Psychological Science, 7(3), 445–461. 10.1177/2167702618815466 [DOI] [Google Scholar]
  57. Lutz W., Deisenhofer A.-K., Rubel J., Bennemann B., Giesemann J., Poster K., Schwartz B. (2022). Prospective evaluation of a clinical decision support system in psychological therapy. Journal of Consulting and Clinical Psychology, 90(1), 90–106. 10.1037/ccp0000642 [DOI] [PubMed] [Google Scholar]
  58. Ma S. H., Teasdale J. D. (2004). Mindfulness-based cognitive therapy for depression: Replication and exploration of differential relapse prevention effects. Journal of Consulting and Clinical Psychology, 72(1), 31–40. 10.1037/0022-006X.72.1.31 [DOI] [PubMed] [Google Scholar]
  59. McCartney M., Nevitt S., Lloyd A., Hill R., White R., Duarte R. (2021). Mindfulness-based cognitive therapy for prevention and time to depressive relapse: Systematic review and network meta-analysis. Acta Psychiatrica Scandinavica, 143(1), 6–21. 10.1111/acps.13242 [DOI] [PubMed] [Google Scholar]
  60. McCrea R. L., Sammon C. J., Nazareth I., Petersen I. (2016). Initiation and duration of selective serotonin reuptake inhibitor prescribing over time: UK cohort study. British Journal of Psychiatry, 209(5), 421–426. 10.1192/bjp.bp.115.166975 [DOI] [PubMed] [Google Scholar]
  61. McHugh R. K., Whitton S. W., Peckham A. D., Welge J. A., Otto M. W. (2013). Patient preference for psychological vs. pharmacological treatment of psychiatric disorders: A meta-analytic review. Journal of Clinical Psychiatry, 74(6), 595–602. 10.4088/JCP.12r07757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. McNamara M. E., Shumake J., Stewart R. A., Labrada J., Alario A., Allen J. J. B., Palmer R., Schnyer D. M., McGeary J. E., Beevers C. G. (2021). Multifactorial prediction of depression diagnosis and symptom dimensions. Psychiatry Research, 298, Article 113805. 10.1016/j.psychres.2021.113805 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Mojtabai R., Olfson M. (2014). National trends in long-term use of antidepressant medications: Results from the U.S. National Health and Nutrition Examination Survey. Journal of Clinical Psychiatry, 75(2), 169–177. 10.4088/JCP.13m08443 [DOI] [PubMed] [Google Scholar]
  64. Moriarty A. S., Meader N., Snell K. I., Riley R. D., Paton L. W., Chew-Graham C. A., Gilbody S., Churchill R., Phillips R. S., Ali S., McMillan D. (2021). Prognostic models for predicting relapse or recurrence of major depressive disorder in adults. Cochrane Database of Systematic Reviews, 2021(5), Article CD013491. 10.1002/14651858.CD013491.pub2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. National Institute for Health and Care Excellence. (2009). Depression—Guidance and guidelines—NICE. https://www.nice.org.uk/guidance/cg90/chapter/1-guidance
  66. Neff K. D. (2003). The development and validation of a scale to measure self-compassion. Self and Identity, 2(3), 223–250. 10.1080/15298860309027 [DOI] [Google Scholar]
  67. Organisation for Economic Co-operation and Development. (2013). Health at a glance 2013: OECD indicators. 10.1787/health_glance-2013-en [DOI]
  68. Pahlevan T., Ung C., Segal Z. (2020). Cost-utility analysis of mindfulness-based cognitive therapy versus antidepressant pharmacotherapy for prevention of depressive relapse in a Canadian context: Analyse coût-utilité de la thérapie cognitive basée sur la pleine conscience contre la pharmacothérapie antidépressive pour prévenir la rechute de la dépression en contexte canadien. Canadian Journal of Psychiatry/Revue Canadienne De Psychiatrie, 65(8), 568–576. 10.1177/0706743720904613 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Parker G., Roussos J., Hadzi-Pavlovic D., Mitchell P., Wilhelm K., Austin M. P. (1997). The development of a refined measure of dysfunctional parenting and assessment of its relevance in patients with affective disorders. Psychological Medicine, 27(5), 1193–1203. [DOI] [PubMed] [Google Scholar]
  70. Patel V., Chisholm D., Parikh R., Charlson F. J., Degenhardt L., Dua T., Ferrari A. J., Hyman S., Laxminarayan R., Levin C., Lund C., Medina Mora M. E., Petersen I., Scott J., Shidhaye R., Vijayakumar L., Thornicroft G., Whiteford H. (2016). Addressing the burden of mental, neurological, and substance use disorders: Key messages from Disease Control Priorities, 3rd edition. Lancet, 387(10028), 1672–1685. 10.1016/S0140-6736(15)00390-6 [DOI] [PubMed] [Google Scholar]
  71. Patel V., Saxena S., Lund C., Thornicroft G., Baingana F., Bolton P., Chisholm D., Collins P. Y., Cooper J. L., Eaton J., Herrman H., Herzallah M. M., Huang Y., Jordans M. J. D., Kleinman A., Medina-Mora M. E., Morgan E., Niaz U., Omigbodun O., . . . Unützer J. (2018). The Lancet Commission on global mental health and sustainable development. The Lancet, 392(10157), 1553–1598. 10.1016/S0140-6736(18)31612-X [DOI] [PubMed] [Google Scholar]
  72. Pearson R., Pisner D., Meyer B., Shumake J., Beevers C. G. (2018). A machine learning ensemble to predict treatment outcomes following an Internet intervention for depression. Psychological Medicine, 49(14), 2330–2341. https://doi.org/https://doi.org/10.1017/S003329171800315X [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Perlis R. H. (2013). A clinical risk stratification tool for predicting treatment resistance in major depressive disorder. Biological Psychiatry, 74(1), 7–14. 10.1016/j.biopsych.2012.12.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Piek E., van der Meer K., Nolen W. A. (2010). Guideline recommendations for long-term treatment of depression with antidepressants in primary care—A critical review. The European Journal of General Practice, 16(2), 106–112. 10.3109/13814781003692463 [DOI] [PubMed] [Google Scholar]
  75. R Core Team. (2018). R: A language and environment for statistical computing (Version 3.5.1) [Computer software]. R Foundation for Statistical Computing. http://www.R-project.org [Google Scholar]
  76. Riley R. D., Snell K. I. E., Martin G. P., Whittle R., Archer L., Sperrin M., Collins G. S. (2021). Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small. Journal of Clinical Epidemiology, 132, 88–96. 10.1016/j.jclinepi.2020.12.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Salazar de Pablo G., Studerus E., Vaquerizo-Serrano J., Irving J., Catalan A., Oliver D., Baldwin H., Danese A., Fazel S., Steyerberg E. W., Stahl D., Fusar-Poli P. (2021). Implementing precision psychiatry: A systematic review of individualized prediction models for clinical practice. Schizophrenia Bulletin, 47(2), 284–297. 10.1093/schbul/sbaa120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Schwarzer R., Jerusalem M. (1995). Generalized self-efficacy scale. In Weinman J., Wright S., Johnston M. (Eds.), Measures in health psychology: A user’s portfolio (pp. 35–37). NFER-Nelson. [Google Scholar]
  79. Schweizer S., Peeters F., Huibers M., Roelofs J., van Os J., Arntz A. (2010). Does illness attribution affect treatment assignment in depression? Clinical Psychology & Psychotherapy, 17(5), 418–426. [DOI] [PubMed] [Google Scholar]
  80. Shah A. D., Bartlett J. W., Carpenter J., Nicholas O., Hemingway H. (2014). Comparison of random forest and parametric imputation models for imputing missing data using MICE: A CALIBER study. American Journal of Epidemiology, 179(6), 764–774. 10.1093/aje/kwt312 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Shay L. A., Lafata J. E. (2015). Where is the evidence? A systematic review of shared decision making and patient outcomes. Medical Decision Making, 35(1), 114–131. 10.1177/0272989X14551638 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Shiota M. N., Keltner D., John O. P. (2006). Positive emotion dispositions differentially associated with Big Five personality and attachment style. Journal of Positive Psychology, 1(2), 61–71. 10.1080/17439760500510833 [DOI] [Google Scholar]
  83. Shumake J. (2022). Jashu/beset (Version 0.0.0.9409) [Computer software]. https://github.com/jashu/beset
  84. Stacey D., Legare F., Lewis K., Barry M. J., Bennett C. L., Eden K. B., Holmes-Rovner M., Llewellyn-Thomas H., Lyddiatt A., Thomson R., Trevena L. (2017). Decision aids for people facing health treatment or screening decisions. Cochrane Database of Systematic Reviews. 10.1002/14651858.CD001431.pub5 [DOI] [PMC free article] [PubMed]
  85. Stekhoven D. J., Bühlmann P. (2012). MissForest—Non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112–118. 10.1093/bioinformatics/btr597 [DOI] [PubMed] [Google Scholar]
  86. Steyerberg E. W., Vickers A. J., Cook N. R., Gerds T., Gonen M., Obuchowski N., Pencina M. J., Kattan M. W. (2010). Assessing the performance of prediction models: A framework for traditional and novel measures. Epidemiology, 21(1), 128–138. 10.1097/EDE.0b013e3181c30fb2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Van Calster B., McLernon D. J., van Smeden M., Wynants L., Steyerberg E. W., on Behalf of Topic Group ‘Evaluating diagnostic tests and prediction models’ of the STRATOS Initiative. (2019). Calibration: The Achilles heel of predictive analytics. BMC Medicine, 17(1), Article 230. 10.1186/s12916-019-1466-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Waldmann P., Mészáros G., Gredler B., Fuerst C., Sölkner J. (2013). Evaluation of the lasso and the elastic net in genome-wide association studies. Frontiers in Genetics, 4, Article 270. 10.3389/fgene .2013.00270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Waljee A. K., Mukherjee A., Singal A. G., Zhang Y., Warren J., Balis U., Marrero J., Zhu J., Higgins P. D. (2013). Comparison of imputation methods for missing laboratory data in medicine. BMJ Open, 3(8), Article e002847. 10.1136/bmjopen-2013-002847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Webb C. A., Cohen Z. D., Beard C., Forgeard M., Peckham A. D., Björgvinsson T. (2020). Personalized prognostic prediction of treatment outcome for depressed patients in a naturalistic psychiatric hospital setting: A comparison of machine learning approaches. Journal of Consulting and Clinical Psychology, 88(1), 25–38. 10.1037/ccp0000451 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Weston W. W. (2001). Informed and shared decision-making: The crux of patient-centered care. Canadian Medical Association Journal, 165(4), 438–439. [PMC free article] [PubMed] [Google Scholar]
  92. Williams J. B. W., Kobak K. A., Bech P., Engelhardt N., Evans K., Lipsitz J., Olin J., Pearson J., Kalali A. (2008). The GRID-HAMD: Standardization of the Hamilton depression rating scale. International Clinical Psychopharmacology, 23(3), 120–129. [DOI] [PubMed] [Google Scholar]
  93. Wiltsey Stirman S., Cohen Z. D., Lunney C. A., DeRubeis R. J., Wiley J. F., Schnurr P. P. (2021). A personalized index to inform selection of a trauma-focused or non-trauma-focused treatment for PTSD. Behaviour Research and Therapy, 142, Article 103872. 10.1016/j.brat.2021.103872 [DOI] [PubMed] [Google Scholar]
  94. Windle E., Tee H., Sabitova A., Jovanovic N., Priebe S., Carr C. (2019). Association of patient treatment preference with dropout and clinical outcomes in adult psychosocial mental health interventions: A systematic review and meta-analysis. JAMA Psychiatry, 77(3), 294–302. 10.1001/jamapsychiatry.2019.3750 [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Winston K., Grendarova P., Rabi D. (2018). Video-based patient decision aids: A scoping review. Patient Education and Counseling, 101(4), 558–578. 10.1016/j.pec.2017.10.009 [DOI] [PubMed] [Google Scholar]
  96. Zou H., Hastie T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society B: Statistical Methodology, 67(2), 301–320. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sj-pdf-1-cpx-10.1177_21677026221076832 – Supplemental material for The Development and Internal Evaluation of a Predictive Model to Identify for Whom Mindfulness-Based Cognitive Therapy Offers Superior Relapse Prevention for Recurrent Depression Versus Maintenance Antidepressant Medication

Supplemental material, sj-pdf-1-cpx-10.1177_21677026221076832 for The Development and Internal Evaluation of a Predictive Model to Identify for Whom Mindfulness-Based Cognitive Therapy Offers Superior Relapse Prevention for Recurrent Depression Versus Maintenance Antidepressant Medication by Zachary D. Cohen, Robert J. DeRubeis, Rachel Hayes, Edward R. Watkins, Glyn Lewis, Richard Byng, Sarah Byford, Catherine Crane, Willem Kuyken, Tim Dalgleish and Susanne Schweizer in Clinical Psychological Science


Articles from Clinical Psychological Science are provided here courtesy of SAGE Publications

RESOURCES