Abstract
Precision medicine methods (machine learning; ML) can identify which clients with generalized anxiety disorder (GAD) benefit from mindfulness ecological momentary intervention (MEMI) vs. self-monitoring app (SM). We used randomized controlled trial data of MEMI vs. SM for GAD (N = 110) and tested three ML models to predict one-month follow-up reliable improvement in GAD severity, perseverative cognitions (PC), trait mindfulness (TM), and executive function (EF). Eleven baseline predictors were tested regarding differential reliable change from MEMI vs. SM (age, sex, race, EF errors, inhibitory dyscontrol, set-shifting deficits, verbal fluency, working memory, GAD severity, TM, PC). The final top five prescriptive predictor models of all outcomes performed well (AUC = .752 .886). The following variables predicted better outcome from MEMI vs. SM: Higher GAD severity predicted more GAD improvement but less EF improvement. Elevated PC, inhibitory dyscontrol, and verbal dysfluency predicted better improvement in most outcomes. Greater set-shifting and TM predicted stronger improvements in GAD symptoms and TM. Older age predicted more alleviation of GAD and PC symptoms. Women exhibited more enhancements in trait mindfulness and EF than men. White individuals benefitted more than non-White. PC, TM, EF, and sociodemographic data might help predictive models optimize intervention selection for GAD.
Keywords: Generalized anxiety disorder, Digital mental health intervention, Machine learning, Mindfulness, Precision medicine
1. Introduction
Generalized anxiety disorder (GAD) is a mental disorder linked to compromised socio-occupational functioning, physical health, and quality of life (Newman et al., 2017; Zainal & Newman, 2022b; Zhou et al., 2017). Fortunately, effective psychosocial interventions exist. These include face-to-face cognitive-behavioral therapies (CBT; Newman et al., 2020), mindfulness-based interventions (MBIs; Zainal & Newman, In Press), and guided and self-guided digital mental health interventions (DMHIs) that can treat GAD and comorbid symptoms (e.g., panic; Apolinário-Hagen et al., 2020). However, providing high-intensity face-to-face psychotherapy to all clients with GAD is infeasible in many resource-limited treatment settings where demand outstrips supply (Rebello et al., 2014). Self-guided DMHIs can aid dissemination when logistical barriers prevent seeking face-to-face psychotherapies, such as shame, stigma, and financial constraints (Goetter et al., 2020). Persons with GAD also reported high levels of acceptability toward DMHIs, such as mindfulness ecological momentary interventions (MEMIs) and self-monitoring apps (SMs) to manage symptoms alone (Newman et al., 2021). EMI represents a growing field of keen interest, holding substantial promise in the context of GAD and related mental health problems. A recent review evidenced that users generally found EMIs beneficial as long as disturbances, such as measurement burden, did not interfere with participants’ everyday routines (Dao et al., 2021). Thus, determining which clients with GAD differentially benefit from unique scalable self-guided DMHIs is essential.
Heterogeneity of treatment effects (i.e., the fact that more efficacious interventions at the group level may not be the most effective for a specific patient) is intrinsic to all psychotherapies (Kaiser et al., 2022), including low-intensity self-guided DMHIs. For example, evidence suggested that for a particular subgroup, SM alone helped to alleviate anxiety, depression, and related symptoms (Gruszka et al., 2019). A meta-analysis also indicated that MEMIs, more than other DMHIs, were suitable for depression and anxiety during pregnancy in patients with specific symptoms and demographic profiles (Silang et al., 2022). However, most DMHI randomized controlled trials (RCTs) thus far solely examined aggregate treatment effects for GAD and related disorders (Carl et al., 2020b). Doing so precludes understanding heterogeneity of treatment effects (Goldberg et al., 2022). Thus, from a resource-management perspective, evaluating which DMHI yields the most optimal treatment effect for which specific client with GAD would be profitable for patients, clinicians, and public health administrators.
Additionally, there remains room for improvement when determining what specific DMHIs work best for which clients with GAD. Previous studies tended to use approaches based on classical ordinary least squares regression (OLS; Faaland et al., 2022). Traditional OLS is suboptimal as it necessitates manual specification of non-linearity and interactions and can lead to underfitting of the data (Wallert et al., 2022). Machine learning (ML) may facilitate the building of precise predictive models. ML comprises data-driven methods that empower computer algorithms to determine and iteratively improve the best parameters to fit complex predictor patterns (Jordan & Mitchell, 2015). Despite compromising model interpretability to some extent, fitting flexible nonlinear and higher-order interaction ML algorithms instead can enhance model derivation and performance and optimize model prediction (Pearson et al., 2019). Further, methods exist to determine significant treatment predictors and delineate or characterize how each predictor correlates with the outcome (Archer & Kimes, 2008). By managing collinearity and carefully regulating overfitting, the accuracy of ML algorithms can be evaluated (Christodoulou et al., 2019).
Harnessing ML for prediction using DMHI datasets is a nascent yet rapidly growing endeavor in clinical science (Jacobson & Nemesure, 2021). Combining ML with DMHIs could be valuable since self-guided DMHIs are easily implementable and have promising preliminary results. However, previous DMHI studies that used ML to predict intervention outcomes for adults with anxiety, depression, and obsessive-compulsive and related disorders (e.g., Hornstein et al., 2021; Lenhard et al., 2018; Pearson et al., 2019) had the shortcoming of testing a restricted set of possible ML models. Also, the only digitally delivered MBI that used ML to date (Lekkas et al., 2021) focused narrowly on compliance features as predictors of post-treatment stress reduction. Compliance features are not optimal because they rely on information collected during treatment as opposed to using information collected at baseline that may inform immediate prescriptive treatment assignment. Testing a broader predictor set comprising baseline symptom severity, demographic variables, and theory-based predictors with ML models (Canby et al., 2021; Elhai & Montag, 2020) can help to optimally determine which DMHI app works best for which person with GAD.
Drawing from the compensation model (Cheavens et al., 2012), treatment efficacy is contingent upon effectively addressing the specific deficiencies within patients’ disorder-pertinent vulnerabilities. Individuals with elevated perseverative cognitions might experience more gains in utilizing MEMIs than SMs, given deficits in understanding and applying mindfulness techniques. Higher trait perseverative cognition could afford individuals with GAD an expanded opportunity to cultivate mindfulness skills over time. Consequently, this could increase their chances of experiencing symptom relief through MEMIs more than SMs (Spinhoven et al., 2018). Such distinct effects would probably arise since individuals grappling with excessive worry and self-focused repetitive thinking tend to derive the greatest advantage from breaking free from those unproductive habits (Hallion et al., 2022). They stand to gain by redirecting their attention toward the present task and fostering a sense of personal agency (Gallagher et al., 2014) via mindfulness exercises. These patterns may manifest as individuals with GAD possess greater potential to nurture present-focused awareness and positive emotions through mindfulness practices (Perestelo-Perez et al., 2017).
Mindfulness theories propose that although sustaining mindfulness can be challenging, staying mindful by optimally using cognitive resources can foster more robust insight, cognitive flexibility, and self-regulation abilities (Hofmann & Gomez, 2017; Spinhoven et al., 2022). Lower levels of trait mindfulness could prevent participants from reaping optimal benefits from a MEMI vs. an SM. More robust levels of mindfulness traits likely promote a better response to a MEMI than an SM because they help to encourage prefrontal cortex-mediated regulation and reduce the inclination to experience or avoid negative emotions (Mizera et al., 2015; Spinhoven et al., 2017). Collectively, heightened trait mindfulness could facilitate being optimized by MEMI vs. SM to improve global trait mindfulness (cf. capitalization model; Murphy et al., 2021).
In addition, the self-regulatory executive function (S-REF; Matthews & Wells, 2004) model and cognitive model of pathological worry (Hirsch & Mathews, 2012) can be extended to suggest that weaker executive function (EF) skills may hamper assistance provided by MEMI vs. SM. EF refers to higher-order cognitive control skills that govern thought and action repertoires (Keller et al., 2023). Empirical data showed that EF deficits correlated consistently with attentional dyscontrol, metacognitive beliefs (e.g., thoughts of the “helpfulness” of rumination), and maladaptive coping across time (Hoffart et al., 2022). Therefore, inhibition (abstaining from autopilot responses) and working memory (WM; mentally attending to and altering information in real-time) problems might contribute to worse responses to MEMI than SM. Issues with set-shifting (fluently switching between distinct cognitive modes) and verbal fluency (producing apt thematic words under a time limit; Renna et al., 2018; Zainal & Newman, 2022a) could likewise prevent individuals from strengthening psychological flexibility and related processes that could enhance emotion regulation during treatment.
Accordingly, the goal of the present study was to use various ML algorithms to evaluate which DMHI app (i.e., MEMI vs. SM) would work best for which person with GAD and which theory-based predictors predicted a better outcome with the MEMI vs. SM. Our study offers novel contributions in several ways. First, inquiring into which GAD sufferers benefitted from a specific DMHI has been a neglected research question (McDevitt-Murphy et al., 2018). Relatedly, by determining prescriptive predictors that informed optimal intervention assignment across different DMHIs, we extended the precision psychiatry literature that primarily examined prognostic predictors within and between specific face-to-face psychotherapies (Aafjes-van Doorn et al., 2021). Prescriptive predictors refer to variables that contribute to heterogeneity of treatment effects (Wester et al., 2022). Second, our mostly self-report pre-treatment predictor set could be practical to implement in routine care settings. Our method is an alternative approach to most prior ML-based treatment predictor studies intended to build clinical decision-making support tools with biomarker or neuroimaging data (e. g., Tymofiyeva et al., 2019). Although the potential of biomarker and neuroimaging techniques is undeniable, their practical application in outpatient and hospital environments is constrained by limited availability and high expense.
Our study was a secondary analysis of a DMHI RCT for GAD that primarily evaluated efficacy using an intention-to-treat approach (Zainal & Newman, 2023). The primary RCT findings most relevant for the current study were that the MEMI significantly outperformed the SM in reducing pre-treatment to one-month follow-up (pre-1MFU) GAD severity, perseverative cognitions (Cohen’s |d| = 0.393–0.394), trait mindfulness (d = 0.303), and EF (|d| = 0.280) measured dimensionally. Herein, we evaluated the following pre-treatment variables as potential prescriptive predictors: EF, GAD severity, trait mindfulness, perseverative cognitions, and sociodemographic variables. Building on the primary RCT that determined differential treatment efficacy on continuous outcomes, we examined how these variables would function as prescriptive predictors of pre-1MFU clinically reliable improvement (a categorical outcome; Blampied, 2022) in GAD severity, trait perseverative cognition, mindfulness, and EF. First, we expected that the MEMI but not SM would significantly yield pre-1MFU clinically reliable improvement in GAD severity, global perseverative cognition, trait mindfulness, and EF. Second, we hypothesized that ML models would produce moderate-to-high-accuracy predictive models of prescriptive predictors (Hypothesis 2). Lastly, we tested the prediction that higher pre-treatment GAD severity, trait perseverative cognition, mindfulness, and EF would predict more reliable change in all of these pre-1MFU outcomes for the MEMI than SM (Hypothesis 3).
2. Method
2.1. Participants
The present RCT was preregistered (NCT04846777 on ClinicalTrials.gov). Table 1 summarizes sociodemographic and clinical descriptive statistics. Participants (N = 110) averaged 20.80 years of age (SD = 5.41, range = 18–52). Further, 86.67% were women, 13.63% were men, and one person declined to disclose their gender identity. Also, 64.55% identified as White, and the rest as African American (5.45%), American/Pacific Islander (1.82%), Asian or Asian American (13.63%), Hispanic (7.27%), Native American/Pacific Islander (1.82%), another race (5.45%), or declined to disclose (0.91%). Although 16% attained a college or postgraduate degree, the remaining 84% of participants achieved a high school education.
Table 1.
Comparison of sociodemographic and clinical variables across intervention arms
| MA (n = 68) | SM (n = 42) | p | |||
|---|---|---|---|---|---|
| Continuous variables | M | (SD) | M | (SD) | |
| Age (in years) | 20.53 | (3.91) | 21.24 | (7.24) | .51 |
| 14-item GAD-Q-IV score | 9.52 | (2.10) | 9.94 | (1.96) | .30 |
| Perseverative cognitions | |||||
| Total score | 17.07 | (4.36) | 18.02 | (4.27) | .26 |
| Loss of controllability | 2.20 | (0.89) | 2.41 | (1.05) | .26 |
| Preparing for the future | 3.32 | (1.04) | 3.52 | (1.03) | .32 |
| Expecting the worst | 2.78 | (1.32) | 2.88 | (1.07) | .66 |
| Searching for causes and meaning | 3.14 | (1.27) | 3.48 | (1.24) | .16 |
| Dwelling on the past | 3.80 | (0.87) | 3.84 | (0.84) | .78 |
| Thoughts discordant with ideal self | 1.84 | (1.00) | 1.88 | (0.98) | .84 |
| Trait mindfulness | |||||
| Total score | 14.25 | (2.30) | 14.11 | (2.11) | .75 |
| Observing | 3.22 | (0.67) | 3.28 | (0.65) | .63 |
| Describing | 2.70 | (0.80) | 2.56 | (0.82) | .39 |
| Awareness | 2.75 | (0.68) | 2.66 | (0.69) | .49 |
| Nonjudgment | 3.00 | (0.89) | 3.01 | (0.87) | .94 |
| Nonreactivity of inner experiences | 2.57 | (0.62) | 2.59 | (0.61) | .88 |
| Executive functioning | |||||
| Inhibitory dyscontrol (s) | 43.50 | (9.64) | 43.87 | (7.37) | .83 |
| Set-shifting deficits (s) | 49.59 | (12.61) | 51.02 | (11.42) | .55 |
| Verbal fluency | 112.87 | (18.59) | 115.14 | (22.99) | .57 |
| Treatment expectations | |||||
| Credibility | 6.00 | (1.39) | 5.72 | (1.58) | .34 |
| Expectancy | 43.46 | (17.33) | 44.29 | (18.13) | .31 |
| Categorical variables | n | (%) | n | (%) | |
| Gender orientation | .85 | ||||
| Women | 10 | (14.71) | 5 | (11.90) | |
| Men | 57 | (83.82) | 37 | (88.10) | |
| Declined to disclose | 1 | (1.47) | – | – | |
| Race | .99 | ||||
| White Caucasian | 44 | (64.71) | 27 | (64.29) | |
| Asian or Asian American | 11 | (16.18) | 4 | (9.52) | |
| Hispanic | 3 | (4.41) | 5 | (11.91) | |
| African American | 5 | (7.35) | 1 | (2.38) | |
| Another race | 4 | (5.88) | 2 | (4.76) | |
| Declined to disclose | 1 | (1.47) | 0 | (0.00) | |
Note. GAD-Q-IV = Generalized Anxiety Disorder Questionnaire–Fourth version; MA = mindfulness app; SM = self-monitoring app. Univariate linear regressions were used for continuous variables. Univariate logistic regressions were used for categorical variables.
Participants diagnosed with GAD based on the Anxiety and Related Disorders Interview Schedule for DSM-5 (ADIS-5; American Psychiatric Association, 2013; Brown & Barlow, 2014) were randomized to MEMI (n = 68) vs. SM (n = 42). Randomization was carried out with the Microsoft Excel randomization function programmed into Qualtrics with the insertion of the appropriate treatment video played toward the end of the baseline visit after completing all pre-treatment assessments. Intervention arms were assessor-blinded, i.e., the intervention assignment was concealed from assessors at all study visits. During the baseline visit, the experimenter left the physical room (pre-COVID-19) or asked participants to switch off/mute their Zoom audio/video before they clicked on the Qualtrics link to play the appropriate intervention arm video (during COVID-19). Table 1 shows no statistically significant differences in age, race, GAD severity, trait perseverative cognition, mindfulness, EF, and treatment expectation between intervention arms at baseline (p values =.26 to.95). These variables and treatment condition also did not significantly predict attrition (p = .10 to .99). This non-significant difference indicated the success of the randomization and class balance of baseline scores of potential prescriptive predictors between arms. Fig. 1 illustrates the CONSORT flowchart depicting participant enrollment and advancement.
Figure 1.

CONSORT diagram
Note. CONSORT = Consolidated Standards of Reporting Trials.
2.2. Measures
2.2.1. Intervention outcome
We evaluated reliable change from pre-1MFU (Jacobson & Truax, 1991; McGlinchey et al., 2002) in the following measures: (1) GAD severity using the Generalized Anxiety Disorder Questionnaire–Fourth version (GAD-Q-IV; Newman et al., 2002) total score; (2) trait perseverative cognition using the total scores of the Perseverative Cognitions Questionnaire (PCQ; Szkodny & Newman, 2019); (3) trait mindfulness using the total scores of the Five Factor Mindfulness Questionnaire (FFMQ; Baer, 2006); and (4) EF composite using average standardized scores of inhibitory dyscontrol (Stroop, 1935), set-shifting (Delis et al., 2001), verbal fluency (Borkowski et al., 1967), WM (Wechsler, 2008), and EF errors on inhibition, set-shifting, and verbal fluency tests detailed in the next paragraph. In addition, the domains/facets, example items, and psychometric properties of the outcome and potential predictor measures are detailed in Appendix A of the online supplemental materials (OSM). The current study used binary outcomes (i.e., reliable improvement) on these measures to replicate and extend the primary analyses that examined continuous outcomes (Zainal & Newman, 2023) and to generate area under the receiver operating characteristic curve (AUC) and related metrics when evaluating ML model performance. Further, reliable improvement is a more stringent criterion than dimensional symptom severity that is essential to examine as the outcome to understand predictors of clinically significant change.
2.2.2. Potential prescriptive predictors
All potential prescriptive predictors were variables measured at pre-treatment. The 14-item GAD-Q-IV (Newman et al., 2002) measured GAD severity. The 45-item PCQ (Szkodny & Newman, 2019) assessed six factors of repetitive negative thinking (dwelling on the past, expecting the worst, lack of controllability, thoughts discrepant with the ideal self, preparing for the future, and searching for causes and meanings). The 39-item FFMQ (Goldberg et al., 2016) measured five trait-level mindfulness domains (attention to inner and outer events, non-judgment, non-reactivity to inner experiences, describing/labeling of internal events, and acting with awareness). The color-word interference test from the Delis-Kaplan Executive Functioning System (Delis et al., 2001; Stroop, 1935) measured inhibition and set-shifting EF domains. The composite score of the digit span (DS) forward, DS backward, DS sequencing, and letter-number sequencing (Egeland, 2015) on the Wechsler Adult Intelligence Scale–Fourth Edition (Wechsler, 2008) indexed WM. The Controlled Oral Word Association Test (Borkowski et al., 1967; Delis et al., 2001) measured verbal fluency ability. We also included demographic variables (i.e., age, gender, and race/ethnicity) as possible important prescriptive predictors.
2.3. Procedures
Participants who met GAD criteria on the ADIS-5 (Brown & Barlow, 2014) semi-structured interview received the MEMI or SM at baseline. As detailed in the primary study (Zainal & Newman, 2023), participants received prompts five times daily (about 9 am, noon, 3 pm, 6 pm, and 9 pm) for the next 14 days to engage in mindfulness or self-monitoring strategies depending on their condition (detailed below and in Appendix A). MEMI participants regularly practiced these mindfulness skills during the intervention: open monitoring, attending to small moments, slowed, rhythmic, diaphragmatic breathing, and non-judgmental acceptance. SM participants did not receive exposure to or learn any mindfulness techniques but completed instructions to notice their thoughts and how distressing they were (detailed below). These prompts directed participants to engage in brief, one-minute mindfulness or self-monitoring exercises tailored to their assigned group. Participants had the flexibility to customize these prompts within a two-hour time frame (e.g., 8 am to 10 am, 11 am to 1 pm, etc.) to accommodate their individual schedules to minimize disruption and user burden. These adjustments were generally consistent among all participants. Similar to prior EMI research (e.g., Folkersma et al., 2021; Rauschenberg et al., 2021), no participants in our study spontaneously complained that the one-minute self-assessments and exercises disturbed their daily routine.
2.4. Mindfulness ecological momentary intervention (MEMI)
In the MEMI, participants were presented with a standardized video explaining the principles of evidence-based MBI protocols (Kabat-Zinn, 1990). MEMI participants were introduced to mindfulness to empower chronic worriers with the skills of open monitoring and attending to subtle moments. Participants also learned slow, rhythmic, diaphragmatic breathing. Subsequently, participants learned the practice of non-judgmental acceptance, mindful observing, and non-reactivity, paralleling components in mindfulness-based cognitive therapy (MBCT; Segal et al., 2002). Each MEMI participant received positive reinforcement on the significance and advantages of cultivating a regular mindfulness practice.
MEMI encouraged participants to engage in mindfulness exercises five times/day for 14 days. MEMI participants were given the following directives at each prompt: “Pay attention to your breathing. Breathe in a slow, steady, and rhythmic manner. Stay focused on the sensations of the air coming into your lungs and then letting it out. As you’re breathing, observe your experience as it is. Let go of judgments that do not serve you. Focus on the here and now. Attend to the small moments right now (e.g., reading a chapter, having a cool glass of water), as that is where enjoyment, peace, and serenity in life happen.” Participants self-assessed their levels of anxiety (“To what degree do you feel keyed up or on edge right now?”), depression (“To what degree do you feel depressed right now?”), and mindfulness (“To what extent are you experiencing the present moment fully?”) on a 9-point Likert scale, ranging from 1 (Not At All) to 9 (Extremely) before and following the reception of these prompts. Every MEMI prompt concluded with motivation to instill these abilities in the long run: “Remember that the cultivation of mindfulness is lifelong. The goal of therapy is to be your own therapist. Practice mindfulness between the prompts and after you have completed this study.”.
2.5. Self-monitoring app (SM)
In the SM, participants watched a video that defined self-monitoring as the heightened awareness of one’s feelings and thoughts. They received the message that simply monitoring feelings and thoughts and noting associated distress can support more constructive thinking patterns, and self-monitoring, by itself, might alleviate anxiety symptoms. The SM rationale drew inspiration from the approach employed in a recent, concise ecological momentary intervention (LaFreniere & Newman, 2016). Its development aimed to mirror the MEMI while removing hypothesized active mindfulness components, such as acceptance, attending to small moments, diaphragmatic breathing retraining, ongoing mindfulness exercises, and open monitoring. It hence did not refer to mindfulness. It did not guide participants to heighten their present-moment awareness; instead, it centered on monitoring their feelings and thoughts. Participants were not instructed to concentrate solely on their current activities, which could influence their emotional states. Whereas SM participants were directed to observe their feelings and thoughts, there was no guidance on accepting these thoughts and feelings as they emerged. It purposefully offered no advice on diaphragmatic breathing retraining. Its function did not involve inducing the calming sensations associated with diaphragmatic breathing. SM participants were not instructed to engage in self-monitoring between and after the prompts. The SM control differed from the notion that continual practice of mindfulness is meant for lifelong cultivation.
Rather than receiving the more extensive messages for continuous mindfulness practice, SM participants were given the following concise directive on five occasions daily: “Notice your thoughts and how distressing they may be.” Levels of state anxiety, depression, and mindfulness were assessed using identical 9-point Likert scale questions both before and after each SM prompt.
2.6. Data analyses
The Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD; Moons et al., 2015) guidelines were followed in the current study. All analyses were conducted with the nestedcv R package (Lewis et al., 2023). Multiple imputation with the predictive mean matching algorithm was used to impute missing values (< 10% of the data set; van Buuren, 2007). We evaluated three ML algorithms (logistic regression, support vector machine (SVM)– radial kernel, and random forest (RF)) with all stated predictor variables. The logistic regression algorithm using the logistic link function with maximum likelihood estimation was employed as a benchmark ML algorithm. SVM and RF have several benefits that make them appropriate for the current study. SVM tries to maximize generalizability (i.e., model performance accuracy), is suitable for data sets with a relatively high number of predictors, and is robust to anomalies (Dreiseitl et al., 2001). Likewise, the RF algorithm, an ensemble of decision trees with stopping rules (Cawley & Talbot, 2010), is efficient computationally at managing diverse potential predictors (Kern et al., 2019) and maximizes differentiation of significant predictors between classes (cf. more details in Appendix B and Loh, 2011). Moreover, the RF and SVM algorithms are appropriate for small sample sizes. Studies have shown that using class balancing (detailed below) enhances RF prediction with small sample sizes (Han et al., 2021) and a high-dimensional predictor set relative to the total number of observations (Qi, 2012). A simulation study also showed that small sample sizes (n = 50–100) with 10–30 potential predictors could attain high cross-validated accuracy estimates with SVM (e.g., ranging from 93–95%; Jiang et al., 2008). RF and SVM are collectively suitable and widely accepted for intervention outcome prediction purposes with comparatively small data sets (Kokol et al., 2022).
The present study focused on ML models that included prescriptive predictors of optimization to MEMI or SM. We identified prescriptive predictors of optimization based on good practice recommendations by simulation and empirical studies of logistic regression and other more complex ML models (Petkova et al., 2019), such as RF (Laber & Zhao, 2015) and SVM (Song et al., 2015; Zhao et al., 2012; refer to Appendices B to C). Continuous predictors were standardized to have a mean of 0 and standard deviation of 1. Categorical predictors (race, gender) were dummy-coded using one-hot encoding.
We tested all ML models using nested 10-fold cross-validation (10 F-CV), which minimizes biased estimates of the true error (Varma & Simon, 2006). Nested 10 F-CV divides the dataset into outer and inner folds (Cawley & Talbot, 2010), using the configuration of 10 outer and 10 inner folds. The inner fold cross-validation is employed to fine-tune optimal hyperparameters for the models. Subsequently, the model was trained on the entirety of the outer training fold and evaluated using the data that was withheld from the outer fold. This process was iterated for all outer folds, and the aggregated predictions from the unseen test data across the outer folds were compared with the actual outcomes for the outer test folds. We also employed nested leave-one-out cross-validation (LOOCV) to test each model. In this procedure, we trained the model using all participants except one, reserved as the testing dataset in the outer fold (Cristianini & Shawe-Taylor, 1999). LOOCV could offer a more impartial estimation, particularly in light of the limited sample size. Further, nested 10 F-CV (including CV and LOOCV) has benefits over split-sample validation, including decreasing probability of model overfitting (due to dependence on a single or arbitrary sample split) and bigger sample size (due to using all observations as both training and test cases; James et al., 2013). All ML analyses were adapted from published online tutorials (e.g., https://tinyurl.com/nestedcv; https://tinyurl.com/shapxai).
The AUC was the decision threshold to evaluate classification performance. Whereas AUC =− .5 indicates discrimination between groups at no better than chance, AUC ≥ .5 reflects adequate classification (i.e., minimizes the false positive rate and maximizes the false negative rate). Interpreting the AUC requires considering the question of interest. We used conventional heuristics to facilitate our interpretation: AUC = .50 means zero discrimination, .70–.79 indicates adequate discrimination, and AUC ≥ .80 denotes good discrimination. These guidelines also applied to two other decision thresholds: accuracy (proportion of correctly classified cases) and balanced accuracy (BAC; average of the sensitivity [model’s ability to identify a case] and specificity [model’s ability to identify a non-case]; Swets, 1988). Their values ranged from 0 to 1, wherein higher values reflected better performance.
Class imbalances in the outcome variable are essential to take into account when predicting binary outcomes. A high level of class imbalance may bias ML outcomes by weighting them toward the majority class (i.e., higher frequency outcome). As there was a class imbalance in the intervention outcomes, we applied the synthetic minority over-sampling technique (SMOTE; Chawla et al., 2002). The SMOTE approach undersamples the high-frequency class and oversamples the low-frequency class to generate a more balanced structure for the clinical outcomes, improving the prediction accuracy (Zulfiker et al., 2021).
Since the principal aim of ML is to optimize the performance of predictive models, identifying the subset of predictors that yields good classification performance may be beneficial. We originally had a total of 21 possible predictors that included the subfacets of mindfulness and perseverative cognition. However, given our sample size (N = 110) and established heuristics to have a minimum of 10 observations per candidate predictor (Jenkins & Quintana-Ascencio, 2020), we instead used the total scores on the mindfulness and perseverative cognition measures, which resulted in an initial 11 (age, sex, race, EF errors, inhibitory dyscontrol, set-shifting deficits, verbal fluency, WM, GAD severity, trait mindfulness, trait perseverative cognitions) theory-driven predictors, and a final model with the top five most important predictors. Model performance metrics were calculated for both models using a recursive feature elimination method within a nested CV framework (Kuhn & Johnson, 2018). Feature importance analyses were carried out to identify the essential prescriptive predictors ranked in descending order of their predictive importance (Lewis et al., 2023). We determined that all final predictors were stable, appearing in all 10 outer folds (Appendix C Figs. S1–4).
Last, we examined predictor importance based on the model-agnostic kernel Shapley Additive Explanations (SHAP) values to facilitate interpretability to visualize the associations between predictors and differential intervention outcomes (Lundberg et al., 2020). The SHAP approach estimated the effect of participant-level predicted probability of reliable improvement of modifying a single predictor from its observed value to the sample average across all logical permutations of other potential predictors. The average SHAP value across all participants is 0, but the average absolute SHAP value informs relative predictor importance. We generated SHAP-based bee swarm plots, a vital step because RF and SVM are considered “black box” models that do not provide a straightforward interpretation of the predictor coefficients. SHAP plots communicate the possibly complex relationship between each value of a predictor and the probability of optimization to MEMI (vs. SM) to predict each clinical outcome. We created SHAP-based bee swarm plots for the top five prescriptive predictors of each distinct outcome in each final multivariable model.
3. Results
3.1. Differential treatment effects on reliable improvement in pre-1MFU outcomes
The MEMI yielded substantially higher reliable improvement in GAD severity (n = 49, 72.06% vs. n = 24, 58.54%; χ2(df = 1) = 9.30, p = .002) and trait perseverative cognitions (n = 39, 57.4% vs. n = 11, 26.2%; χ2(df = 1) = 10.17, p = .001) from pre-treatment to 1MFU than the SM. However, the MEMI (n = 24, 35.5%) vs. SM (n = 10, 23.8%) yielded no significant differential treatment efficacy on reliable improvement in trait mindfulness (χ2(df = 1) = 1.60, p = .205). Likewise, MEMI (n = 25, 59.5%) vs. SM (n = 29, 42.6%) had no significant differential treatment efficacy on reliable improvement in EF (χ2(df = 1) = 2.96, p = .085). Therefore, we found inconsistent support for Hypothesis 1.
3.2. Prescriptive predictors of pre-1MFU reliable improvement in GAD severity
SVM LOOCV was the best-performing initial model (AUC = .817, accuracy = .800, BAC = .795; Table 2 Part A). The final model also performed well (AUC = .834, accuracy = .795, BAC = .775). Predictors of optimization to the MEMI were higher GAD severity (#1), higher trait perseverative cognition (#2), lower set-shifting deficits (#3), older age (#4), and stronger trait mindfulness (#5; Fig. 2).
Table 2.
Model performance of each predictive machine learning (ML) algorithm to predict optimization to the mindfulness app with the initial 11 predictors
| ML Model | AUC | Accuracy | Balanced accuracy | Sensitivity | Specificity |
|---|---|---|---|---|---|
| A. Outcome: Generalized anxiety disorder (GAD) remission | |||||
| Logistic regression - CV | 0.806 | 0.786 | 0.764 | 0.651 | 0.876 |
| Logistic regression - LOOCV | 0.816 | 0.772 | 0.746 | 0.616 | 0.876 |
| Random forest - CV | 0.797 | 0.786 | 0.781 | 0.756 | 0.806 |
| Random forest - LOOCV | 0.817 | 0.819 | 0.814 | 0.791 | 0.837 |
| Support vector machine - CV | 0.817 | 0.772 | 0.762 | 0.709 | 0.814 |
| Support vector machine - LOOCV | 0.817 | 0.800 | 0.795 | 0.767 | 0.822 |
| B. Outcome: Reliable improvement in trait perseverative cognitions | |||||
| Logistic regression - CV | 0.799 | 0.719 | 0.703 | 0.586 | 0.819 |
| Logistic regression - LOOCV | 0.808 | 0.719 | 0.707 | 0.621 | 0.793 |
| Random forest - CV | 0.816 | 0.744 | 0.741 | 0.724 | 0.759 |
| Random forest - LOOCV | 0.807 | 0.729 | 0.726 | 0.701 | 0.750 |
| Support vector machine - CV | 0.808 | 0.724 | 0.721 | 0.701 | 0.741 |
| Support vector machine - LOOCV | 0.803 | 0.729 | 0.727 | 0.713 | 0.741 |
| C. Outcome: Reliable improvement in trait mindfulness | |||||
| Logistic regression - CV | 0.736 | 0.644 | 0.633 | 0.750 | 0.517 |
| Logistic regression - LOOCV | 0.735 | 0.652 | 0.639 | 0.778 | 0.500 |
| Random forest - CV | 0.740 | 0.644 | 0.639 | 0.694 | 0.583 |
| Random forest - LOOCV | 0.738 | 0.629 | 0.624 | 0.681 | 0.567 |
| Support vector machine - CV | 0.752 | 0.674 | 0.671 | 0.708 | 0.633 |
| Support vector machine - LOOCV | 0.737 | 0.644 | 0.642 | 0.667 | 0.617 |
| D. Outcome: Reliable improvement in executive functioning composite | |||||
| Logistic regression - CV | 0.878 | 0.797 | 0.790 | 0.859 | 0.721 |
| Logistic regression - LOOCV | 0.877 | 0.797 | 0.791 | 0.847 | 0.735 |
| Random forest - CV | 0.880 | 0.830 | 0.826 | 0.859 | 0.794 |
| Random forest - LOOCV | 0.879 | 0.843 | 0.843 | 0.847 | 0.838 |
| Support vector machine - CV | 0.886 | 0.843 | 0.843 | 0.847 | 0.838 |
| Support vector machine - LOOCV | 0.873 | 0.837 | 0.838 | 0.824 | 0.853 |
AUC = area under the receiver operating characteristic curve; PPV = positive predictive value; NPV = negative predictive value; CV = nested cross-validation; LOOCV = leave-one-out nested cross-validation.
Figure 2.

Top predictors of being optimized to mindfulness ecological momentary intervention (MEMI) for increased probability of pre-1MFU generalized anxiety disorder (GAD) remission in the final multivariable model
Note. Pre-1MFU = pretreatment to one-month follow-up; SHAP = SHapley Additive exPlanations. On the y-axis, the number denotes the relative importance of the prescriptive predictor in relation to the clinical outcome of interest. For each predictor, if each SHAP-based bee swarm plot turns darker in color from left to right on the x-axis, higher values of that predictor indicate a stronger positive correlation with the outcome and vice versa. On the x-axis, the dot’s position denotes the effect the predictor has on the model’s prediction for a specific participant. When multiple dots emerge at the same x position, they accumulate to display density.
3.3. Prescriptive predictors of pre-1MFU reliable improvement in perseverative cognitions
RF CV was the best-performing initial model (AUC = .816, accuracy = .744, BAC = .741; Table 2 Part B). The final model also performed well (AUC = .821, accuracy = .749, BAC = .739). The predictors of optimization to the MEMI were higher trait perseverative cognitions (#1), being White (#2), older age (#3), more inhibitory dyscontrol (#4), and lower verbal fluency (#5; Fig. 3).
Figure 3.

Top predictors of being optimized to mindfulness ecological momentary intervention (MEMI) for increased probability of pre-1MFU reliable improvement in global perseverative cognitions in the final multivariable model
Note. Pre-1MFU = pretreatment to one-month follow-up; SHAP = SHapley Additive exPlanations. On the y-axis, the number denotes the relative importance of the prescriptive predictor in relation to the clinical outcome of interest. For each predictor, if each SHAP-based bee swarm plot turns darker in color from left to right on the x-axis, higher values of that predictor indicate a stronger positive correlation with the outcome and vice versa. On the x-axis, the dot’s position denotes the effect the predictor has on the model’s prediction for a specific participant. When multiple dots emerge at the same x position, they accumulate to display density.
3.4. Prescriptive predictors of pre-1MFU reliable improvement in trait mindfulness
SVM CV was the best-performing initial model (AUC = .752, accuracy = .674, BAC = .671; Table 2 Part C). The final model had moderate-to-good performance (AUC = .770, accuracy = .667, BAC = .657). The predictors of optimization to the MEMI were higher trait mindfulness (#1), being White (#2), lower set-shifting deficits (#3), being women (#4), and lower verbal fluency (#5; Fig. 4).
Figure 4.

Top predictors of being optimized to mindfulness ecological momentary intervention (MEMI) for increased probability of pre-1MFU reliable improvement in global trait mindfulness in the final multivariable model
Note. Pre-1MFU = pretreatment to one-month follow-up; SHAP = SHapley Additive exPlanations. On the y-axis, the number denotes the relative importance of the prescriptive predictor in relation to the clinical outcome of interest. For each predictor, if each SHAP-based bee swarm plot turns darker in color from left to right on the x-axis, higher values of that predictor indicate a stronger positive correlation with the outcome and vice versa. On the x-axis, the dot’s position denotes the effect the predictor has on the model’s prediction for a specific participant. When multiple dots emerge at the same x position, they accumulate to display density.
3.5. Prescriptive predictors of pre-1MFU reliable improvement in EF composite
SVM CV was the best-performing initial model (AUC = .886, accuracy = .843, BAC = .843; Table 2 Part D). The final model also performed well (AUC = .886, accuracy = .804, BAC = .797). The predictors of optimization to the MEMI were higher trait perseverative cognitions (#1), stronger trait mindfulness (#2), more inhibitory dyscontrol (#3), lower GAD severity (#4), and being women (#5; Fig. 5). Whereas Hypothesis 2 received full support in examining ML models across all four outcomes, Hypothesis 3 had partial support.
Figure 5.

Top predictors of being optimized to mindfulness ecological momentary intervention (MEMI) for increased probability of pre-1MFU reliable improvement in executive functioning composite in the final multivariable model
Note. Pre-1MFU = pretreatment to one-month follow-up; SHAP = SHapley Additive exPlanations. On the y-axis, the number denotes the relative importance of the prescriptive predictor in relation to the clinical outcome of interest. For each predictor, if each SHAP-based bee swarm plot turns darker in color from left to right on the x-axis, higher values of that predictor indicate a stronger positive correlation with the outcome and vice versa. On the x-axis, the dot’s position denotes the effect the predictor has on the model’s prediction for a specific participant. When multiple dots emerge at the same x position, they accumulate to display density.
4. Discussion
Overall, MEMI outperformed SM to generate pre-1MFU reliable improvement in GAD severity and perseverative cognitions, but not trait mindfulness and global EF for participants with GAD. The more remarkable pre-1MFU reliable improvement rate in GAD severity and perseverative cognitions in the MEMI vs. SM was comparable to other DMHIs for GAD, such as a CBT app provided across six weeks (Miller et al., 2021). Plausibly, the non-significant differential treatment efficacy on reliable improvement in trait mindfulness and global EF might be due to the brief nature of the MEMI. Higher-intensity MBIs (Zainal & Newman, In Press) and alternative interventions that explicitly targeted EF, such as EF training (Kenworthy et al., 2014, pp. 12161) and metacognitive therapy (Doig et al., 2021), could be necessary to produce reliable improvement in trait mindfulness and global EF.
Consistent with Hypothesis 2, our strong AUC values (i.e., >0.70) for the final multivariate models were noteworthy. They aligned with other ML studies predicting therapy outcomes for diverse mental illnesses, such as body dysmorphic disorder (Curtiss et al., 2022), depression (Pearson et al., 2019), and social anxiety disorder (Hoogendoorn et al., 2017). We extended the limited but increasing studies that evaluated for whom MEMIs and other DMHI apps were appropriate (e.g., for highly ruminative adolescents; Webb et al., 2022).
Noteworthy was how GAD severity consistently emerged as a top predictor, but in different directions: Although higher GAD severity predicted better chances of GAD improvement, lower GAD severity predicted more EF improvement. However, it is unclear why these contradictory findings occurred, and coefficient direction differences could be due to variations in the targeted outcomes. The former finding concurred with evidence that higher initial GAD severity predicted greater improvement magnitudes following CBT among adults with GAD (Newman & Fisher, 2010; Wetherell et al., 2005). Perhaps adults with higher GAD severity exhibited increased treatment motivation, which promoted more symptom alleviation. At the same time, it is possible that the latter outcome occurred because less symptomatic patients with GAD were naturally more adept and versatile in their thought and action repertoires (Zainal & Newman, 2022a), and exploiting such assets might have encouraged more EF enhancements. These findings offer hope to clinicians treating severely anxious GAD patients by suggesting the value of a self-guided MEMI to augment treatment effects based on precision psychiatry methods. Harnessing optimal precision psychiatry methods is critical because about 10, 000–20,000 DMHIs exist (Bautista & Schueller, 2022), and those with GAD gravitate toward apps emphasizing self-monitoring and mindfulness skills (Nardi et al., 2022). Further, findings partially supported Hypothesis 3 (i.e., those with higher perseverative cognitions, trait mindfulness, and EF may have gained more from the MEMI vs. SM).
Why did higher trait perseverative cognitions, inhibitory dyscontrol, and verbal dysfluency predict optimization to the MEMI for most, if not all, outcomes (i.e., alleviating GAD severity, trait perseverative cognitions, improving mindfulness, and EF)? Our findings aligned with meta-analyses indicating that greater baseline severity correlated with an enhanced likelihood of treatment response in various psychiatric disorders (Andersson et al., 2019). Compromised inhibition and verbal fluency (Zainal & Newman, 2021a; Zainal & Newman, 2021b, 2022b) and excessive rumination (Kim & Newman, 2023) have also been shown to predict, correlate, and be a consequence of GAD symptoms. These pre-treatment attributes may thus have provided individuals with GAD an increased opportunity to enhance their mindfulness skills over time, potentially improving their chances of better intervention outcomes via the MEMI over SM (cf. compensation model; Cohen & DeRubeis, 2018). Individuals with weaker inhibitory control and verbal fluency skills may have benefitted further from MEMI’s structured approach, which included well-organized instructions, among other elements (Hybel et al., 2017). The MEMI repeatedly instructed attentional focus and acceptance of current life circumstances, which could have promoted EF by engendering better emotion regulation skills over time (Im et al., 2021; Schumer et al., 2018). Further research is essential for drawing definitive conclusions.
Of note, other aspects of our findings buttressed the capitalization model (Cheavens et al., 2012; Probst et al., 2022). Specifically, stronger set-shifting and trait mindfulness predicted higher odds of reliable improvement in GAD and trait mindfulness. Leveraging those preexisting behavioral strengths in DMHIs, including brief MEMIs for GAD, might have enhanced their efficacy. The MEMI may have further nurtured existing skills (e.g., capacity to switch thinking modes flexibly, detached observing, nonreactivity to inner experiences) that aligned well with patients’ aptitude.
Some demographic variables emerged as significant prescriptive predictors. Older age predicted being optimized by the MEMI vs. SM for reliable improvements in GAD and perseverative cognitions. This outcome may have been partly due to evidence that younger (vs. older) adults with GAD benefitted more from higher-intensity and more extensive face-to-face psychotherapies or pharmacotherapies (refer to a recent meta-analysis by Carl et al., 2020a). Women were likelier to experience reliable improvement in trait mindfulness and EF than men with MEMI. These findings could have been due to our consistent result that patients with higher tendencies toward perseverative cognition reaped more benefits regarding improvement in various outcomes and prior findings that women displayed increased propensities toward repetitive thinking (Nolen-Hoeksema et al., 2008). Our trial enrolled adults experiencing GAD, which is characterized mainly by persistent negative and future-oriented thoughts. The MEMI, emphasizing present-moment attention and metacognitive stance, could help adults grappling with GAD to shift to adopt alternative adaptive and flexible thinking habits (Zainal & Newman, In Press). Increased versatility via the MEMI might have enhanced organizational, planning, and other EF skills more for women. The MEMI was also more likely to improve trait perseverative cognitions and mindfulness for White individuals than non-White individuals. Cultural differences, such as more innate mindfulness philosophies and practices embedded in other cultures, might account for this difference. Such findings concurred with meta-analytic data that although MBIs were statistically more efficacious than active controls, those comparative effect sizes were modest for ethnic minorities when contrasted with cultural majority populations (Sun et al., 2022). Cultural adaptations of MEMIs may serve to better cater to the requirements of people of color. However, those adaptations based on lived experiences (e.g., structural racism) require rigorous evaluation given complex intersectionality factors (e.g., economic circumstances) and inconsistencies in the literature about the value of culturally adapting psychotherapies (Cougle & Grubaugh, 2022).
Our study had several limitations that merit attention. First, our prescriptive predictor models need larger follow-up replication studies to assess their applicability to different samples with various anxiety disorders (Luedtke et al., 2019). A larger sample size has the potential for more precise scalable and efficacious smartphone DMHIs targeting GAD, leveraging ML techniques, and tailoring treatments to individual characteristics. However, ML-based multivariable predictor models using RF and SVM algorithms with nested CV permit small sample sizes such as ours for building treatment prediction models, as they can address overfitting and class imbalance concerns (Chen & Jeong, 2007; Guo et al., 2010; Han et al., 2021). Future DMHI studies should explore applying ML to prospective observational designs, aggregated RCT data, and pragmatic trial designs (Kessler, 2018). Second, subsequent studies should examine the replicability of our ML models without SMOTE. Third, future studies should use a broader predictor set to determine unique DMHI outcome predictors. Fourth, readers should be cautious when drawing causal inferences when interpreting predictor-outcome relations (Leeuwenberg et al., 2022). Fifth, future studies should recruit more culturally diverse samples regarding race/ethnicity and sex. Sixth, we originally hoped to have 1:1 equipoise randomization, but unfortunately, we had uneven randomization. Lastly, ML models should be used to guide treatment planning instead of providing strict recommendations.
However, study strengths include novelty, RF and SVM suitability for small, imbalanced datasets, identification of differential predictors for two DMHIs (Boehmke & Greenwell, 2019), and good model performance. Also, ML could reduce biases due to multicollinearity (Hawes et al., 2022). Moreover, prescriptive predictors of differential DMHI outcomes were identified to determine persons with GAD most likely to benefit from MEMI over SM. This approach offered more clinically actionable results than identifying prognostic predictors (variables predicting outcomes regardless of treatment type). Together, these prescriptive predictors may serve as a means to impartially identify which patients with GAD would be most likely to benefit from MEMI vs. SM before they use the MEMI. Providing the free MEMI to optimized clients on a waitlist for more expensive and intensive psychotherapies might also improve their clinical care. It might also augment intensive face-to-face psychotherapies (Newman et al., 2014).
Some ethical aspects warrant consideration. One of our study strengths was that we used the TRIPOD guidelines to facilitate transparency in our decision rules in how we built our prescriptive predictor models. Through open science initiatives, it is also crucial to conscientiously address data privacy concerns (Walsh et al., 2018). The current study obtained consent for the use of patients’ deidentified data for clinical research and secondary analyses. We also ensured that training and testing data differed to prevent data leakage. However, our 35% diversity in the sample likely created generalization issues to more diverse groups (Tay et al., 2022). Future studies should recruit more diverse samples.
If future studies replicate our pattern of results and with these ethical considerations in mind, some clinical implications merit attention. The MEMI could benefit clients diagnosed with principal or comorbid GAD seeking treatment in busy outpatient settings such as college counseling centers and academic medical centers with long waitlists and heavy client loads (Xiao et al., 2017). Clinicians could collect client data from self-report and brief neurocognitive tests, then input them into the algorithm using the top five predictive factors model to predict MEMI effectiveness as part of a prescriptive calculator (Webb et al., 2022; Webb et al., 2021). When gathering pre-treatment data, clinicians should omit assessing variables deemed unimportant as prescriptive predictors in the current algorithm (EF errors and WM). Whereas some GAD clients may benefit notably from the MEMI, others may not experience the same optimization level. Still, the MEMI could offer more tools in the repertoire of emotion regulation skills for others predicted not to receive similar optimization. Offering a low-intensity MEMI to clients likely to benefit from it could aid in their transition to more intensive therapist-led cognitive-behavioral and related therapies, which involve homework to enhance their skills and mindset. Earlier actuarial research indicated that more complex cases should begin with more intensive psychotherapies (Delgadillo et al., 2017). In summary, providing higher-intensity psychotherapies to clients unlikely to benefit from the MEMI within a stratified model can enhance patient-centered care by offering this less common and costlier intervention to those who would gain the most (Herzog & Kaiser, 2022).
Supplementary Material
Acknowledgments
An anonymous donation supported the current study.
Funding
The current study received funding from the National Institute of Mental Health (NIMH) (R01 MH115128), the Pennsylvania State University RGSO Dissertation award, Penn State Susan Welch/Nagle Family Graduate Fellowship, the National University of Singapore (NUS) Development Grant, and the Association for Behavioral and Cognitive Therapies (ABCT) Leonard Krasner Student Dissertation Award.
Footnotes
CRediT authorship contribution statement
Newman Michelle G.: Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing. Zainal Nur Hani: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Visualization, Writing – original draft, Writing – review & editing.
Declaration of Competing Interest
My research team and I have no conflicts of interest to declare.
Statement of Ethics
This study was conducted in compliance with the American Psychological Association (APA) ethical standards in the treatment of human participants and approved by the institutional review board (IRB). Further, this research was conducted was conducted ethically in accordance with the World Medical Association Declaration of Helsinki. Informed consent was obtained from participants as per IRB requirements at the Massachusetts General Hospital and Harvard Medical School.
Appendix A. Supporting information
Supplementary data associated with this article can be found in the online version at doi:10.1016/j.janxdis.2024.102825.
Data availability
The authors do not have permission to share data.
References
- Aafjes-van Doorn K, Kamsteeg C, Bate J, & Aafjes M (2021). A scoping review of machine learning in psychotherapy research. Psychotherapy Research, 31(1), 92–116. 10.1080/10503307.2020.1808729 [DOI] [PubMed] [Google Scholar]
- American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (DSM-5). American Psychiatric Association,. 10.1176/appi.books.9780890425596 [DOI] [Google Scholar]
- Andersson G, Carlbring P, & Rozental A (2019). Response and remission rates in internet-based cognitive behavior therapy: an individual patient data meta-analysis. Frontiers in Psychiatry, 10, Article 749. 10.3389/fpsyt.2019.00749 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Apolinário-Hagen J, Drüge M, & Fritsche L (2020). Cognitive behavioral therapy, mindfulness-based cognitive therapy and acceptance commitment therapy for anxiety disorders: Integrating traditional with digital treatment approaches. In Kim Y-K (Ed.), Anxiety disorders: Rethinking and understanding recent discoveries (pp. 291–329). Springer; Singapore. 10.1007/978-981-32-9705-0_17. [DOI] [PubMed] [Google Scholar]
- Archer KJ, & Kimes RV (2008). Empirical characterization of random forest variable importance measures. Computational Statistics & Data Analysis, 52(4), 2249–2260. 10.1016/j.csda.2007.08.015 [DOI] [Google Scholar]
- Baer RA (2006). Using self-report assessment methods to explore facets of mindfulness. Assessment, 13(1), 27–45. 10.1177/1073191105283504 [DOI] [PubMed] [Google Scholar]
- Bautista J, & Schueller SM (2022). Digital mental health deserves investment but the questions are which interventions and where? AJOB Neuroscience, 13(3), 191–193. 10.1080/21507740.2022.2082596 [DOI] [PubMed] [Google Scholar]
- Blampied NM (2022). Reliable change and the reliable change index: Still useful after all these years? The Cognitive Behaviour Therapist, 15, Article e50. 10.1017/s1754470x22000484 [DOI] [Google Scholar]
- Boehmke B, & Greenwell BM (2019). Hands-on machine learning with R. CRC Press,. [Google Scholar]
- Borkowski JG, Benton AL, & Spreen O (1967). Word fluency and brain damage. Neuropsychologia, 5(2), 135–140. 10.1016/0028-3932(67)90015-2 [DOI] [Google Scholar]
- Brown TA, & Barlow DH (2014). Anxiety and related disorders interview schedule for DSM-5 (ADIS-5L): Client interview schedule. Oxford University Press,. [Google Scholar]
- Canby NK, Eichel K, Peters SI, Rahrig H, & Britton WB (2021). Predictors of out-of-class mindfulness practice adherence during and after a mindfulness-based intervention. Psychosomatic Medicine, 83(6), 655–664. 10.1097/PSY.0000000000000873 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carl E, Witcraft SM, Kauffman BY, Gillespie EM, Becker ES, Cuijpers P, & Powers MB (2020a). Psychological and pharmacological treatments for generalized anxiety disorder (GAD): A meta-analysis of randomized controlled trials. Cognitive Behavioral Therapy, 49(1), 1–21. 10.1080/16506073.2018.1560358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carl JR, Miller CB, Henry AL, Davis ML, Stott R, Smits JAJ, & Espie CA (2020b). Efficacy of digital cognitive behavioral therapy for moderate-to-severe symptoms of generalized anxiety disorder: A randomized controlled trial. Depress Anxiety, 37(12), 1168–1178. 10.1002/da.23079 [DOI] [PubMed] [Google Scholar]
- Cawley GC, & Talbot NLC (2010). On over-fitting in model selection and subsequent selection bias in performance evaluation. Journal of Machine Learning Research, 11(70), 2079–2107. 〈http://jmlr.csail.mit.edu/papers/v11/cawley10a.html〉. [Google Scholar]
- Chawla NV, Bowyer KW, Hall LO, & Kegelmeyer WP (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357. 10.1613/jair.953 [DOI] [Google Scholar]
- Cheavens JS, Strunk DR, Lazarus SA, & Goldstein LA (2012). The compensation and capitalization models: A test of two approaches to individualizing the treatment of depression. Behaviour Research and Therapy, 50(11), 699–706. 10.1016/j.brat.2012.08.002 [DOI] [PubMed] [Google Scholar]
- Chen X.-w, & Jeong JC, (2007). Minimum reference set based feature selection for small sample classifications. In: Proceedings of the 24th international conference on Machine learning, Corvalis, Oregon, USA. 10.1145/1273496.1273516. [DOI] [Google Scholar]
- Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, & Van Calster B (2019). A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. Journal of Clinical Epidemiology, 110, 12–22. 10.1016/j.jclinepi.2019.02.004 [DOI] [PubMed] [Google Scholar]
- Cohen ZD, & DeRubeis RJ (2018). Treatment selection in depression. Annual Review of Clinical Psychology, 14(1), 209–236. 10.1146/annurev-clinpsy-050817-084746 [DOI] [PubMed] [Google Scholar]
- Cougle JR, & Grubaugh AL (2022). Do psychosocial treatment outcomes vary by race or ethnicity? A review of meta-analyses. Clinical Psychology Review, 96, Article 102192. 10.1016/j.cpr.2022.102192 [DOI] [PubMed] [Google Scholar]
- Cristianini N, & Shawe-Taylor J (1999). An introduction to support vector machines: And other kernel-based learning methods. Cambridge University Press,. [Google Scholar]
- Curtiss JE, Bernstein EE, Wilhelm S, & Phillips KA (2022). Predictors of pharmacotherapy outcomes for body dysmorphic disorder: A machine learning approach. Psychological Medicine, 1–11. 10.1017/S0033291721005390 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dao KP, De Cocker K, Tong HL, Kocaballi AB, Chow C, & Laranjo L (2021). Smartphone-delivered ecological momentary interventions based on ecological momentary assessments to promote health behaviors: Systematic review and adapted checklist for reporting ecological momentary assessment and intervention studies. JMIR Mhealth Uhealth, 9(11), Article e22890. 10.2196/22890 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delgadillo J, Huey D, Bennett H, & McMillan D (2017). Case complexity as a guide for psychological treatment selection. Journal of Consulting and Clinical Psychology, 85(9), 835–853. 10.1037/ccp0000231 [DOI] [PubMed] [Google Scholar]
- Delis DC, Kaplan E, & Kramer JH (2001). The Delis–Kaplan executive function system: Examiner’s manual. The Psychological Corporation,. [Google Scholar]
- Doig EJ, Fleming J, & Ownsworth T (2021). Evaluation of an occupation-based metacognitive intervention targeting awareness, executive function and goal-related outcomes after traumatic brain injury using single-case experimental design methodology. Neuropsychological Rehabilitation, 31(10), 1527–1556. 10.1080/09602011.2020.1786410 [DOI] [PubMed] [Google Scholar]
- Dreiseitl S, Ohno-Machado L, Kittler H, Vinterbo S, Billhardt H, & Binder M (2001). A comparison of machine learning methods for the diagnosis of pigmented skin lesions. Journal of Biomedical Informatics, 34(1), 28–36. 10.1006/jbin.2001.1004 [DOI] [PubMed] [Google Scholar]
- Egeland J (2015). Measuring working memory with digit span and the letter-number sequencing subtests from the wais-iv: Too low manipulation load and risk for underestimating modality effects. Applied Neuropsychology: Adult, 22(6), 445–451. 10.1080/23279095.2014.992069 [DOI] [PubMed] [Google Scholar]
- Elhai JD, & Montag C (2020). The compatibility of theoretical frameworks with machine learning analyses in psychological research. Current Opinion in Psychology, 36, 83–88. 10.1016/j.copsyc.2020.05.002 [DOI] [PubMed] [Google Scholar]
- Faaland P, Vedaa Ø, Langsrud K, Sivertsen B, Lydersen S, Vestergaard CL, & Kallestad H (2022). Digital cognitive behaviour therapy for insomnia (dCBT-I): Chronotype moderation on intervention outcomes. Journal of Sleeping Research, 31 (5), Article e13572. 10.1111/jsr.13572 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Folkersma W, Veerman V, Ornee DA, Oldehinkel AJ, Alma MA, & Bastiaansen JA (2021). Patients’ experience of an ecological momentary intervention involving self-monitoring and personalized feedback for depression. Internet Interventions, 26, Article 100436. 10.1016/j.invent.2021.100436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gallagher MW, Naragon-Gainey K, & Brown TA (2014). Perceived control is a transdiagnostic predictor of cognitive–behavior therapy outcome for anxiety disorders. Cognitive Therapy and Research, 38(1), 10–22. 10.1007/s10608-013-9587-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goetter EM, Frumkin MR, Palitz SA, Swee MB, Baker AW, Bui E, & Simon NM (2020). Barriers to mental health treatment among individuals with social anxiety disorder and generalized anxiety disorder. Psychological Services, 17 (1), 5–12. 10.1037/ser0000254 [DOI] [PubMed] [Google Scholar]
- Goldberg SB, Torous J, & Sun S (2022). Let’s decide what would be convincing, conduct randomized trials with rigorous comparison conditions, and report tests of moderation and publication bias in meta-analyses. PLOS Digital Health, 1(11), Article e0000127. 10.1371/journal.pdig.0000127 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldberg SB, Wielgosz J, Dahl C, Schuyler B, MacCoon DS, Rosenkranz M, & Davidson RJ (2016). Does the five facet mindfulness questionnaire measure what we think it does? Construct validity evidence from an active controlled randomized clinical trial. Psychological Assessment, 28(8), 1009–1014. 10.1037/pas0000233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gruszka P, Burger C, & Jensen MP (2019). Optimizing expectations via mobile apps: A new approach for examining and enhancing placebo effects. Frontiers in Psychiatry, 10, Article 365. 10.3389/fpsyt.2019.00365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo Y, Graber A, McBurney RN, & Balasubramanian R (2010). Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms. BMC Bioinformatics, 11(1), Article 447. 10.1186/1471-2105-11-447 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hallion LS, Wright AGC, Joormann J, Kusmierski SN, Coutanche MN, & Caulfield MK (2022). A five-factor model of perseverative thought. Journal of Psychopathology and Clinical Science, 131(3), 235–252. 10.1037/abn0000737 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han S, Williamson BD, & Fong Y (2021). Improving random forest predictions in small datasets from two-phase sampling designs. BMC Medical Informatics and Decision Making, 21(1), Article 322. 10.1186/s12911-021-01688-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hawes MT, Schwartz HA, Son Y, & Klein DN (2022). Predicting adolescent depression and anxiety from multi-wave longitudinal data using machine learning. Psychological Medicine, 1–7. 10.1017/S0033291722003452 [DOI] [PubMed] [Google Scholar]
- Herzog P, & Kaiser T (2022). Is it worth it to personalize the treatment of PTSD? - A variance-ratio meta-analysis and estimation of treatment effect heterogeneity in RCTs of PTSD. Journal of Anxiety Disorders, 91, Article 102611. 10.1016/j.janxdis.2022.102611 [DOI] [PubMed] [Google Scholar]
- Hirsch CR, & Mathews A (2012). A cognitive model of pathological worry. Behaviour Research and Therapy, 50(10), 636–646. 10.1016/j.brat.2012.06.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffart A, Burger J, Johnson SU, & Ebrahimi OV (2022). Daily dynamics and mechanisms of anxious symptomatology in the general population: A network study during the COVID-19 pandemic. Journal of Anxiety Disorders, 93, Article 102658. 10.1016/j.janxdis.2022.102658 [DOI] [PubMed] [Google Scholar]
- Hofmann SG, & Gomez AF (2017). Mindfulness-based interventions for anxiety and depression. Psychiatric Clinics of North America, 40(4), 739–749. 10.1016/j.psc.2017.08.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoogendoorn M, Berger T, Schulz A, Stolz T, & Szolovits P (2017). Predicting social anxiety treatment outcome based on therapeutic email conversations. IEEE Journal of Biomedical and Health Informatics, 21(5), 1449–1459. 10.1109/JBHI.2016.2601123 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hornstein S, Forman-Hoffman V, Nazander A, Ranta K, & Hilbert K (2021). Predicting therapy outcome in a digital mental health intervention for depression and anxiety: A machine learning approach. Digital Health, 7, 1–11. 10.1177/20552076211060659 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hybel KA, Mortensen EL, Lambek R, Hojgaard D, & Thomsen PH (2017). Executive function predicts cognitive-behavioral therapy response in childhood obsessive-compulsive disorder. Behaviour Research and Therapy, 99, 11–18. 10.1016/j.brat.2017.08.009 [DOI] [PubMed] [Google Scholar]
- Im S, Stavas J, Lee J, Mir Z, Hazlett-Stevens H, & Caplovitz G (2021). Does mindfulness-based intervention improve cognitive function?: A meta-analysis of controlled studies. Clinical Psychology Review, 84, Article 101972. 10.1016/j.cpr.2021.101972 [DOI] [PubMed] [Google Scholar]
- Jacobson NC, & Nemesure MD (2021). Using artificial intelligence to predict change in depression and anxiety symptoms in a digital intervention: Evidence from a transdiagnostic randomized controlled trial. Psychiatry Research, 295, Article 113618. 10.1016/j.psychres.2020.113618 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jacobson NS, & Truax P (1991). Clinical significance: A statistical approach to defining meaningful change in psychotherapy research. Journal of Consulting and Clinical Psychology, 59(1), 12–19. 10.1037//0022-006X.59.1.12 [DOI] [PubMed] [Google Scholar]
- James G, Witten D, Hastie T, & Tibshirani R (2013). An introduction to statistical learning (Vol. 103). New York: Springer,. [Google Scholar]
- Jenkins DG, & Quintana-Ascencio PF (2020). A solution to minimum sample size for regressions. PLOS One, 15(2), Article e0229345. 10.1371/journal.pone.0229345 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang B, Zhang XG, & Cai TX (2008). Estimating the confidence interval for prediction errors of support vector machine classifiers. Journal of Machine Learning Research, 9, 521–540. <Go to ISI>://WOS:000256642000007 〈https://dl-acm-org.ezaccess.libraries.psu.edu/doi/pdf/10.5555/1390681.1390698〉. [Google Scholar]
- Jordan MI, & Mitchell TM (2015). Machine learning: Trends, perspectives, and prospects. Science, 349(6245), 255–260. 10.1126/science.aaa8415 [DOI] [PubMed] [Google Scholar]
- Kabat-Zinn J (1990). Full catastrophe living: Using the wisdom of your body and mind to face stress, pain and illness. Dell Publishing,. [Google Scholar]
- Kaiser T, Volkmann C, Volkmann A, Karyotaki E, Cuijpers P, & Brakemeier E-L (2022). Heterogeneity of treatment effects in trials on psychotherapy of depression. Clinical Psychology: Science and Practice, 29(3), 294–303. 10.1037/cps0000079 [DOI] [Google Scholar]
- Keller AS, Sydnor VJ, Pines A, Fair DA, Bassett DS, & Satterthwaite TD (2023). Hierarchical functional system development supports executive function. Trends in Cognitive Sciences, 27(2), 160–174. 10.1016/j.tics.2022.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kenworthy L, Anthony LG, Naiman DQ, Cannon L, Wills MC, Luong-Tran C, & Wallace GL (2014). Randomized controlled effectiveness trial of executive function intervention for children on the autism spectrum. Journal of Child Psychology and Psychiatry, 55(4), 374–383. 10.1111/jc [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kern C, Klausch T, & Kreuter F (2019). Tree-based machine learning methods for survey research. Survey Research Methods, 13(1), 73–93. [PMC free article] [PubMed] [Google Scholar]
- Kessler RC (2018). The potential of predictive analytics to provide clinical decision support in depression treatment planning. Current Opinion in Psychiatry, 31(1), 32–39. 10.1097/YCO.0000000000000377 [DOI] [PubMed] [Google Scholar]
- Kim H, & Newman MG (2023). Worry and rumination enhance a positive emotional contrast based on the framework of the contrast avoidance model. Journal of Anxiety Disorders, 94, Article 102671. 10.1016/j.janxdis.2023.102671 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kokol P, Kokol M, & Zagoranski S (2022). Machine learning on small size samples: A synthetic knowledge synthesis, 368504211029777 Science Progress, 105(1). 10.1177/00368504211029777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuhn M, & Johnson K (2018). Applied predictive modeling. Springer,. [Google Scholar]
- Laber EB, & Zhao YQ (2015). Tree-based methods for individualized treatment regimes. Biometrika, 102(3), 501–514. 10.1093/biomet/asv028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- LaFreniere LS, & Newman MG (2016). A brief ecological momentary intervention for generalized anxiety disorder: A randomized controlled trial of the worry outcome journal. Depression and Anxiety, 33(9), 829–839. 10.1002/da.22507 [DOI] [PubMed] [Google Scholar]
- Leeuwenberg AM, van Smeden M, Langendijk JA, van der Schaaf A, Mauer ME, Moons KGM, & Schuit E (2022). Performance of binary prediction models in high-correlation low-dimensional settings: A comparison of methods. Diagnostic and Prognostic Research, 6(1). 10.1186/s41512-021-00115-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lekkas D, Price G, McFadden J, & Jacobson NC (2021). The application of machine learning to online mindfulness intervention data: A primer and empirical example in compliance assessment. Mindfulness, 12(10), 2519–2534. 10.1007/s12671-021-01723-4 [DOI] [Google Scholar]
- Lenhard F, Sauer S, Andersson E, Mansson KN, Mataix-Cols D, Rück C, & Serlachius E (2018). Prediction of outcome in internet-delivered cognitive behaviour therapy for paediatric obsessive-compulsive disorder: A machine learning approach. International Journal of Methods in Psychiatric Research, 27(1), Article e1576. 10.1002/mpr.1576 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewis MJ, Spiliopoulou A, Goldmann K, Pitzalis C, McKeigue P, & Barnes MR (2023). nestedcv: an R package for fast implementation of nested cross-validation with embedded feature selection designed for transcriptomics and high-dimensional data. Bioinformatics Advances, 3(1), Article vbad048. 10.1093/bioadv/vbad048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loh WY (2011). Classification and regression trees. WIREs Data Mining and Knowledge Discovery, 1(1), 14–23. 10.1002/widm.8 [DOI] [Google Scholar]
- Luedtke A, Sadikova E, & Kessler RC (2019). Sample size requirements for multivariate models to predict between-patient differences in best treatments of major depressive disorder. Clinical Psychological Science, 7(3), 445–461. 10.1177/2167702618815466 [DOI] [Google Scholar]
- Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, & Lee SI (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1), 56–67. 10.1038/s42256-019-0138-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matthews G, & Wells A (2004). Rumination, depression, and metacognition: The S-REF model. In Papageorgiou C, & Wells A (Eds.), Depressive rumination: Nature, theory and treatment (pp. 125–152). Wiley. 10.1002/9780470713853.ch7. [DOI] [Google Scholar]
- McDevitt-Murphy ME, Luciano MT, & Zakarian RJ (2018). Use of ecological momentary assessment and intervention in treatment with adults. Focus, 16(4), 370–375. 10.1176/appi.focus.20180017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- McGlinchey JB, Atkins DC, & Jacobson NS (2002). Clinical significance methods: Which one to use and how useful are they? Behavior Therapy, 33(4), 529–550. 10.1016/S0005-7894(02)80015-6 [DOI] [Google Scholar]
- Miller CB, Gu J, Henry AL, Davis ML, Espie CA, Stott R, & Carl JR (2021). Feasibility and efficacy of a digital CBT intervention for symptoms of generalized anxiety disorder: A randomized multiple-baseline study. Journal of Behavior Therapy and Experimental Psychiatry, 70, Article 101609. 10.1016/j.jbtep.2020.101609 [DOI] [PubMed] [Google Scholar]
- Mizera CM, Bolin RM, Nugent WR, & Strand EB (2015). Facets of mindfulness related to a change in anxiety following a mindfulness-based intervention. Journal of Human Behavior in the Social Environment, 26(1), 100–109. 10.1080/10911359.2015.1062674 [DOI] [Google Scholar]
- Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, & Collins GS (2015). Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): Explanation and elaboration. Annals of Internal Medicine, 162(1), W1–W73. 10.7326/M14-0698 [DOI] [PubMed] [Google Scholar]
- Murphy ST, Cooper AA, Hollars SN, & Strunk DR (2021). Who benefits from a cognitive vs. behavioral approach to treating depression? A pilot study of prescriptive predictors. Behavior Therapy, 52(6), 1433–1448. 10.1016/j.beth.2021.03.012 [DOI] [PubMed] [Google Scholar]
- Nardi W, Roy A, Dunsiger S, & Brewer J (2022). Analyzing the impact of mobile app engagement on mental health outcomes: Secondary analysis of the unwinding anxiety program. Journal of Medical Internet Research, 24(8), Article e33696. 10.2196/33696 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Newman MG, Cho S, & Kim H (2017). Worry and generalized anxiety disorder: A review. Reference Module in Neuroscience and Biobehavioral Psychology. 10.1016/B978-0-12-809324-5.05108-7 [DOI] [Google Scholar]
- Newman MG, & Fisher AJ (2010). Expectancy/credibility change as a mediator of cognitive behavioral therapy for generalized anxiety disorder: Mechanism of action or proxy for symptom change? International Journal of Cognitive Therapy, 3, 245–261. 10.1521/ijct.2010.3.3.245 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Newman MG, Kanuri N, Rackoff GN, Jacobson NC, Bell MJ, & Taylor CB (2021). A randomized controlled feasibility trial of internet-delivered guided self-help for generalized anxiety disorder (GAD) among university students in India. Psychotherapy, 58(4), 591–601. 10.1037/pst0000383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Newman MG, Przeworski A, Consoli AJ, & Taylor CB (2014). A randomized controlled trial of ecological momentary intervention plus brief group therapy for generalized anxiety disorder. Psychotherapy, 51(2), 198–206. 10.1037/a0032519 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Newman MG, Zainal NH, & Hoyer J (2020). Cognitive-behavioral therapy (CBT) for generalized anxiety disorder (GAD). Generalized Anxiety Disorder and Worrying, 203–230. 10.1002/9781119189909.ch10 [DOI] [Google Scholar]
- Newman MG, Zuellig AR, Kachin KE, Constantino MJ, Przeworski A, Erickson T, & Cashman-McGrath L (2002). Preliminary reliability and validity of the generalized anxiety disorder questionnaire-IV: A revised self-report diagnostic measure of generalized anxiety disorder. Behavior Therapy, 33(2), 215–233. 10.1016/S0005-7894(02)80026-0 [DOI] [Google Scholar]
- Nolen-Hoeksema S, Wisco BE, & Lyubomirsky S (2008). Rethinking rumination. Perspectives on Psychological Science, 3(5), 400–424. 10.1111/j.1745-6924.2008.00088.x [DOI] [PubMed] [Google Scholar]
- Pearson R, Pisner D, Meyer B, Shumake J, & Beevers CG (2019). A machine learning ensemble to predict treatment outcomes following an Internet intervention for depression. Psychological Medicine, 49(14), 2330–2341. 10.1017/S003329171800315X [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perestelo-Perez L, Barraca J, Penate W, Rivero-Santana A, & Alvarez-Perez Y (2017). Mindfulness-based interventions for the treatment of depressive rumination: Systematic review and meta-analysis. International Journal of Clinical and Health Psychology, 17(3), 282–295. 10.1016/j.ijchp.2017.07.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Petkova E, Park H, Ciarleglio A, Todd Ogden R, & Tarpey T (2019). Optimising treatment decision rules through generated effect modifiers: A precision medicine tutorial. BJPsych Open, 6(1), Article e2. 10.1192/bjo.2019.85 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Probst G, Vîslă A, & Flückiger C (2022). Patients’ symptoms and strengths as predictors of long-term outcomes of CBT for generalized anxiety disorder – A three-level, multi-predictor analysis. Journal of Anxiety Disorders, 92, Article 102635. 10.1016/j.janxdis.2022.102635 [DOI] [PubMed] [Google Scholar]
- Qi Y (2012). Random forest for bioinformatics. In Zhang C, & Ma YQ (Eds.), Ensemble machine learning (pp. 307–323). Springer. 10.1007/978-1-4419-9326-7_11. [DOI] [Google Scholar]
- Rauschenberg C, Boecking B, Paetzold I, Schruers K, Schick A, van Amelsvoort T, & Reininghaus U (2021). A compassion-focused ecological momentary intervention for enhancing resilience in help-seeking youth: Uncontrolled pilot study. JMIR Mental Health, 8(8), Article e25650. 10.2196/25650 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rebello TJ, Marques A, Gureje O, & Pike KM (2014). Innovative strategies for closing the mental health treatment gap globally. Current Opinion in Psychiatry, 27 (4), 308–314. 10.1097/YCO.0000000000000068 [DOI] [PubMed] [Google Scholar]
- Renna ME, Quintero JM, Soffer A, Pino M, Ader L, Fresco DM, & Mennin DS (2018). A pilot study of emotion regulation therapy for generalized anxiety and depression: Findings from a diverse sample of young adults. Behavior Therapy, 49(3), 403–418. 10.1016/j.beth.2017.09.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schumer MC, Lindsay EK, & Creswell JD (2018). Brief mindfulness training for negative affectivity: A systematic review and meta-analysis. Journal of Consulting and Clinical Psychology, 86(7), 569–583. 10.1037/ccp0000324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Segal ZV, Williams JMG, & Teasdale JD (2002). Mindfulness-based cognitive therapy for depression: A new approach to preventing relapse. Guilford Press,. [Google Scholar]
- Silang KA, Sohal PR, Bright KS, Leason J, Roos L, Lebel C, & Tomfohr-Madsen LM (2022). eHealth interventions for treatment and prevention of depression, anxiety, and insomnia during pregnancy: Systematic review and meta-analysis. JMIR Mental Health, 9(2), Article e31116. 10.2196/31116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song R, Kosorok M, Zeng D, Zhao Y, Laber E, & Yuan M (2015). On sparse representation for optimal individualized treatment selection with penalized outcome weighted learning. Stat: Bulletin of the Wisconsin Nurses Association, 4(1), 59–68. 10.1002/sta4.78 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spinhoven P, Hoogerwerf E, van Giezen A, & Greeven A (2022). Mindfulness-based cognitive group therapy for treatment-refractory anxiety disorder: A pragmatic randomized controlled trial. Journal of Anxiety Disorders, 90, Article 102599. 10.1016/j.janxdis.2022.102599 [DOI] [PubMed] [Google Scholar]
- Spinhoven P, Huijbers MJ, Ormel J, & Speckens AEM (2017). Improvement of mindfulness skills during mindfulness-based cognitive therapy predicts long-term reductions of neuroticism in persons with recurrent depression in remission. Journal of Affective Disorders, 213, 112–117. 10.1016/j.jad.2017.02.011 [DOI] [PubMed] [Google Scholar]
- Spinhoven P, Klein N, Kennis M, Cramer AOJ, Siegle G, Cuijpers P, & Bockting CL (2018). The effects of cognitive-behavior therapy for depression on repetitive negative thinking: A meta-analysis. Behaviour Research and Therapy, 106, 71–85. 10.1016/j.brat.2018.04.002 [DOI] [PubMed] [Google Scholar]
- Stroop JR (1935). Studies of interference in serial verbal reactions. Journal of Experimental Psychology, 18(6), 643–662. 10.1037/h0054651 [DOI] [Google Scholar]
- Sun S, Goldberg SB, Loucks EB, & Brewer JA (2022). Mindfulness-based interventions among people of color: A systematic review and meta-analysis. Psychotherapy Research, 32(3), 277–290. 10.1080/10503307.2021.1937369 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Swets JA (1988). Measuring the accuracy of diagnostic systems. Science, 240(4857), 1285–1293. 10.1126/science.3287615 [DOI] [PubMed] [Google Scholar]
- Szkodny LE, & Newman MG (2019). Delineating characteristics of maladaptive repetitive thought: Development and preliminary validation of the perseverative cognitions questionnaire. Assessment, 26(6), 1084–1104. 10.1177/1073191117698753 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tay L, Woo SE, Hickman L, Booth BM, & D’Mello S (2022). A conceptual framework for investigating and mitigating machine-learning measurement bias (MLMB) in psychological assessment. Advances in Methods and Practices in Psychological Science, 5(1). 10.1177/25152459211061337 [DOI] [Google Scholar]
- Tymofiyeva O, Yuan JP, Huang CY, Connolly CG, Henje Blom E, Xu D, & Yang TT (2019). Application of machine learning to structural connectome to predict symptom reduction in depressed adolescents with cognitive behavioral therapy (CBT). NeuroImage: Clinical, 23, Article 101914. 10.1016/j.nicl.2019.101914 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Buuren S (2007). Multiple imputation of discrete and continuous data by fully conditional specification. Statistical Methods in Medical Research, 16(3), 219–242. 10.1177/0962280206074463 [DOI] [PubMed] [Google Scholar]
- Varma S, & Simon R (2006). Bias in error estimation when using cross-validation for model selection. BMC Bioinformatics, 7, Article 91. 10.1186/1471-2105-7-91 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wallert J, Boberg J, Kaldo V, Mataix-Cols D, Flygare O, Crowley JJ, & Rück C (2022). Predicting remission after internet-delivered psychotherapy in patients with depression using machine learning and multi-modal data. Translational Psychiatry, 12 (1), Article 357. 10.1038/s41398-022-02133-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walsh CG, Xia W, Li M, Denny JC, Harris PA, & Malin BA (2018). Enabling open-science initiatives in clinical psychology and psychiatry without sacrificing patients’ privacy: Current practices and future challenges. Advances in Methods and Practices in Psychological Science, 1(1), 104–114. 10.1177/2515245917749652 [DOI] [Google Scholar]
- Webb CA, Swords CM, Lawrence HR, & Hilt LM (2022). Which adolescents are well-suited to app-based mindfulness training? A randomized clinical trial and data-driven approach for personalized recommendations. Journal of Consulting and Clinical Psychology, 90(9), 655–669. 10.1037/ccp0000763 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Webb CA, Swords CM, Murray L, & Hilt LM (2021). Abased mindfulness training for adolescent rumination: Predictors of immediate and cumulative benefit. Mindfulness, 12(10), 2498–2509. 10.1007/s12671-021-01719-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wechsler D (2008). Wechsler adult intelligence scale-fourth edition: Technical and interpretative manual. Pearson Assessment,. [Google Scholar]
- Wester RA, Rubel J, & Mayer A (2022). Covariate selection for estimating individual treatment effects in psychotherapy research: A simulation study and empirical example. Clinical Psychological Science, 10(5), 920–940. 10.1177/21677026211071043 [DOI] [Google Scholar]
- Wetherell JL, Hopko DR, Diefenbach GJ, Averill PM, Beck JG, Craske MG, & Stanley MA (2005). Cognitive-behavioral therapy for late-life generalized anxiety disorder: Who gets better? Behavior Therapy, 36(2), 147–156. 10.1016/S0005-7894(05)80063-2 [DOI] [Google Scholar]
- Xiao H, Carney DM, Youn SJ, Janis RA, Castonguay LG, Hayes JA, & Locke BD (2017). Are we in crisis? National mental health and treatment trends in college counseling centers. Psychological Services, 14(4), 407–415. 10.1037/ser0000130 [DOI] [PubMed] [Google Scholar]
- Zainal NH, & Newman MG (2021a). Increased inflammation predicts nine-year change in major depressive disorder diagnostic status. Journal of Abnormal Psychology, 130(8), 829–840. 10.1037/abn0000716 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zainal NH, & Newman MG (2021b). Within-person increase in pathological worry predicts future depletion of unique executive functioning domains. Psychological Medicine, 51(10), 1676–1686. 10.1017/S0033291720000422 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zainal NH, & Newman MG (2022a). Executive functioning constructs in anxiety, obsessive-compulsive, post-traumatic stress, and related disorders. Current Psychiatry Reports, 24(12), 871–880. 10.1007/s11920-022-01390-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zainal NH, & Newman MG (2022b). Inflammation mediates depression and generalized anxiety symptoms predicting executive function impairment after 18 years. Journal of Affective Disorders, 296, 465–475. 10.1016/j.jad.2021.08.077 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zainal NH, & Newman MG (2023). A randomized controlled trial of a 14-day mindfulness ecological momentary intervention (MEMI) for generalized anxiety disorder. European Psychiatry, 66(1), Article e12. 10.1192/j.eurpsy.2023.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zainal NH, & Newman MG (In Press). Mindfulness enhances cognitive functioning: A meta-analysis of 111 randomized controlled trials. Health Psychology Review. 10.1080/17437199.2023.2248222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao Y, Zeng D, Rush AJ, & Kosorok MR (2012). Estimating individualized treatment rules using outcome weighted learning. Journal of the American Statistical Association, 107(499), 1106–1118. 10.1080/01621459.2012.695674 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Y, Cao Z, Yang M, Xi X, Guo Y, Fang M, & Du Y (2017). Comorbid generalized anxiety disorder and its association with quality of life in patients with major depressive disorder. Scientific Reports, 7, Article 40511. 10.1038/srep40511 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zulfiker MS, Kabir N, Biswas AA, Nazneen T, & Uddin MS (2021). An in-depth analysis of machine learning approaches to predict depression. Current Research in Behavioral Sciences, 2. 10.1016/j.crbeha.2021.100044 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The authors do not have permission to share data.
