Skip to main content
Trials logoLink to Trials
. 2025 Sep 1;26:334. doi: 10.1186/s13063-025-09012-4

Statistical analysis plan for a parallel group randomized clinical trial comparing schema therapy versus treatment as usual for outpatients with difficult-to-treat depression (DEPRE-ST)

Ida-Marie T P Arendt 1,2,, Matthias Gondan 3, Sophie Juul 4,5,6, Lene Halling Hastrup 7,8, Carsten Hjorthøj 9,10, Stine B Moeller 1,2
PMCID: PMC12403386  PMID: 40890784

Abstract

Background

“Difficult-to-treat” depression — here operationalized as either chronic or treatment-resistant depression — encompasses about one third of all patients with depression. Despite its considerable size, the patient group is understudied, and evidence-based psychotherapeutic treatment options are currently limited. The DEPRE-ST trial therefore seeks to investigate beneficial and harmful effects of schema therapy for difficult-to-treat depression.

Methods

This is a randomized multicenter parallel-group superiority clinical trial. A total of 129 patients will be randomized to receive either 30 sessions of individual schema therapy or psychotherapy according to Treatment As Usual, along with psychopharmacological treatment. Participants are assessed before randomization and at 6, 12, and 24 months after randomization. The primary outcome is depression symptoms (6-item Hamilton Rating Scale for Depression), assessed at 12 months. Secondary outcomes include health-related quality of life (European Quality of Life 5 Dimensions 5 Level Version), functional impairment (Work and Social Adjustment Scale), psychological wellbeing (WHO-5 Well-being Index), and negative effects of treatment (Negative Effects Questionnaire), alongside a range of patient-relevant exploratory outcomes. Blinding of treatment allocation is ensured for primary outcome assessors, statisticians, and the data safety and monitoring committee. The primary outcome will be analyzed with multilevel linear regression, with conservative multiple imputation for missing data, and presented as the covariate-adjusted difference between treatments’ change scores with its 95% confidence intervals. Further, a health economic analysis will be performed.

Discussion

This statistical analysis plan was developed and submitted before unblinding of data to ensure transparency and diminish bias in selection, analysis, and reporting of results.

Trial registration

ClinicalTrials.gov NCT05833087. Registered on 15th April 2023. https://clinicaltrials.gov/study/NCT05833087?term=depre-st&checkSpell=false&rank=1

Supplementary Information

The online version contains supplementary material available at 10.1186/s13063-025-09012-4.

Keywords: Schema therapy, Difficult-to-treat depression, Treatment-resistant depression, Treatment refractory depression, Chronic depression, Depressive personality disorder, Persistent depressive disorder, Randomized controlled trial, Psychotherapy, Childhood trauma

Introduction

Background and rationale

While major depression in general is seen as treatable with many forms of psychotherapy [1], some patients require more specialized and focused forms of psychotherapy treatment [2, 3]. Up to 35% of patients suffer from either chronic (> 2 years duration) or treatment-resistant depression (often defined as persistent depression after sufficient treatment attempts with at least two different antidepressants), which together can be labelled “difficult-to-treat depression” (DTD) [4]. Despite both high prevalence and illness burden, DTD is underprioritized in psychotherapy research, with room for improvement both in the understanding of the disorder and in efficient treatment options [5, 6].

Patients with DTD seem to differ qualitatively from patients with non-DTD in several aspects [7]: First, they exhibit much higher frequencies of childhood adversity [811]. Second, problematic personality traits or comorbid personality disorders as well as dysfunctional interpersonal behavior or cognitive styles are more common [10, 12]. As all of these factors can both exacerbate depression symptoms, maintain or prolong depressive periods, and be risk factors for depression relapse [814], they should be addressed in psychotherapeutic treatment for DTD in order to ensure a strong and long-lasting treatment effect.

Schema therapy (ST) was derived from cognitive behavior therapy with elements from psychodynamic, emotion focused, and Gestalt therapy, exactly with the purpose of treating more complex cases with deeply rooted psychological issues. Using a distinct experiential focus [15, 16], ST works directly with childhood adversity and trauma, while it trans-diagnostically addresses problematic personality traits underlying the presenting symptoms and disorders. While some smaller studies on schema therapy for chronic depression have shown promising results [17, 18], and two recent randomized controlled trials have investigated the effects of ST compared to psychodynamic therapy for depression with comorbid borderline personality disorder [19] and the effects of ST for depression more broadly [20], ST for DTD has to our knowledge not yet been studied in a randomized controlled trial.

The current trial aims to contribute to the evidence base for ST in the treatment of DTD. It investigates the beneficial and harmful effects of up to 30 individual sessions of ST compared to treatment as usual (TAU) for depression in a secondary psychiatric out-patient setting.

Methods

Trial design

DEPRE-ST is a multi-center, two-arm, parallel group, assessor-blinded, randomized controlled superiority study, comparing the effect of ST vs TAU on the primary outcome of depression symptoms (HAMD-6) at 12 months after randomization. Patients are allocated 1:1 to either 30 sessions of individual ST or TAU for major depression. TAU can be delivered both as individual and group therapy and may be psychodynamic or cognitive behavior therapy.

Trial interventions

Schema therapy will consist of 30 weekly 45–60-min individual sessions of ST during a period of 10 to 12 months (allowing for sick days and holidays). The therapy is delivered according to a manual developed for this study (see study protocol [21]). Treatment As Usual corresponds to the psychotherapeutic treatment that is routinely offered at the particular treatment site. It can consist of Cognitive Behavioral Therapy or psychodynamic therapy, either individually (6–16 sessions of 45–60 min) or in groups (13–16 sessions of 90–120 min).

Since the study protocol was submitted [21], two more sites have been added to the trial: a district psychiatric out-patient center in the Danish town Svendborg, and a psychiatric out-patient clinic specializing in treatment of mood disorders in the Danish capital Copenhagen.

In both treatment arms, participants will additionally be offered psychopharmacological treatment (as deemed appropriate for the individual patient), meetings with next-of kin, etc., as these are parts of the treatment in the Danish secondary mental health system. See the study protocol [21] for more details on treatment, both specific and general, for the two treatment arms.

Eligibility

Participant eligibility criteria are as follows (identical to those stated in the study protocol [21]):

  • Outpatient

  • Aged 18 or above

  • Referred to treatment for depression in a psychiatric clinic, or already in treatment at the clinic and eligible for a second treatment package at the time of inclusion. For an explanation of treatment packages, see Item 11a of the study protocol [21].

  • Meeting the diagnosis of chronic or treatment-resistant depression, that is: Clinical major depressive disorder (MDD) as assessed by the M.I.N.I.−5 diagnostic interview [22]; MDD duration of at least two years or persistent MDD after two or more trials of antidepressants from different classes, in an adequate dosage and time period (≥ 4 weeks), or moderate treatment resistance as measured on the Maudsley Staging Model (MSM) [23], score above 6.

  • Minimum score of 9 points on the Hamilton Rating Scale for Depression 6 (HAMD-6), corresponding to moderate to severe MDD.

Exclusion criteria are as follows:

Concurrent with the criteria for treatment packages for depression in Danish secondary mental health settings:

  • Alcohol or substance abuse

  • Bipolar disorder

  • Psychotic disorders or current psychotic symptoms of a character that precludes a depression treatment package

  • Acute suicidal risk

  • Mental disability (estimated IQ < 70)

Further study-specific exclusion criteria:

  • Non-Danish speaker, to the extent that the person cannot read and respond to the trial questionnaires in Danish

  • Have received treatment with ST in the past 5 years

  • Pregnancy known at the time of inclusion, since childbirth and infant care could lead to difficulties in upkeeping continuous treatment appointments.

Screening of eligible participants

Eligible patients are identified at the participating sites, either by researchers present at multidisciplinary case conferences or by a treatment provider at the site, whereafter a short phone screening for eligibility takes place. If the participant is deemed initially eligible, a more thorough eligibility screening is made in person or in an online meeting, after which the patient can be included and randomized. We will register the overall number of potentially eligible patients that we have screened in conferences or through referral from treatment providers, the number of patients not deemed eligible for participation, and overall reasons for not including patients (e.g., did not fulfill the inclusion criteria). All information is entered in a CONSORT flow diagram (see Fig. 1). We will not be able to register the total number of potentially eligible patients for the study as we are not able to be present at all case conferences throughout the study, meaning that this number would be unreliable.

Fig. 1.

Fig. 1

Consolidated Standards of Reporting Trials (CONSORT) flow chart

Patient characteristics

The following data will be collected at baseline: age, sex, country of origin (Danish/non-Danish, Western/non-Western), civil status, living alone or with partner/children/others, number of children, level of education, current job situation (including sick leave), yearly income of patient and of household, current psychosocial stressors (Yes/No), number of previous depressions, age at the time of the first depression, former psychiatric treatment (duration, format), depression and other diagnoses from electronic patient journal, comorbid diagnoses according to the M.I.N.I diagnostic interview, severity of current depression measured on the HAMD-6 (grouped by severity in “Moderate depression” or “Severe depression”), current and former intake of psychotropic medicine in current depressive episode, former treatment with psychotherapy in current depressive episode, other treatments in current depressive episode, childhood adversity, expectancies for changes in depression symptoms, and the baseline measurements of the primary, secondary, and exploratory outcomes. Baseline characteristics will be presented in a table as seen in Additional file 2.

Adherence

Criteria for protocol adherence are defined at the level of both the therapists and the patients.

Therapist adherence

All ST therapists’ competency and adherence to the treatment protocol is ensured in several ways: ST training consists of formalized, practice-based live and online training, tailored to the current level of ST knowledge for the individual therapist. Initially, one video recording is submitted by each therapist with a ST session with a current patient at the treatment site (who is not participating in the study). Therapists performing below the required level of competency will be asked to resubmit a video recording of a session with ST until satisfactory standards are met.

All schema therapists receive 1.5 h of group supervision monthly throughout the treatment phase of the study. One-third of all sessions with schema therapy are videotaped. The therapist adherence to the schema therapy manual and therapists’ competency in schema therapy is measured by rating two randomly selected recordings from each patient with the Schema Therapy Rating Scale [24]. Adherence will be reported as the number of schema therapy elements used in the session (number of endorsed items from items number 6–14), while competency will be reported as the average grade given for all endorsed items.

Patient adherence

Major protocol deviations are defined as:

  • Premature drop-out of treatment without prior agreement with the therapist. However, discontinuation of treatment is not noted as dropout in the case of explicit agreement between therapist and patient due to either persistent improvement of the patient’s condition, lack of effect or adverse effects, low patient motivation, treatment adherence or attendance, or referral to other treatment due to exclusion criteria that were not known at the time of inclusion (for details, please consult the study protocol [21]).

  • Cross-over, that is, the patient is randomized to one study arm but received the treatment from the other arm.

  • Contamination, that is, patients given treatment or elements of treatment from the other treatment arm in addition to the treatment they were randomized to. This does not include elements of Cognitive Behavior Therapy (CBT) given as part of a schema therapy treatment, as CBT elements (such as cognitive restructuring) can be part of ST and are included in the ST manual.

  • Non-permitted additional treatment, for example, patients given additional treatment at the treatment site than the treatment they have been randomized to receive (e.g., CBT for insomnia).

  • Other deviations that might arise during the course of the trial will be reported if deemed that they can influence the effects of treatment (exceptions to these are noted in the study protocol [21].

The electronic patient journals are accessed in order to record the above information. The absolute number and percentage of treatment drop-out, crossover, contamination, and other deviations, and the average length of treatment (completed sessions) will be reported in the main article.

Participants may drop out from the intervention but still participate in data collection; they may continue in the intervention but no longer wish to participate in data collection; or they may be lost to follow-up in the case we are not able to obtain contact with them. However, they are not able to withdraw consent to the analysis of data already collected. Level, timing of and reasons for withdrawal, and lost to follow-up will be presented in the CONSORT flow diagram (see Fig. 1). In the case we cannot get in touch with a participant, we will get data from the national Danish registries to see whether the participant may have moved abroad or died.

Randomization and stratification

Randomization is done directly after completion of the baseline assessment in the data management system REDCap (Research Electronic Data Capture), a secure web application which is used in the trial for building and managing measurement tools. Outcome assessors (for the observer-rated HAMD-6), data managers, statisticians, the data safety monitoring committee, and researchers writing up overall conclusions for the outcome article are all blinded to treatment allocation. For details about the timing about randomization and sequence generation as well as information on blinding, see the study protocol and its Additional file 4 [21].

The randomization was initially stratified by depression severity (moderate or severe; HAMD-6) and childhood adversity (Childhood Trauma Questionnaire (CTQ-SF), dichotomized as a CTQ score of < or ≥ 49) at baseline. However, after including 14 participants, the trial team realized that stratification for treatment site was lacking, where after this stratification has been applied for the remainder of the included participants, in lieu of stratifying for depression severity and childhood adversity.

Sample size

The sample size was calculated based on the primary outcome, HAMD-6, with a minimally clinically relevant difference of 2.0 points between the treatment groups and a standard deviation of 3.5. For further details of the sample size calculation, see the study protocol [21]. The total sample size will be 129 participants, giving 80% statistical power when accounting for approximately 20% dropout as well as clustering for about 5 patients per therapist, assuming an intra-cluster correlation of 0.01. Table 1 shows that this number of patients also yields adequate power in the secondary outcomes, but not for the exploratory outcomes [2534].

Table 1.

Power estimates for secondary and exploratory outcomes

Outcome MCID SD α Power Reference
Secondary
Level of functioning (WSAS) 8.0 9.59 5% 98% [27]
Psychological well-being (WHO-5) 10.0 7.9 5% 100% [26]
Health-related quality of life (ED-5D-5L) 0.071 0.102 5% 93% [28]
Negative effects of treatment (NEQ) 2.0 3.5 5% 89% [29]
Exploratory
Changes in patient-elected problems (PSYCHLOPS) 2.33* 4.66 5% 70% [30]
Perseverative Thinking (PTQ) 5.6* 11.2 5% 70% [31]
Anger reactions (DAR-7) 6.8* 13.6 5% 70% [32]
Anxiety (SCL-10, anxiety subscale) 0.31* 0.71 5% 70% [33]
Anger processing (MAP)** - - - - -
Personal recovery (Brief INSPIRE-O) 8 20.7 5% 48% [34]

MCID, minimal clinically relevant difference SD, standard deviation, α: two-tailed significance level. For simplicity, the above power estimates do not account for clustering or study dropout.

*Where the MCID was not available, it was estimated as SD/2

**It was not possible to obtain data for the MAP

Outcomes

For details on the outcomes, please see the study protocol [21]. Outcomes are assessed at 6, 12, and 24 months after randomization (with a few exceptions, see Table 2). The primary time point of interest is 12 months after randomization for all outcomes with a few exceptions (for details, see Table 2). It is the goal to have a “visit window” of no more than 4 weeks around the exact planned time point for assessment. The average timing (days) between randomization and outcome assessments will be reported.

Table 2.

Timing of interventions and assessments

Enrollment Post-allocation Planned
T0 Start treatment Month 6 Month 12 Month 24 Month 36
Eligibility screening X
Informed consent X
Randomization X
Schema therapy
Treatment as usual
Baseline data X
PSYCHLOPS pre X
PSYCHLOPS post TAU ST
CTQ X
DCES-P X
NEQ X X X X
EQ-5D-5L X X X X X
PTQ X X X X X
WSAS X X X X X
MAP-9 X X X X X
DAR X X X X X
WHO5 X X X X X
INSPIRE-O Brief X X X X X
SCL-10 X X X X X
SMI 1.1 (part) TAU X TAU X X
YSQ S.3 (part) TAU X TAU X X
SMI 1.1 ST ST
YSQ S.3 ST ST

CTQ Childhood Trauma Questionnaire, DAR Dimensions of Anger Reactions Scale, DCES-P Depression Change Expectancy Scale—Pessimistic scale, EQ-5D-5L European Quality of Life 5 Dimensions 5 Level Version, MAP-9 Metacognitive Anger Processing Scale (9 items), NEQ Negative Effects Questionnaire, PSYCHLOPS post Psychological outcome profiles post-therapy, PSYCHLOPS pre Psychological outcome profiles pre-therapy, PTQ Perseverative Thinking Questionnaire, SCL-10 Symptom Checklist (anxiety subscale; 5 items), SMI 1.1 Schema Mode Inventory version 1.1 (full version), SMI 1.1 part Schema Mode Inventory version 1.1 (Healthy Adult, Vulnerable Child, Punitive and Demanding Parent-modes only), ST schema therapy, TAU treatment as usual, WHO-5 WHO-5 Well-Being Index, WSAS Work and Social Adjustment Scale, YSQ S.3 Young Schema Questionnaire S3 (full version), YSQ S.3 part Young Schema Questionnaire S3 (Emotional Deprivation, Abandonment, Mistrust/Abuse, Social Isolation, and Defectiveness schemas only).

Primary outcome

The primary outcome is the severity of depression, assessed with the 6-item Hamilton Rating Scale for Depression (HAMD-6) [35], observer-rated, 12 months after randomization. The HAMD-6 will also be assessed at baseline (for covariate adjustment), and at 6 and 24 months after randomization (as a secondary outcome).

Secondary outcomes

In recent years, there has been increasing focus on the voice of patients and their needs [36]. Especially in the case of severe or chronic illnesses, full recovery from symptoms might not be feasible, but treatments may provide other beneficial effects that are equally or more important for patients. For this reason, the current study has chosen a wide array of secondary and exploratory outcomes, as seen below, in the effort to encompass alternative ways of measuring treatment success. Notably, the Psychological Outcome Profiles (PSYCHLOPS) [30]) lets the patient choose idiosyncratic problems that might be targeted by treatment.

  • Depression symptoms on the HAMD-6 at 6 and 24 months after randomization

  • Quality of life, assessed with the European Quality of Life 5 Dimensions 5 Level Version (EQ-5D-5L) (primary time point of interest at 24 months after randomization)

  • Level of functioning, assessed with the Work and Social Adjustment Scale (WSAS)

  • Psychological well-being, assessed with the WHO-5 Well-Being Index (WHO-5)

  • Negative effects of treatment, assessed with the Negative Effects Questionnaire (NEQ)

Exploratory outcomes

Exploratory outcomes were defined as such on the basis of inadequate power (< 80%) [25] (see Table 1). Similar to the secondary outcomes mentioned above, the results on the exploratory outcomes will be interpreted in a non-confirmatory, hypothesis-testing sense.

  • Personal recovery assessed with the Brief INSPIRE-O

  • Changes in patient-elected problems assessed with the Psychological Outcome Profiles (PSYCHLOPS)

  • Reactions to anger assessed with the Dimensions of Anger Reactions—Revised (DAR)

  • Anger processing assessed with the Metacognitive Anger Processing scale, Short Version (MAP-9)

  • Repetitive negative thinking assessed with the Perseverative Thinking Questionnaire (PTQ)

  • Anxiety symptoms assessed with the anxiety subscale of the Symptom Checklist-10 (SCL-10)

  • Response and remission of depressive symptoms (as defined in the study protocol [21])

Raw outcome scores will be reported, and change from baseline will be calculated where baseline measurements are available. Follow-up measurements at 36 months after randomization are planned conditional on additional funding. Outcomes from the 24- and 36-month time points, including the health economic evaluation based on the EQ-5D-5L, will be reported in separate publications.

Safety

The proportion of participants with one or more serious adverse events will be reported for each treatment group. A serious adverse event will be defined according to the International Conference on Harmonisation of technical requirements for registration of pharmaceuticals for human use—Good Clinical Practice (ICH-GCP): any untoward medical occurrence that resulted in death, was life-threatening, required hospitalization or prolonging of existing hospitalization, and resulted in persistent or significant disability or jeopardized the participant [37]. We will collect data from the national Danish registries regarding the possible death of participants. The results of the NEQ (see above under “Secondary outcomes”) will also be analyzed to show the participants’ subjectively experienced negative consequences of treatment.

Statistical analysis

All analyses will be done by data analysists who are blinded to treatment condition of the two groups, that is, treatment arms will be labelled “Treatment A” and “Treatment B.” Two different abstracts for the outcome article with opposing results of the two treatments and resulting conclusions will be written before unbreaking the blind, as described in the study protocol [21].

Interim analyses

Interim analyses are planned to take place when 50% of the 6-month follow-up data have been collected. An external data safety monitoring committee (DSMC) with no allegiance to the trialists will perform the interim analyses and decide whether to stop or continue the trial based on convincing benefit or harm of the interventions. The DSMC will receive blinded data but can be unblinded upon request. Focus will be on data from the 6-month follow-up regarding serious adverse events, significant differences between groups on the primary outcome, and serious deterioration on the primary outcome.

The following statistical analyses are proposed, but the DSMC can perform other analyses and require more data if deemed necessary:

  • Inferiority: A t-test for independent groups will be used to test for differences between study arms. If this test is significant at the one-tailed 2.5% and indicates that ST is inferior, the DSMC will reassess the costs and benefits of the trial and consider early stopping.

  • Serious adverse events: A chi-square test will be used to test for differences between groups in the number of participants with one or more serious adverse events (emergency room visits for somatic or psychiatric reasons, documented suicide attempts or self-harm). If this test is significant at the one-tailed 2.5% and indicates a substantial increase in adverse events under schema therapy, the DSMC will reassess the costs and benefits of the trial and consider early stopping. In case of low event counts, Barnard’s test will be used to avoid poor asymptotic behavior of the chi-square test.

  • Serious deterioration: A chi-square test will be used to test for differences between groups in the number of participants with an increase of 2 or more points on the primary outcome (HAMD-6). If this test is significant at the one-tailed 2.5% and indicates serious deterioration under schema therapy, the DSMC will reassess the costs and benefits of the trial and consider early stopping. In case of low event counts, Barnard’s test will be used to avoid poor asymptotic behavior of the chi-square test.

All interim analyses will be performed on the available data without imputation of missing data. An adjustment of the significance level is not needed, since a stop for efficacy is not planned.

Primary analysis

The primary analysis will be based on the intention-to-treat principle (i.e., including all randomized patients who will be analyzed according to their initial randomization, regardless of the actual treatment received), using conservative, multiple imputation of missing outcomes by jump to reference [38]. The effect of the treatment will be presented as the covariate-adjusted difference between the average change scores in the two arms, with its two-tailed 95% confidence interval. In other words, superiority of ST will be claimed if the change in ST is significantly better than the change in TAU, at a two-tailed significance level of 5%. Analyses will be performed when all 12-month assessments have been completed.

The HAMD-6 score is assumed to be an interval-scaled variable, with normally distributed residuals. The therapies will be compared using a multilevel linear regression with therapy (ST vs. TAU) as the main effect, adjusted for treatment site (categorical, 6 levels) and the HAMD-6 score at baseline as covariates, and therapist as a random intercept. Efficacy of ST vs. TAU will be expressed by the covariate-adjusted difference between the change scores of the experimental versus the control intervention, along with its two-tailed 95% confidence interval. Superiority of ST will be claimed if this confidence interval is above zero. After adjustment for baseline severity, this estimate is numerically identical to the comparison of the raw scores between the treatment arms at end-of-therapy [39]. Secondary outcomes will be analyzed in the same way, with the same 95% confidence intervals. If the respective endpoint is not available in the baseline assessment by design (e.g., NEQ), the baseline HAMD-6 will be used as a baseline covariate, and effect estimates will be based on raw scores instead of change scores.

Missing outcomes will be multiply imputed by a ‘jump to the reference treatment,’ that is, we assume that patients with missing data in ST score similar to patients in TAU. This analysis method is implemented in the R package RefBasedMI [40, 41]. Box 1 (Additional file 3) shows the R code for the analysis using RefBasedMI’s own example data. Secondary outcomes will be analyzed in the same manner.

Sensitivity analyses

To assess the impact of the imputation procedure, we will run a number of sensitivity analyses for the primary outcome (see Boxes 2 and 3, Additional file 3): A first sensitivity analysis will be based under the usual assumption of missingness-at-random. This supersedes the initially planned regression imputation with dropout status and secondary outcomes in the imputation model [21]. The reason for this change of strategy is that primary and secondary outcomes are collected at the same session. Therefore, practically speaking, missing data in secondary outcomes co-occur with gaps in the primary outcome, so imputation by secondary outcomes will not help much—it will mostly mirror the results of the analysis with available cases [42].

A second sensitivity analysis will be carried out for the subset of per-protocol participants with available outcome data, without imputation of missing outcomes. Further sensitivity analyses will be made with best-best and worst-worst case imputation of missing outcomes, and without adjustment for covariates [39].

To rule out bias due to ceiling or floor effects in the outcome, a final sensitivity analysis will be carried out using negative binomial regression, with HAMD-6 treated as count data (i.e., a score of 10 points will be counted as ten “events”) [43]. Results of this analysis will be presented as rate ratios for therapy type, adjusted for baseline severity and treatment site.

Subgroup analyses

Subgroup analyses are performed with the purpose of assessing homogeneity of the treatment effects and further to reveal potentially harmful effects in specific subpopulations of patients. Subgroup effects will be investigated by adding treatment-by-covariate interactions to the primary analysis model and by illustrating the subgroup-specific therapy effects in forest plots (see Box 4, Additional file 3, for example code).

Patients with childhood adversity and corresponding early maladaptive schema may benefit more from ST than patients with less severe adverse experiences in childhood [44]. This will be analyzed by adding the interaction of therapy x adversity (CTQ, dichotomized as present/not present of moderate to severe childhood adversity on one or more of the five domains: physical, emotional or sexual abuse, emotional or psychical neglect, with cutoffs as suggested by Bernstein et al. [45]) to the statistical model of the primary analysis, and by estimating stratum-specific therapy effects in participants with and without adversity experiences.

Further, we will explore therapy x site interactions, as recommended in the International Conference on Harmonisation E9 guideline [46]. Additional subgroup analyses will be made for depression symptoms at baseline, sex (male, female), and psychiatric comorbidities as measured in the M.I.N.I. interview (present; not present). It is understood that such subgroup analyses are exploratory, subject to lack of statistical power and, at the same time, multiple testing issues. Any observed subgroup effect will be interpreted in the light of its psychological plausibility, weighed against statistical artifacts such as regression to the mean (e.g., treatment x severity interactions). Observed effects in the primary outcome will be compared to the corresponding results in the relevant secondary outcomes.

Health economic analysis

The health economic analysis investigates the health-related quality of life as measured on the EQ-5D-5L and has its primary timepoint at 24 months after randomization. The EQ-5D-5L instrument includes five dimensions (mobility, self-care, usual activities, pain/discomfort, and anxiety/depression), each with five levels of severity. EQ-5D-5L responses will be converted into Quality-Adjusted Life Years (QALYs) using Danish utility weights [47].

Cost data will include direct health care costs and lost productivity costs. Direct health care costs will include costs of primary and secondary health care services and prescription medication. Unit costs of General Practitioner services and other medical specialists will be based on the prevailing National Health Insurance fee schedules [48]. Direct costs, including in-patient and out-patient health care costs, will be calculated using Diagnostic Related Groups (DRG) weights and specific out-patient tariffs. These cost estimates are all based on data from the Danish Ministry of Health and reflect the average cost of treating patients with similar conditions in a Danish hospital [49, 50]. The participants’ use of prescription medication and information on retail price on each transaction will be derived from the Register of Medical Product Statistics [51]. Lost productivity costs defined as difference in earned income between the groups and data on transfer payments will be derived from The Danish National Labour Market Authority’s DREAM database [52]. The incremental cost-effectiveness ratio (ICER) will be estimating the difference between intervention and control group in QALYs/costs [53]. Non-parametric bootstrapping will be used to estimate confidence intervals for the ICER. Results will be presented in the cost-effectiveness plane, and a cost-effectiveness acceptability curve will illustrate the probability that the intervention is cost-effective compared to an alternative, given different values of willingness to pay for a unit of health benefit [54].

Predictors

It is hypothesized that the level of depression treatment resistance (measured by the MSM [23]), childhood adversity (assessed on the CTQ [45, 55]), and expectancies for depressive symptom change (assessed on the pessimistically worded statements in the Depression Change Expectancy Scale [56, 57]) will predict the effect of treatment on depression symptoms in both treatment groups (for description of the assessment tools, please see study protocol [21]. This will be investigated by adding a covariate for MSM to the primary analysis model, as well as an interaction term for Therapy:MSM (see Box 5, Additional file 3). Due to its exploratory nature, this analysis will be based on the per-protocol population, without imputation of missing data.

Mediators

ST is theorized to work through changing early maladaptive schemas; however, recent work has focused more on coping with schemas through working with schema modes [16]. We hypothesize that ST will increase the Healthy Adult Mode and minimize the Vulnerable Child Mode and the Demanding/Critical and Punitive authority modes (measured on the SMI; see Fig. 2). An exploratory mediation analysis will be performed to investigate the relative contribution by therapy type of changes in the four mentioned schema modes on depression symptoms as measured by the HAMD-6.

Fig. 2.

Fig. 2

Proposed mediation model

HA healthy adult, VC vulnerable child, AM authority modes (demandingand critical authority modes pooled)

The exposure is, thus, the binary variable, therapy, with levels ST and TAU. The mediators investigated individually are schema modes Vulnerable Child, Punitive authority and Demanding/Critical authority (the two authority modes pooled) for which we expect lower scores under ST as compared to TAU, and Healthy Adult mode for which we expect higher scores under ST as compared to TAU. The outcome is HAMD-6, thus the same as the primary outcome of the main analysis. Due to its exploratory nature, this analysis will be based on the per protocol population, without imputation of missing data. The code is shown in Box 6 (Additional file 3).

Details of statistical packages to be used to carry out the analyses

For all analyses except for the health economic analyses, R version 4.5.1 will be used [58], with package RefBasedMI for the imputation of continuous missing data [59, 60], and bootImpute [61] and dejaVu [62] for the sensitivity analysis. Lavaan will be used for the mediation analysis [63]. Stata version 18 will be used for the health economic analyses. [64].

Discussion

The purpose of this paper is to present a detailed, predefined statistical analysis plan in order to minimize bias due to selective selection, analyses, and reporting of outcomes. Important strengths and limitations of the trial and its methods have been presented in the trial protocol [21]. We will now add a few points regarding the strengths and limitations of the analytical and statistical methods.

Strengths

Methodological strengths for this trial include the following: Having predefined a statistical analysis plan minimizes the potential for data-driven and/or selective outcome reporting. Further, analyses are made according to the intention-to-treat principle, making up for potential bias due to patient characteristics that increase drop-out. We assume normal distribution of the data and specific mechanisms of missingness and dropout. A number of sensitivity analyses have been planned to assess the robustness of the results with regard to these assumptions, including an analysis that treats the primary outcome as an event count.

Limitations

While power calculations have been made to secure adequate statistical power for primary and secondary outcomes, these calculations were based on treatment effects and their standard deviations from population and interventions that were not exactly identical to those of the current trial [2628, 35, 65]. This might create inaccuracies in power estimations, potentially skewing the estimation of statistical significance differences between treatment groups. Also, subgroup analyses and exploratory analyses are planned, which are probably underpowered. At the same time, there is a Type 1 error inflation with each additional significance test. This will be taken into account when interpreting the results. Further, while the secondary outcomes are planned to exhibit adequate power, we are not adjusting secondary outcome results for multiple testing, affecting the certainty of findings. This means that conclusions and recommendations based on the results of this trial are made based on the primary outcome only. Finally, given the vulnerable state of the trial population, a substantial amount of missing data can be expected. While this will be handled appropriately as described above, missing data in clinical trials can create validity problems that cannot be corrected for [66]. This will be considered an important potential limitation when reporting the results of our trial.

Supplementary Information

13063_2025_9012_MOESM1_ESM.docx (26.9KB, docx)

Additional file 1: Statistical Analysis Plan (SAP) Checklist v 1.0 2019

13063_2025_9012_MOESM2_ESM.docx (21KB, docx)

Additional file 2: Table reporting of baseline characteristics

13063_2025_9012_MOESM3_ESM.docx (20.5KB, docx)

Additional file 3: Code for analyses to be used in R, version 4.5.1

Additional file 4.

Acknowledgements

Not applicable.

Statistical Analysis Plan (SAP) version 2, 28/7/2025

This SAP is consistent with the study protocol: Arendt IMTP, Gondan M, Juul S, Bering L, Moeller SB et al. Schema therapy versus treatment as usual for outpatients with difficult-to-treat depression: study protocol for a parallel group randomized clinical trial (DEPRE-ST). Trials. 2024;25(1):266. Published 2024 Apr 16. doi: 10.1186/s13063-024-08079-9

The SAP was prepared according to the guidelines by Gamble et al. [67]. A SAP checklist is found in Additional File 1.

Current status of the trial

At the submission of this manuscript, the trial has included 83 participants. The inclusion of participants is expected to conclude in June 2027, and the planned 24-month follow-up assessments will then be conducted in June 2029.

Abbreviations

CTQ

Childhood Trauma Questionnaire

DAR

Dimensions of Anger Reactions Scale

DCES-P

Depression Change Expectancy Scale–Pessimistic scale

DSM

Diagnostic and Statistical Manual of Mental Disorders

DSMC

Data Safety Monitoring Committee

DRG

Diagnostic related group

EQ-5D-5L

European Quality of Life 5 Dimensions 5 Level Version

HAMD-6

Hamilton Rating Scale for Depression–6-item version

HA

Healthy adult

ICD-10

International Classification of Diseases, 10th Revision

ICER

Incremental cost-effectiveness ratio

ICH-GCP

International Conference on HarmonisationGood Clinical Practice

INSPIRE-O

Brief INSPIRE for personal recovery (Outcome version)

MAP-9

Metacognitive Anger Processing Scale (9 items)

MCID

Minimal clinically important difference

MDD

Major depressive disorder

M.I.N.I.

Mini International Neuropsychiatric Interview

MI

Multiple imputation

MNAR

Missing not at random

MSM

Maudsley Staging Model

NEQ

Negative Effects Questionnaire

PSYCHLOPS

Psychological Outcome Profiles

PTQ

Perseverative Thinking Questionnaire

QALY

Quality-adjusted life year

RCT

Randomized controlled trial

REDCap

Research Electronic Data Capture

SAP

Statistical analysis plan

SCL-10

Symptom Checklist-10

SD

Standard deviation

SMI 1.1

Schema Mode Inventory version 1.1 (full version)

ST

Schema therapy

TAU

Treatment as usual

VC

Vulnerable child

WHO5

WHO-5 Well-Being Index

WSAS

Work and Social Adjustment Scale

YSQ S.3

Young Schema Questionnaire Short form, version 3

Authors’ contributions

IMTPA is the chief investigator and project manager on the project and wrote the Statistical Analysis Plan with significant contributions on content from all other authors. SBM is the project owner. MG is the senior statistician on the project and wrote the sections on statistical analyses. LB wrote the section on health economic analyses. CH and SJ consulted on the statistical analyses. All authors approved the content of the manuscript.

Funding

Open access funding provided by University of Southern Denmark The project is funded by the Danish, private foundation Trygfonden after an external, international peer review process, and by the Region of Southern Denmark’s PhD-fund. Further private and public foundations will be approached to cover the remainder of the costs for the project.

Expenses include salaries for the PhD/post doc, research and student assistants, as well as expenses for training and supervision of therapists. Further, it covers a refund to the participating clinics for additional clinician hours spent on providing schema therapy, as well as 5% overhead, PhD tuition fee, and minor expenses for IT, conferences, etc.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Declarations

Ethics approval and consent to participate

The trial has been approved by the Southern Region of Denmark Research Ethics Committee; Project ID S-20220033 for compliance with applicable research and human subjects regulation. Written consent will be obtained from all participants before inclusion.

Consent for publication

Not applicable.

Competing interests

SBM is a practicing schema therapist and a member of the International Society for Schema Therapy (ISST). IMTPA is an affiliated member of the ISST. All other authors declare that they have no competing interests.

The open access publication fee was covered by the Region of Southern Denmark Psychiatry’s publication fund.

No members of the project management or project steering group have financial connections to Trygfonden or to the board of Region of Southern Denmark’s PhD-fund. The funding bodies have no influence on the design of the trial, nor the collection, analysis, or interpretation of the data.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Cuijpers P, Miguel C, Harrer M, Plessen CY, Ciharova M, Papola D, et al. Psychological treatment of depression: a systematic overview of a “Meta-Analytic Research Domain.” J Affect Disord. 2023;335:141–51. [DOI] [PubMed] [Google Scholar]
  • 2.Jobst A, Brakemeier EL, Buchheim A, Caspar F, Cuijpers P, Ebmeier KP, et al. European Psychiatric Association guidance on psychotherapy in chronic depression across Europe. Eur Psychiatry. 2016;33(1):18–36. [DOI] [PubMed] [Google Scholar]
  • 3.McAllister-Williams RH, Arango C, Blier P, Demyttenaere K, Falkai P, Gorwood P, et al. The identification, assessment and management of difficult-to-treat depression: an international consensus statement. J Affect Disord. 2020;267:264–82. [DOI] [PubMed] [Google Scholar]
  • 4.Ladegaard N, Buchholtz PE, Christensen EM, Davidsen AS, Dolmer IB, Hansen TL, et al. National klinisk retningslinje for non-farmakologisk behandling af unipolar depression. Copenhagen: Sundhedsstyrelsen; 2016. Available at https://www.sst.dk/-/media/Udgivelser/2016/NKRdepression/National-klinisk-retningslinje-non-fatmakologiskbehandling-af-unipolar-depression.ashx?sc_lang=da&hash=DABD16860F0E26BBC4C869425EEE8E28. Accessed 15Aug 2025.
  • 5.Moeller SB, Gbyl K, Hjorthøj C, Andreasen M, Austin SF, Buchholtz PE, et al. Treatment of difficult-to-treat depression – clinical guideline for selected interventions. Nord J Psychiatry. 2022;76(3):177–88. [DOI] [PubMed] [Google Scholar]
  • 6.Markowitz JC, Wright JH, Peeters F, Thase ME, Kocsis JH, Sudak DM. The neglected role of psychotherapy for treatment-resistant depression. Am J Psychiatry. 2022;179(2):90–3. [DOI] [PubMed] [Google Scholar]
  • 7.Klein DN, Kotov R. Course of depression in a 10-year prospective study: evidence for qualitatively distinct subgroups. J Abnorm Psychol. 2016;125(3):337–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Köhler S, Chrysanthou S, Guhn A, Sterzer P. Differences between chronic and nonchronic depression: systematic review and implications for treatment. Depress Anxiety. 2019;36(1):18–30. [DOI] [PubMed] [Google Scholar]
  • 9.Nelson J, Klumparendt A, Doebler P, Ehring T. Childhood maltreatment and characteristics of adult depression: meta-analysis. Br J Psychiatry. 2017;210(2):96–104. [DOI] [PubMed] [Google Scholar]
  • 10.Hölzel L, Härter M, Reese C, Kriston L. Risk factors for chronic depression — a systematic review. J Affect Disord. 2011;129(1):1–13. [DOI] [PubMed] [Google Scholar]
  • 11.Barnhofer T, Brennan K, Crane C, Duggan D, Williams JM. A comparison of vulnerability factors in patients with persistent and remitting lifetime symptom course of depression. J Affect Disord. 2014;152–154:155–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Riso LP, Miyatake RK, Thase ME. The search for determinants of chronic depression: a review of six factors. J Affect Disord. 2002;70(2):103–15. [DOI] [PubMed] [Google Scholar]
  • 13.Negele A, Kaufhold J, Kallenbach L, Leuzinger-Bohleber M. Childhood trauma and its relation to chronic depression in adulthood. Depress Res Treat. 2015;2015: 650804. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Nanni V, Uher R, Danese A. Childhood maltreatment predicts unfavorable course of illness and treatment outcome in depression: a meta-analysis. Am J Psychiatry. 2012;169(2):141–51. [DOI] [PubMed] [Google Scholar]
  • 15.Renner F, Arntz A, Leeuw I, Huibers M. Treatment for chronic depression using schema therapy. Clin Psychol Sci Pract. 2013;20(2):166–80. [Google Scholar]
  • 16.Arntz A, Jacob G. Schema therapy in practice: an introductory guide to the schema mode approach. 1st Ed. Hoboken, New Jersey: Wiley Blackwell; 2013.
  • 17.Renner F, Arntz A, Peeters FPML, Lobbestael J, Huibers MJH. Schema therapy for chronic depression: results of a multiple single case series. J Behav Ther Exp Psychiatry. 2016;51:66–73. [DOI] [PubMed] [Google Scholar]
  • 18.Malogiannis IA, Arntz A, Spyropoulou A, Tsartsara E, Aggeli A, Karveli S, et al. Schema therapy for patients with chronic depression: a single case series study. J Behav Ther Exp Psychiatry. 2014;45(3):319–29. [DOI] [PubMed] [Google Scholar]
  • 19.Kool M, Van H, Arntz A, Bartak A, Peen J, Dil L, et al. Dosage effects of psychodynamic and schema therapy in people with comorbid depression and personality disorder: four-arm pragmatic randomised controlled trial. Br J Psychiatry. 2024;225(1):274–81. [DOI] [PubMed] [Google Scholar]
  • 20.Kopf-Beck J, Müller CL, Tamm J, Fietz J, Rek N, Just L, et al. Effectiveness of schema therapy versus cognitive behavioral therapy versus supportive therapy for depression in inpatient and day clinic settings: a randomized clinical trial. Psychother Psychosom. 2024;93(1):24–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Arendt ITP, Gondan M, Juul S, Hastrup LH, Hjorthøj C, Bach B, et al. Schema therapy versus treatment as usual for outpatients with difficult-to-treat depression: study protocol for a parallel group randomized clinical trial (DEPRE-ST). Trials. 2024;25(1):266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, et al. The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry. 1998;59 Suppl 20:22–33;quiz 4–57. [PubMed]
  • 23.Fekadu A, Wooderson S, Donaldson C, Markopoulou K, Masterson B, Poon L, et al. A multidimensional tool to quantify treatment resistance in depression: the Maudsley staging method. J Clin Psychiatry. 2009;70(2):177–84. [DOI] [PubMed] [Google Scholar]
  • 24.Young J, Fosse G. Schema therapy rating scale. New York: Schema Therapy Institute; 2005. [Google Scholar]
  • 25.Jakobsen JC, Ovesen C, Winkel P, Hilden J, Gluud C, Wetterslev J. Power estimations for non-primary outcomes in randomised clinical trials. BMJ Open. 2019;9(6): e027092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Topp CW, Østergaard SD, Søndergaard S, Bech P. The WHO-5 well-being index: a systematic review of the literature. Psychother Psychosom. 2015;84(3):167–76. [DOI] [PubMed] [Google Scholar]
  • 27.Zahra D, Adam Q, William H, Rod T, Cath Q, Jill P, et al. The work and social adjustment scale: reliability, sensitivity and value. Int J Psychiatry Clin Pract. 2014;18(2):131–8. [DOI] [PubMed] [Google Scholar]
  • 28.John JR, Tannous WK, Jones A. Changes in health-related quality of life before and after a 12-month enhanced primary care model among chronically ill primary care patients in Australia. Health Qual Life Outcomes. 2020;18(1): 288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Rozental A, Kottorp A, Forsström D, Månsson K, Boettcher J, Andersson G, et al. The negative effects questionnaire: psychometric properties of an instrument for assessing negative effects in psychological treatments. Behav Cogn Psychother. 2019;47(5):559–72. [DOI] [PubMed] [Google Scholar]
  • 30.Ashworth M, Evans C, Clement S. Measuring psychological outcomes after cognitive behaviour therapy in primary care: a comparison between a new patient-generated measure “PSYCHLOPS” (Psychological Outcome Profiles) and “HADS” (Hospital Anxiety and Depression Scale). J Ment Health. 2009;18(2):169–77. [Google Scholar]
  • 31.Moeller SB, Larsen PV, Arendt I-MTP, Ehring T, Reinholt N, Hvenegaard M, et al. Validation of the Danish version of Perseverative Thinking Questionnaire (PTQ) – introducing the PTQ short version. Psychological Test Adaptation and Development. 2023;4(1):310–8. [Google Scholar]
  • 32.Forbes D, Alkemade N, Mitchell D, Elhai JD, McHugh T, Bates G, et al. Utility of the dimensions of anger reactions-5 (DAR-5) scale as a brief anger measure. Depress Anxiety. 2014;31(2):166–73. [DOI] [PubMed] [Google Scholar]
  • 33.Simon GE, Ralston JD, Savarino J, Pabiniak C, Wentzel C, Operskalski BH. Randomized trial of depression follow-up care by online messaging. J Gen Intern Med. 2011;26(7):698–704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Moeller SB, Larsen PV, Austin S, Slade M, Arendt IMTP, Andersen MS, et al. Scalability, test–retest reliability and validity of the brief INSPIRE-O measure of personal recovery in psychiatric services. Front Psychiatry. 2024. 10.3389/fpsyt.2024.1327020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Licht RW, Qvitzau S, Allerup P, Bech P. Validation of the Bech-Rafaelsen melancholia scale and the Hamilton depression scale in patients with major depression; is the total score a valid measure of illness severity? Acta Psychiatr Scand. 2005;111(2):144–9. [DOI] [PubMed] [Google Scholar]
  • 36.Cuijpers P. Measuring success in the treatment of depression: what is most important to patients? Expert Rev Neurother. 2020;20(2):123–5. [DOI] [PubMed] [Google Scholar]
  • 37.International Conference on Harmonisation of technical requirements for registration of pharmaceuticals for human use—Good Clinical Practice (ICH-GCP). ICH harmonised guideline integrated addendum to ICH E6(R1): Guideline for Good Clinical Practice ICH E6(R2). 2015. [Available from: https://ichgcp.net/da Accessed on 3/10/2024).
  • 38.Wolbers M, Noci A, Delmar P, Gower-Page C, Yiu S, Bartlett JW. Standard and reference-based conditional mean imputation. Pharm Stat. 2022;21(6):1246–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Committee for Medicinal Products for Human Use. Guideline on adjustment for baseline covariates in clinical trials [Internet]. London/Amsterdam: European Medicines Agency; 2015 [cited 2025 Aug 15]. Available from: https://www.ema.europa.eu/en/documents/scientific-guideline/guideline-adjustment-baseline-covariates-clinical-trials_en.pdf. Accessed 15 Aug 2025.
  • 40.McGrath K. RefBasedMI: reference-based imputation for longitudinal clinical trials with protocol deviation. 10.32614/CRAN.package.RefBasedMI Accessed on 16th June 2025 2024 
  • 41.Carpenter J, Roger J, Kenward M. Analysis of longitudinal trials with protocol deviation: a framework for relevant, accessible assumptions, and inference via multiple imputation. J Biopharm Stat. 2013;23(6):1352–71. 10.1080/10543406.2013.834911. [DOI] [PubMed] [Google Scholar]
  • 42.Higgins J TJ, Chandler J, Cumpston M, Li T, Page M, et al. Cochrane Handbook for Systematic Reviews of Interventions version 6.5 (updated August 2024) Cochrane: Cochrane; 2024 [Available from:https://www.cochrane.org/authors/handbooks-and-manuals/handbook] Accessed 14 Aug 2025.
  • 43.Bartlett JW. Reference-based multiple imputation—what is the right variance and how to estimate it. Stat Biopharm Res. 2023;15(1):178–86. [Google Scholar]
  • 44.Carter JD, McIntosh VV, Jordan J, Porter RJ, Douglas K, Frampton CM, et al. Patient predictors of response to cognitive behaviour therapy and schema therapy for depression. Aust N Z J Psychiatry. 2018;52(9):887–97. [DOI] [PubMed] [Google Scholar]
  • 45.Bernstein DP, Fink L. Childhood Trauma Questionnaire: A retrospective self-report manual. San Antonio (TX): The Psychological Corporation, Harcourt Brace & Company; 1998.
  • 46.International Conference on Harmonisation. Topic E9: Statistical principles for clinical trials (CPMP/ICH/363/96) [Internet]. London (UK): European Medicines Agency; 1998. Available from: https://www.ema.europa.eu/en/documents/scientificguideline/ich-e9-statistical-principles-clinical-trials-step-5_en.pdf. Accessed 15 Aug 2025.
  • 47.Jensen MB, Jensen CE, Gudex C, Pedersen KM, Sørensen SS, Ehlers LH. Danish population health measured by the EQ-5D-5L. Scand J Public Health. 2023;51(2):241–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Sahl Andersen J, De Fine Olivarius N, Krasnik A. The Danish National Health Service Register. Scand J Public Health. 2011;39(7_suppl):34–7. [DOI] [PubMed] [Google Scholar]
  • 49.Schmidt M, Johannesdottir SSA, Lynge SJ, Vera E, Lars P, Sørensen HT. The Danish National Patient Registry: a review of content, data quality, and research potential. Clin0 Epidemiol. 2015;7:449–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Lynge E, Sandegaard JL, Rebolj M. The Danish National Patient Register. Scand J Public Health. 2011;39(7 Suppl):30–3. [DOI] [PubMed] [Google Scholar]
  • 51.Kildemoes HW, Sørensen HT, Hallas J. The Danish National Prescription Registry. Scand J Public Health. 2011;39(7 Suppl):38–41. [DOI] [PubMed] [Google Scholar]
  • 52.Hjollund NH, Larsen FB, Andersen JH. Register-based follow-up of social benefits and other transfer payments: accuracy and degree of completeness in a Danish interdepartmental administrative database compared with a population-based survey. Scand J Public Health. 2007;35(5):497–502. [DOI] [PubMed] [Google Scholar]
  • 53.Drummond MF, Sculpher MJ, Claxton K, Stoddart GL, Torrance GW. Methods for the economic evaluation of health care programmes. 4th ed. Oxford: Oxford University Press; 2015.
  • 54.Fenwick E, Byford S. A guide to cost-effectiveness acceptability curves. Br J Psychiatry. 2005;187:106–8. [DOI] [PubMed] [Google Scholar]
  • 55.Bernstein DP, Stein JA, Newcomb MD, Walker E, Pogge D, Ahluvalia T, et al. Development and validation of a brief screening version of the childhood trauma questionnaire. Child Abuse Negl. 2003;27(2):169–90. [DOI] [PubMed] [Google Scholar]
  • 56.Eddington KM, Dozois DJ, Backs-Dermott BJ. Evaluation of the internal consistency, factor structure, and validity of the depression change expectancy scale. Assessment. 2014;21(5):607–17. [DOI] [PubMed] [Google Scholar]
  • 57.Dozois DJ, Westra HA. Development of the Anxiety Change Expectancy Scale (ACES) and validation in college, community, and clinical samples. Behav Res Ther. 2005;43(12):1655–72. [DOI] [PubMed] [Google Scholar]
  • 58.R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2024. [Google Scholar]
  • 59.Gower-Page C, Nosi A, Wolbers M. rbmi: a R package for standard and reference-based multiple imputation methods. J Open Source Softw. 2022;7(74):4251. [Google Scholar]
  • 60.Gower-Page C, Noci A, Gravestock I. rbmi: reference based multiple imputation. R package version 1.3.1. https://insightsengineering.github.io/rbmi/latest-tag/. Accessed 14 Aug 2025.
  • 61.Bartlett JW. bootImpute: bootstrap inference for multiple imputation. R package version 1.2.2. https://cran.rproject.org/web/packages/bootImpute/index.html (2025). Accessed 14 Aug 2025.
  • 62.Burkoff N MP, Bartlett J, Ruau D. _dejaVu: multiple imputation for recurrent events_. R package version 0.3.1,. https://cran.rproject.org/web/packages/dejaVu/dejaVu.pdf. Accessed 14 Aug 2025.
  • 63.Rosseel Y. lavaan: an R package for structural equation modeling. J Stat Softw. 2012;48(2):1–36. [Google Scholar]
  • 64.StataCorp. Stata statistical software: release 18. College Station (TX): StataCorp LLC; 2023.
  • 65.Østergaard SD, Bech P, Miskowiak KW. Fewer study participants needed to demonstrate superior antidepressant efficacy when using the Hamilton melancholia subscale (HAM-D6) as outcome measure. J Affect Disord. 2016;190:842–5. [DOI] [PubMed] [Google Scholar]
  • 66.Juul S, Faltermeier P, Petersen JJ, Olsen MH, Andersen RK, Kamp CB, et al. Missing outcome data in randomised clinical trials of psychological interventions: a review of published trial reports in major psychiatry journals. BMC Psychiatry. 2024;24(1): 798. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Gamble C, Krishan A, Stocken D, Lewis S, Juszczak E, Doré C, et al. Guidelines for the content of statistical analysis plans in clinical trials. JAMA. 2017;318(23):2337–43. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

13063_2025_9012_MOESM1_ESM.docx (26.9KB, docx)

Additional file 1: Statistical Analysis Plan (SAP) Checklist v 1.0 2019

13063_2025_9012_MOESM2_ESM.docx (21KB, docx)

Additional file 2: Table reporting of baseline characteristics

13063_2025_9012_MOESM3_ESM.docx (20.5KB, docx)

Additional file 3: Code for analyses to be used in R, version 4.5.1

Additional file 4.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.


Articles from Trials are provided here courtesy of BMC

RESOURCES