Skip to main content
Wellcome Open Research logoLink to Wellcome Open Research
. 2020 Apr 1;4:69. Originally published 2019 Apr 16. [Version 3] doi: 10.12688/wellcomeopenres.15225.3

What factors indicate prognosis for adults with depression in primary care? A protocol for meta-analyses of individual patient data using the Dep-GP database

Joshua EJ Buckman 1,a, Rob Saunders 1, Zachary D Cohen 2, Katherine Clarke 1, Gareth Ambler 3, Robert J DeRubeis 4, Simon Gilbody 5, Steven D Hollon 6, Tony Kendrick 7, Edward Watkins 8, Ian R White 9, Glyn Lewis 10, Stephen Pilling 1
PMCID: PMC6880263  PMID: 31815189

Version Changes

Updated. Changes from Version 2

We have changed the parameters for our considerations of between study heterogeneity in our meta-analyses, in order to make them more stringent. This has resulted in a change from conducting sensitivity analyses if "heterogeneity between the studies is considered problematic e.g. with I 2 above 80% or where the effect in one study appears to be considerably different from that of all other studies after inspecting the forest plot", to instead conducting sensitivity analyses if "heterogeneity between the studies were considerable based on guidance from the Cochrane Collaboration e.g. with I 2 above 75% or where the effect in one study appears to be considerably different from that of all other studies after inspecting the forest plot ( Higgins et al., 2003), sensitivity analyses will be performed removing studies contributing most to the heterogeneity from the meta-analyses to consider their impact on the summary statistics. If the same variables were found to have considerable amounts of heterogeneity when analysed in each of the four models above, sensitivity analyses would be conducted for the model controlling for the most other variables, e.g. symptom severity and covariates (model 3) only. In addition, for variables in the final model(s), sensitivity analyses were similarly planned where the threshold for substantial heterogeneity was met (I 2 above 50%) ( Higgins et al., 2003)."

Abstract

Background: Pre-treatment severity is a key indicator of prognosis for those with depression. Knowledge is limited on how best to encompass severity of disorders. A number of non-severity related factors such as social support and life events are also indicators of prognosis. It is not clear whether this holds true after adjusting for pre-treatment severity as a) a depressive symptom scale score, and b) a broader construct encompassing symptom severity and related indicators: “disorder severity”. In order to investigate this, data from the individual participants of clinical trials which have measured a breadth of “disorder severity” related factors are needed.

Aims: 1) To assess the association between outcomes for adults seeking treatment for depression and the severity of depression pre-treatment, considered both as i) depressive symptom severity only and ii) “disorder severity” which includes depressive symptom severity and comorbid anxiety, chronicity, history of depression, history of previous treatment, functional impairment and health-related quality of life.

2) To determine whether i) social support, ii) life events, iii) alcohol misuse, and iv) demographic factors (sex, age, ethnicity, marital status, employment status, level of educational attainment, and financial wellbeing) are prognostic indicators of outcomes, independent of baseline “disorder severity” and the type of treatment received.

Methods: Databases were searched for randomised clinical trials (RCTs) that recruited adults seeking treatment for depression from their general practitioners and used the same diagnostic and screening instrument to measure severity at baseline – the Revised Clinical Interview Schedule; outcome measures could differ between studies. Chief investigators of all studies meeting inclusion criteria were contacted and individual patient data (IPD) were requested.

Conclusions: In total 15 RCTs met inclusion criteria. The Dep-GP database will include the 6271 participants from the 13 studies that provided IPD. This protocol outlines how these data will be analysed.

Registration: PROSPERO CRD42019129512 (01/04/2019)

Keywords: Depression, Prognosis, Primary Health Care, Meta-analysis, Individual Patient Data, Systematic Review, Protocol

Introduction

One in 20 adults across the globe will experience an episode of major depression every year ( Thornicroft et al., 2017), most of whom will not receive any treatment ( Olfson et al., 2016; Thornicroft et al., 2017). For those that do get treatment the majority will either not reach remission or it will take a number of trials of different treatments before they do ( Kessler, 2018; Rush et al., 2006). Not reaching full remission is one of the strongest predictors of relapse and recurrence ( Buckman et al., 2018). There is a lack of knowledge of prognosis independent of treatment and within different types of treatment, and therefore a lack of evidence with which to make informed choices of whether any active treatment should be trialled, or which type of treatment to trial at any given point, for any given individual (e.g. Cohen & DeRubeis, 2018).

In order to reduce the burden of depression it is imperative that we understand more about the response to treatments, and their limits, to better consider the risk for poor prognostic outcomes. One major focus has been on the effect of baseline severity on outcomes. That severity is related to outcome holds with the ‘common-sense’ view of most illnesses, depression included, but as recommended by Leucht and colleagues ( Leucht et al., 2015) the consideration of prognosis needs to account for more than just the number or intensity of depressive symptoms. Studies considering the role of pre-treatment depressive severity have typically been limited to group level analyses (e.g. Kirsch et al., 2008) so have been unable to consider severity beyond a score on a depressive symptom measure. In so doing, these studies have been unable to account for the seriousness of the presentation of depression (e.g. Leucht et al., 2012)). Such studies and others that have utilised individual patient data have also typically been limited to a narrow band of treatment types (e.g. Fournier et al., 2010) and to studies with small sample sizes (e.g. Cohen et al., 2019), limiting their generalizability ( Rothwell, 2005).

Several factors which may be considered in conjunction with depressive symptom scale scores as part of “disorder severity”, and could potentially act through the same mechanisms on outcome (e.g. Fried & Nesse, 2014), have also been found to be important in prognostic models. For example: factors related to past experiences of depression, duration or chronicity ( Fournier et al., 2009), a history of depression (e.g. Chekroud et al., 2016), and a history of previous treatments for depression (e.g. DeRubeis et al., 2014); and functional impairment (e.g. Delgadillo et al., 2016; Saunders et al., 2016) are all indicators of prognosis.

There is a lack of agreement on the prognostic role of anxiety symptoms and of comorbid anxiety disorders for those with depression despite agreement that symptoms of anxiety are common among those with depression either as part of their depressive episode or another comorbid disorder (e.g. Kessler et al., 2005; Sartorius et al., 1996). Somatic anxiety and avoidance related symptoms of agoraphobia ( Chekroud et al., 2016) have been found to be prognostic for those treated with antidepressant medications (ADM) but not in those treated with psychological therapies (e.g. Lutz et al., 2006). Symptoms of generalised anxiety disorder and phobias have also been found to be predictive of outcomes in some clinical cohorts ( Saunders et al., 2019; Saunders et al., 2016); but not in others ( Delgadillo et al., 2016). Given the high rates of comorbidity and the co-occurrence of depressive and anxious symptoms even at sub-clinical levels, it would be useful to know whether the prognostic effects of anxiety symptoms and disorders (collectively or individually) operate independently from depressive symptom severity, “disorder severity”, and independent of the type of treatment given, if any. One potential explanation for the somewhat contradictory findings on the role of anxiety symptoms and disorders on the prognosis of patients with depression is that many studies have used different scales to measure the same and indeed different anxiety conditions. Consistency in the measurement of such factors might allow for a more definitive investigation of the prognostic role of such symptoms and disorders.

There is similar disagreement regarding alcohol misuse as an indicator of prognosis, it is also highly comorbid with depression but has been less well studied (e.g. Weaver et al., 2003). Some studies have suggested that alcohol misuse (excluding alcohol dependence) is a prognostic indicator of treatment outcomes for those with depression ( Clarkson et al., 2016), but others have suggested that it is unrelated to treatment outcomes ( Boschloo et al., 2012) and instead is predictive only of dropping out of treatment ( Buckman et al., 2018). There are several other factors that may be related to depression treatment outcomes but again, the effects have been less well studied. These include health-rated quality of life (e.g. Huibers et al., 2014), social support (e.g. Hallgren et al., 2017) and life events recent to the present episode (e.g. DeRubeis et al., 2014; Fournier et al., 2009). This leaves the question then of whether or not these factors are indicative of prognosis independent of baseline severity (whether this encompasses only depressive symptoms (“symptom severity”) or the wider construct of “disorder severity” including other factors noted above), and independently of treatment. In addition, a number of demographic factors have been found to be important in predictive models of depression outcomes alongside symptom severity, including: age (e.g. Delgadillo et al., 2016; Fournier et al., 2009); gender ( Saunders et al., 2016); ethnicity (e.g. Chekroud et al., 2016; Saunders et al., 2016); marital status (e.g. Fournier et al., 2009); employment status (e.g. Chekroud et al., 2016; Fournier et al., 2009); level of educational attainment (e.g. Chekroud et al., 2016); and markers of socio-economic status or financial stability/security (e.g. Saunders et al., 2016). However, whether these factors are indicators of prognosis independent of severity (either as just depressive symptoms or the wider construct encompassing more than just depressive symptoms) remains to be seen. Here, we use the phrase independent of treatment to highlight that we wish to investigate factors that affect outcome regardless of any treatments rather than trying to identify factors that help predict response to a given type of treatment or those that predict differential response to two or more treatments.

Over the last three years a number of authors of the current article have worked to collect individual participant data (IPD) from randomised clinical trials (RCTs) of any treatment for depression, recruited from primary care services/general practice, that used the same clinical interview schedule (the CIS-R) to measure “disorder severity” factors, determine diagnoses, and capture symptoms across a range of depressive and anxious disorders. This article explains how that IPD dataset was formed and describes a protocol for a series of analyses of it.

Aims and objectives

  • 1)

    To determine whether certain “disorder severity” factors are indicators of prognosis, independent of treatment, and independent of baseline depression symptom scale scores. These are i) chronicity of depression at baseline; ii) a history of depression; iii) a history of any previous treatment for depression; iv) a history of ADM treatment; v) anxiety symptom severity; vi) presence of and number of comorbid anxiety disorders; vii) duration of anxiety problems; viii) functional impairment; and ix) health-related quality of life.

  • 2)

    To determine whether or not the following are indicators of prognosis independent of severity of depression as measured in both ways outlined in 1 above - symptom severity, and “disorder severity”:

    • i)

      social support

    • ii)

      the occurrence of recent stressful life events

    • iii)

      alcohol misuse

    • iv)

      demographic factors (age, gender, ethnicity, employment status, marital status, highest level of educational attainment, and financial wellbeing including housing status)

Methods

Identification and selection of studies

Studies were identified using a combination of keyword and subject heading searches on the bibliographic databases below, hand-searching through the references of studies identified in the searches, and by contacting experts for unpublished or missed studies. Searches were run in several stages, firstly to scope the literature in November 2015 and in order to refine inclusion and exclusion criteria, again in April 2016 to identify studies and begin the process of data collection, then finally in March 2019 to ensure no studies published more recently were missed. The final searches were run on the Cochrane CENTRAL Trial Register (searched on 20 th March 2019), Embase 1947 to 2019 Week 12, International Pharmaceutical Abstracts 1970 to March 2019, Ovid MEDLINE 1946 to March Week 3 2019, and PsycINFO 1806 to March Week 3 2019. Search terms included variations of phrases such as “depression” or “major depression”, “RCT” or “Randomised Controlled Trial” or “Clinical Trial”, and “CIS-R” or “Clinical Interview Schedule”. Full details of the searches are provided as Extended data ( Buckman, 2019).

A single reviewer (JB) screened titles and abstracts of potentially eligible studies returned by the searches, those that were potentially relevant to the review were then read in full and judged against inclusion/exclusion criteria. Uncertainties in inclusion/exclusion were discussed with two other reviewers (GL and SP). Relevant studies were then read in full by all three reviewers before reaching consensus.

Inclusion & exclusion criteria

Inclusion and exclusion criteria were refined over the stages of running scoping searches for this work. After this refining process, studies were included if they were randomised clinical trials (RCTs) of adults (aged 16 or over), had at least one active treatment arm, and used the CIS-R at baseline to measure symptoms of anxiety and depression and to determine diagnoses. The study samples had to have unipolar depression, depressive symptoms significant enough to lead them to seek treatment from their GP, or a CIS-R ( Lewis et al., 1992) score of ≥12; recruited from primary care centres. While all studies had to use the CIS-R at baseline, outcome measures could differ between studies.

Studies were excluded if they did not meet the above criteria and if they: included patients with depression as a secondary diagnosis in studies of adults with personality disorders, psychotic conditions, or neurological conditions; were studies of adults with bi-polar or psychotic depressions; were studies of children or adolescents; were feasibility studies only; or did not recruit participants from General Practices or in primary care.

Measures

The relevant measures included in the identified studies are:

The CIS-R ( Lewis et al., 1992): consists of 14 symptom subsections scored 0–4 covering core features of depression, depressive thoughts (scored 0–5), fatigue, concentration/forgetfulness, and sleep, generalized anxiety, worry, irritability, obsessions, compulsions, health anxiety/somatic concerns, phobic anxiety (split into agoraphobia, social phobia, and specific phobia), and panic. A final section measures general health, impairment and weight change. The total score ranges from 0–57 with a cut-off of ≥12 used to indicate likely common mental disorder, primary and secondary diagnoses using ICD-10 criteria are given as are binary indictors of diagnosis for all the disorders assessed.

Beck Depression Inventory (BDI-II: Beck et al., 1996)): used to measure depressive symptoms, each item is scored 0–3 with a maximum score obtainable of 63. A cut-off of ≥10 is used indicate significant symptoms of depression.

Patient Health Questionnaire (PHQ-9: Kroenke et al., 2001) is a 9-item depression screening measure. Items are scored 0–3, a cut-off of ≥10 is used to indicate “caseness” for depression.

Hospital Anxiety and Depression Scale (HADS: Zigmond & Snaith, 1983): measures symptoms on two subscales, depression and anxiety. The cut-off for caseness on the depression subscale is ≥8.

General Health Questionnaire (12-item version) (GHQ-12: Goldberg, 1992): a cut-off of ≥2 is used to indicate the presence of common mental disorders.

Edinburgh Postnatal Depression Scale (EPDS: Cox et al., 1987)): measures symptoms of depression focussed on women in the post-natal period, scores of ≥13 are indicative of a depressive episode

Social Support: an 8-item instrument assessing the degree to which participants rated the social support of their friends and family in each of the following domains: 1) being accepted for who one is; 2) feeling cared about; 3) feeling loved; 4) feeling important to them; 5) being able to rely on them; 6) feeling well supported and encouraged by them; 7) being made to feel happy by them; and 8) feeling able to talk to them whenever one might like. These were adapted by authors of RCTs (e.g. Kessler et al., 2009) included in this IPD from items of the Medical Outcomes Study Social Support Survey: ( Sherboune & Stewart, 1991). Items are scored 1–3, with total scores ranging from 8–24; higher scores indicate higher levels of perceived social support.

Life events: the Social Readjustment Rating Scale ( Holmes & Rahe, 1967): participants are asked to say yes/no to whether they have suffered any of nine events within the last six months e.g. a death/bereavement; being physically attacked/injured; or going through a divorce/separation. Each item is scored yes (1) or no (0) and the total score is the sum of all the items.

Alcohol use: the alcohol use disorder identification test primary care version (AUDIT-PC: Piccinelli et al., 1997) was used to assess alcohol misuse, this includes five items scored 0–4. A cut-off of ≥5 indicates hazardous alcohol use that may be harmful to one’s health.

Health related quality of life: EQ-5D-3L & EQ-5D-5L: ( Herdman et al., 2011)): the EQ-5D is a generic measure of health status in five domains – mobility; self-care; usual activities; pain/discomfort; and anxiety/depression. Each domain in the 3L version has three response categories ranging from no problem present (1) to extreme problems in the given domain (3), the 5L version has five response options ranging from “I have no problems…” (1) to “I am unable to…” or “I have/am extreme/extremely…” (5). A total score is derived from summing the score on the five items with higher scores indicating more severe health problems than lower scores. A cross-walk of scores from the 3L and 5L versions will be used to derive a continuous index score representing the EQ-5D total score in the present study ( van Hout et al., 2012).

Characteristics of the included studies

In total, 15 RCTs were identified as meeting inclusion criteria for the IPD, of which 12 have provided individual patient-data and one is in the process of providing these data, see Figure 1. Nine studies were identified at the initial scoping stage, with a further three found in the second stage of searching and one final further study identified as meeting inclusion criteria after the final searches. After a final consensus meeting once the final searches had been run, two studies previously considered to not meet inclusion criteria were re-classified as meeting criteria, and the study authors were contacted for IPD accordingly. A description of each included study for which authors agreed to provide IPD can be found in Table 1 and descriptive statistics and degrees of missingness for key predictor and outcome variables discussed below are presented in the Extended data ( Buckman, 2019). Integrity of the data for each study was checked with the study team and against details published about each study, discrepancies were discussed and investigated in conjunction with each study team until satisfactory explanations were found and updated data were provided if appropriate and if required.

Figure 1. Flow diagram of study selection.

Figure 1.

Table 1. Description of studies included in individual patient data (IPD).

Study N at T0 Sample Interventions Outcome Measure Endpoint at
3–4 months
N at 3–4
months
AHEAD ( Kendrick et al., 2006) 327 Adults with new depressive episodes diagnosed by GP TCA vs SSRI vs
Lofepramine
HADS (also CIS-R) Yes 169
CADET ( Richards et al., 2013) 527 Adults ≥18, ICD-10 Depressive Episode Collaborative Care vs
TAU
PHQ-9 Yes 505
COBALT ( Thomas et al., 2012) 469 Adults 18–75 with treatment resistant depression,
scoring ≥14 BDI-II
TAU vs CBT+TAU BDI-II (also PHQ-9) Yes 469
CPN-GP ( Kendrick et al., 2005;
Kendrick et al., 2006)
247 Adults 18–65 seeking treatment from GP for new
episode of depression, anxiety or reaction to life
difficulties, scoring >3 on GHQ-12 with between
4weeks and 6months duration
TAU vs CPN vs CPN +
Problem solving
CIS-R (also HADS) No 210
GENPOD ( Lewis et al., 2011) 601 Adults 18–74 with depressive episode Citalopram vs
Reboxetine
BDI-II (also HADS) Yes 601
HEALTHLINES ( Salisbury et al., 2016) 609 Adults ≥18, PHQ-9 score ≥10, confirmed diagnosis of
depression with CIS-R, internet access
TAU vs Healthlines
telecare + TAU
PHQ-9 Yes 525
IPCRESS ( Kessler et al., 2009) 295 Adults scoring ≥14 BDI-II and GP confirmed diagnosis
of depression
iCBT vs TAU BDI-II Yes 299
ITAS ( Thomas et al., 2004) 798 Adults ≥16, scored ≥12 on CIS-R TAU vs
Recommendation + TAU
GHQ-12 No 617
MIR ( Kessler et al., 2018) 480 Adults ≥18 taking SSRIs or SNRIs at adequate dose
for≥ 6 weeks, and scored ≥14 on BDI-II
Mirtazapine vs Placebo BDI-II (also PHQ-9) Yes 424
PANDA ( Salaminios et al., 2017) 652 Adults presenting with low mood or depression to GP in
last 2 years, free of ADM for 8 weeks up to baseline
Sertraline vs Placebo PHQ-9 (also BDI-II) Yes 653
REEACT ( Gilbody et al., 2015) 685 Adults with PHQ-9>10 presenting to GP with depression Moodgym vs Beating
the Blues vs TAU
PHQ-9 Yes 526
RESPOND ( Sharp et al., 2010) 220 Women meeting criteria for MDD within 6-months post-
partum
ADM vs Listening
intervention
EPDS Yes 218
TREAD ( Chalder et al., 2012) 361 Adults 18–69 who met diagnostic criteria for MDD and
scored ≥14 on BDI-II
TAU vs Physical Activity
+ TAU
BDI-II Yes 290

Abbreviations: ADM – antidepressant medication; BDI-II – Beck Depression Inventory; EPDS – Edinburgh Postnatal Depression Scale; GHQ-12 – General Health Questionnaire 12 item version; HADS-D – Hospital Anxiety and Depression Scale – depression subscale; iCBT (internet based therapist delivered cognitive behavioural therapy); MDD – Major Depressive Disorder; T0 - Baseline; TAU – treatment as usual; TCA – tricyclic antidepressant

Ethical considerations and trial registrations

All studies included in the Dep-GP database were granted ethical approvals by NHS Research Ethics Committees. Specific ethical approvals and trial registration details are given in Extended data ( Buckman, 2019).

Data analysis plan

End-point data. Of the included studies, 11 collected endpoint data between three and four months post-baseline (see Table 1), this will be the primary endpoint of interest for the analyses outlined in this protocol Additional end-points between six and eight months, and nine and 12 months post-baseline will be used for sensitivity analyses (see Table 2). Endpoints prior to three months or after 12 months will be excluded from the present analyses.

Table 2. Endpoints and time from baseline in weeks in each study in the Dep-GP database.

Endpoint and time from
baseline in weeks (w)
3–4m 6–8m 9–12m
Study 12–18w 24–32w 36–52w
AHEAD 12w 26w 52w
CADET 16w 52w
COBALT 12w 26w 36w
CPN-GP 26w
GENPOD 12w
HEALTHLINES 16w 32w 52w
IPCRESS 16w 32w
ITAS 26w
MIR 12w 24w 52w
PANDA 12w
REEACT 16w 52w
RESPOND 18w 44w
TREAD 16w 32w 52w

Outcomes

Primary outcomes. The primary outcome for the present analyses will be the score on the primary depressive symptom measure used at 3–4 months post-baseline. Scores on the different measures of depressive symptoms used across the studies at the endpoints will be z-score standardised. This will be done for each symptom measure using the mean and standard deviation (SD) at 3–4 months pooled across all arms of all studies that reported that symptom measure at that time. The same mean and SD will be used to create z-scores for secondary outcomes at 6–8 and 9–12 months post-baseline.

A second primary outcome will be the log of 3–4 month post-baseline depression scale scores, without standardising across the measures. This will allow for the consideration of proportional change in symptom scores (e.g. Button et al., 2015)

Secondary outcomes. In any analysis where the only outcome measures used in the studies of the Dep-GP database were the BDI-II or the PHQ-9 a secondary outcome will be a conversion of those two measure scores to the PROMIS T-score ( Choi et al., 2014). This will be achieved using cross-walk tables derived from an item-response theory based analysis of several depression symptom measures ( Choi et al., 2014).

Additional secondary outcomes will be partial remission on each of the primary outcome measures used in each study (scores below the cut-off for caseness on each measure as described in Measures section above), and the proportion of participants that dropped-out/withdrew from each study at each time-point.

Prognostic indicators under consideration

  • 1.

    “Disorder severity” of depression at baseline, from self-reported:

    • scores on the depressive symptom measures detailed above

    • the sum of the scores on the depressive sub-scales of the CIS-R

    • the sum of the scores on the non-depressive/anxiety sub-scales of the CIS-R combined, and individually by subscale

    • the number and type of comorbid anxiety disorders

    • the duration of depression

    • the duration of anxiety

    • whether or not participants have a history of depression

    • whether or not participants have a history of previous treatment for depression, and whether or not participants have a history of ADM treatment

    • whether or not participants were experiencing significant functional impairment at baseline

    • Health-rated quality of life at baseline.

  • 2.

    Demographic factors

    • Age

    • Gender

    • Ethnicity

    • employment status

    • marital status

    • highest level of educational attainment

    • financial wellbeing

    • housing tenure

  • 3.

    Social support in all eight domains listed in the Measures section above and the total score on the measure.

  • 4.

    Life events in all nine domains as discussed in the Measures above and the total score.

  • 5.

    Total score on the alcohol measure

Confounding factors

Different confounding factors will be considered in relation to each prognostic factor under investigation. Determinations of which factors to include in the meta-analytic models as confounders will be made based on a priori considerations of the relationship under investigation and the relationships between the confounder and both the prognostic indicator and outcome. Only factors that are independently associated with both the prognostic factor and the outcome, are not potentially caused by the prognostic factor, and affect the association between the prognostic factor and outcome will be considered as potential confounders. For example, age is a priori assumed to confound the relationship between duration of depression and outcome at 3-to-4 months. The presence of any long-term physical health condition might be considered a confounder in the relationship between health-related quality of life and outcome. In addition, research site or centre, and the clinical and demographic factors listed above in the prognostic indicator section (for analyses in which they are not the predictor of interest) will all be investigated as potential confounders. The variables used to stratify the randomisation beyond site and initial depressive symptom severity will be investigated as potential confounders within each study. Treatment allocation, i.e. the randomisation in each study will be controlled for in all multivariable models.

Data handling and data management

Pre-processing

Data from the 12 trials were received and cleaned on an individual study basis before combining all studies into a single aggregated dataset, the final Dep-GP dataset will be formed once data from the 13 th study are received and cleaned.

A number of baseline variables were re-categorised into higher-order categories due to small numbers, see Table 3. Of note, there was poorer data-coverage across the IPD on information about the number of past depressive episodes than there was on a separate question about whether or not the participant had any previous episodes, see Extended data ( Buckman, 2019).

Table 3. Categorisation of variables during data pre-processing.

Variable Original categories New categories
Ethnicity White White
Mixed Other
Black
Asian
Chinese
Other
Employment Status Full time employed Employed
Part time employed
Student Not seeking employment
Retired
House-person
Other
Unemployed jobseeker Unemployed
Unemployed due to ill-health
Marital Status Married/cohabiting Married/cohabiting
Single Single
Separated No longer married
Divorced
Widowed
Highest level of education Degree or higher Degree or higher
Foundation Degree/Diploma A-level or Diplomas
A-level
GCSE GCSE
Other qualifications None or Other
No formal qualifications
Financial Wellbeing Living Comfortably OK financially
Doing alright
Just about getting by Just about getting by
Hard to make ends meet Struggling financially
Very hard to make ends meet
Long-term Health
Condition Status
None No long-term physical
health conditions
Mental Health Only
Diabetes At least one long-term
physical health condition
Asthma or COPD
Arthritis
Heart Disease
Stroke
Cancer
Kidney Disease

Further pre-processing for the analyses specified below will be considered. The distributions of all variables will be inspected prior to imputation (discussed further below). Continuous variables that are non-normally distributed will be transformed to normality prior to imputation. If transformation is required of the prognostic indicators these will only be log transformed in order that the interpretation of their effects is sensible. If log-transformation does not result in approximate normality of the distribution of these variables, predictive mean matching ( Morris et al., 2014) will be used for imputation of missing data as part of the multiple imputation with chained equations approach discussed further below.

Missing data

Missing data will be imputed using multiple imputation with chained equations (MICE) in Stata 15.0 ( StataCorp, 2017). This approach uses regression models to impute missing values. A number of imputed datasets (here we will use 50) are produced to reflect the uncertainty/variability in the imputation process. If data are not reasonably able to be log transformed to meet normality assumptions, predictive mean matching (PMM) via a k-nearest neighbours approach will be used as it is considered to be more appropriate for non-normal continuous variables ( Horton & Lipsitz, 2001), here we will use k=10. Linear regression will be used for approximately normally distributed continuous variables, logistic regression models will be used for binary variables, and ordinal and multinomial regression models will be used for ordered and unordered categorical variables respectively. All imputation models will be built using data on baseline and outcome variables following conventions (e.g. Royston & White, 2011). Only variables with less than 50% missing data will imputed (see Extended data for degrees of missing by variable ( Buckman, 2019)). All imputation models will be run to produce 50 imputed datasets. If the primary analysis (detailed below) shows that results differ considerably when studies with systematically missing baseline data are included/excluded from the meta-analytic models, then a separate imputation approach will be taken to impute these systematically missing data: multiple imputation with multilevel random effects for study (e.g. Resche-Rigon et al., 2013).

Software & packages

Stata SE 15 ( StataCorp, 2017): ipdmetan ( Fisher, 2015) and ICE ( Royston, 2009), mi impute pmm ( Morris et al., 2014) packages.

Primary analyses

To investigate Aim 1 linear regression models of the score on the depressive symptom scales at 3–4 months post-baseline will be built in each study, adjusting for the random allocation in each study, baseline depression scale scores in each study, and then separately for other “disorder severity” related factors listed above. Estimates from each study will then be pooled in random effects meta-analyses. A multivariable model of outcome will be built considering all of the “disorder severity” factors that are significantly associated with outcome after adjusting for baseline depressive symptom scale scores alone. This will be done initially with only variables that are not systematically missing between the studies, such models will be built firstly on all studies and then on all studies that do not have systematically missing covariates that could otherwise have been included in the multivariable model. These models will be compared and if there is a considerable difference in the effects systematically variables will then be imputed as described above. Decisions on which factors to include/exclude in the multivariable models will be led by consideration of the unique contribution to the models by each variable, the amount of variance explained (R 2) when modelled with and without the given factor, and to tests of the assumptions of linear regression models. If there are high degrees of multicollinearity the variable(s) explaining most variance in outcome will be retained in the model while the other(s) is/are removed. Link tests will be performed to consider the appropriateness of the linear link function. Multivariate normality, homoscedasticity, and overly influential data points will be considered by plotting residuals, and assessing Cook’s distance in the residuals plotted against leverage.

Aim 2 - Separate meta-analyses will be conducted with each of the prognostic indicators under consideration (social support, life events, alcohol misuse, and the demographic factors outlined in the Introduction above), unadjusted and adjusted for severity (symptom severity and “disorder severity”) to determine whether or not they are indicative of outcome of treatment independently from either or both of symptom severity and “disorder severity”.

There will therefore be three models of the primary outcome built for each prognostic factor assessed and an additional model just for the confounding factors and the baseline depressive symptom scale scores:

  • 1.

    Baseline depressive symptom scale score adjusted for confounding factors.

  • 2.

    As in 1 but with the addition of each “disorder severity” factor (one by one).

  • 3.

    As in 1 with the addition of all “disorder severity” factors that were significant or otherwise important in 2, and then removing factors that are no longer significant.

  • 4.

    As in 3 with the addition of the other potential prognostic factors (e.g. social support) (one by one).

Meta-analyses will be conducted using the “ipdmetan” package in Stata ( Fisher, 2015) and displayed using forest plots. All meta-analyses will be conducted using a DerSimonian and Laird random effects model. This takes into account heterogeneity of coefficients between trials. The degree of heterogeneity will be assessed using prediction intervals and its impact will be assessed using the I 2 statistic ( Higgins et al., 2003).

Secondary and sensitivity analyses

If heterogeneity between the studies were considerable based on guidance from the Cochrane Collaboration e.g. with I 2 above 75% or where the effect in one study appears to be considerably different from that of all other studies after inspecting the forest plot ( Higgins et al., 2003), sensitivity analyses will be performed removing studies contributing most to the heterogeneity from the meta-analyses to consider their impact on the summary statistics. If the same variables were found to have considerable amounts of heterogeneity when analysed in each of the four models above, sensitivity analyses would be conducted for the model controlling for the most other variables, e.g. symptom severity and covariates (model 3) only. In addition, for variables in the final model(s), sensitivity analyses were similarly planned where the threshold for substantial heterogeneity was met (I 2 above 50%) ( Higgins et al., 2003). Additional investigations of potential heterogeneity between the studies will involve assessing effects in Aim 1 in subgroups of patients including those with treatment resistant depression compared to those with a first episode and those with no history of treatment. Further sensitivity analyses will be conducted using the endpoint at 6-to-8 months in bivariate meta-analyses in order to include the two studies that did not have an endpoint in the 3-to-4 month post-baseline time period. This will initially be done only to assess the prognostic indication of baseline depressive scale scores adjusted for the confounding factors specified. If it is found that this leads to considerable variation in the results then this method will be similarly used in the analyses of the other potential prognostic factors.

In addition to considering the associations between social support and outcome as modelled with the total score on the social support scale, analyses will be conducted with each of the eight domains measured on that scale. Likewise, each of the nine domains measured in the life events scale will be considered individually.

Sample size and power

The sample size for each of the proposed analyses will be dependent on the number of studies identified as relevant to that analysis and the degree of systematically missing data across the studies on the variables of interest. However, in Dep-GP there will be sufficient power to detect effects in all of outlined analyses as sample sizes will be beyond the minimum required to detect such effects. For example, for 80% power to detect an effect of depressive symptom severity of the same size found in a prior analyses (R 2 of 0.09: Delgadillo et al., 2017) with alpha set at 0.05, the minimum required sample size is 161 participants. It would be 105 participants to detect a similar effect for that found for employment status (R 2 of 0.137). In Dep-GP there are data on 4679 participants at 3–4 months post-baseline or 5226 once missing outcome data have been imputed. These sample sizes would give 80% power to detect effects where R 2 is greater than or equal to 0.0033 or 0.0029 respectively, with alpha set a 0.05.

Risk of bias

Risks of bias assessments will be conducted using the Quality in Prognosis Studies (QUIPS) ( Hayden et al., 2013). Two reviewers (JB & RS) will independently rate the risk of bias on the QUIPS in each study related to : i) study participation; ii) study attrition; iii) prognostic factor measurement; iv) outcome measurement; v) study confounding; and vi) statistical analysis and reporting. Studies well then be given a rating of “high risk”, “moderate risk” or “low risk”. The quality of evidence for each prognostic indicator will be assessed using the Grading Recommendations, Assessment, Development and Evaluations (GRADE) framework ( Guyatt et al., 2008).

Discussion and conclusions

Knowledge of prognosis for those seeking treatment for depression after accounting for baseline severity has been limited to the consideration of severity only as a depressive symptom scale score, but many other related factors including the chronicity of depression and comorbid symptoms of anxiety have been found to be important prognostic indicators. In addition, a number of factors have been reported to be indicators of prognosis for depressed patients, but whether this is true after adjusting for severity encompassed in a scale score (symptom severity) or a more broad range of related factors (which here we call “disorder severity”) remains to be seen. In order to investigate this, data from the individual participants of a wide range of clinical trials which have measured this breadth of severity related factors is needed.

We found 15 studies that do this and met inclusion criteria, 12 have given IPD data and one is in the process of providing IPD data to help form the Dep-GP database, data from the remaining two studies were no longer available as these were conducted approximately 20 years ago. We will use differing subsets of the 13 studies to meet our aims as necessary where data on key variables are available. The consistency in setting and the variability in both the populations drawn upon in the 13 studies and the treatments received in those studies means that findings from the Dep-GP database may be generalizable to adults with depression seeking treatment from their GP/family physician.

Study status

This is protocol version 1.3 last amended, 24 th October 2019. Data collection from eligible studies started in April 2016, agreement for data sharing from the final eligible study able to provide IPD was provided on 12 th March 2019 in principle and in full on 16 th September 2019, complete data for that study have not been provided yet. Estimated time to complete the outlined analyses is six months from the point at which we receive the final study dataset or if there is any reason that it cannot be provided, six months from the time we are notified of that eventuality.

Dissemination

Findings from the analyses outlined above will be disseminated through peer-reviewed publications, and academic conference proceedings, through online blogs and other grey-literature and to appropriate service user research advisory groups linked to the host organisation.

Data availability

Underlying data

No data are associated with this article

Extended data

Open Science Framework: What factors indicate prognosis for adults with depression in primary care? https://doi.org/10.17605/OSF.IO/UX95Q ( Buckman, 2019)

This project contains the following extended data:

  • details of missing data across dep-gp studies.docx (Missing data from included studies)

  • Ethics approval and trial registration details for dep-gp studies.docx (Ethics approval and trial registration details of included studies)

  • Search results_OSF.docx (Search terms and results of searches)

Reporting guidelines

Open Science Framework: PRISMA-P checklist ( Shamseer et al., 2015) for “What factors indicate prognosis for adults with depression in primary care? A protocol for meta-analyses of individual patient data using the Dep-GP database”. https://doi.org/10.17605/OSF.IO/UX95Q ( Buckman, 2019)

Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication).

Funding Statement

This work was supported by the Wellcome Trust through a Clinical Research Fellowship to JEJB [201292]. SP, KC and RS were all funded by University College London through the Research Department of Clinical, Educational and Health Psychology and respectively funded by the Higher Education Funding Council for England, NIHR, and the Royal College of Psychiatrists. SP and GL, and GA were supported by the National Institute for Health Research University College London Hospitals Biomedical Research Centre. ZC was supported by MQ Foundation [MQDS16/72]. RJDeR was on the faculty in the Department of Psychology at the University of Pennsylvania. SH was on the faculty in the Department of Psychology at Vanderbilt University in Nashville Tennessee USA. IW was supported by the Medical Research Council Programme [MC_UU_12023/21]. TK was funded through the University of Southampton by the Higher Education Funding Council for England. EW was funded through the University of Exeter by the Higher Education Funding Council for England. SG was funded by grants from the National Institute for Health Research.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 3; peer review: 2 approved]

References

  1. Beck AT, Steer RA, Brown GK: Beck depression inventory-II. San Antonio.1996;78(2):490–498. Reference Source [Google Scholar]
  2. Boschloo L, Vogelzangs N, van den Brink W, et al. : Alcohol use disorders and the course of depressive and anxiety disorders. Br J Psychiatry. 2012;200(6):476–484. 10.1192/bjp.bp.111.097550 [DOI] [PubMed] [Google Scholar]
  3. Buckman JEJ: What factors indicate prognosis for adults with depression in primary care?2019. 10.17605/OSF.IO/UX95Q [DOI] [PMC free article] [PubMed]
  4. Buckman JEJ, Naismith I, Saunders R, et al. : The Impact of Alcohol Use on Drop-out and Psychological Treatment Outcomes in Improving Access to Psychological Therapies Services: an Audit. Behav Cogn Psychother. 2018;46(5):513–527. 10.1017/S1352465817000819 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Buckman JEJ, Underwood A, Clarke K, et al. : Risk factors for relapse and recurrence of depression in adults and how they operate: A four-phase systematic review and meta-synthesis. Clin Psychol Rev. 2018;64(7):13–38. 10.1016/j.cpr.2018.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Button KS, Kounali D, Thomas L, et al. : Minimal clinically important difference on the Beck Depression Inventory--II according to the patient’s perspective. Psychol Med. 2015;45(15):3269–3279. 10.1017/S0033291715001270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chalder M, Wiles NJ, Campbell J, et al. : Facilitated physical activity as a treatment for depressed adults: randomised controlled trial. BMJ. 2012;344: e2758. 10.1136/bmj.e2758 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chekroud AM, Zotti RJ, Shehzad Z, et al. : Cross-trial prediction of treatment outcome in depression: a machine learning approach. Lancet Psychiatry. 2016;3(3):243–250. 10.1016/S2215-0366(15)00471-X [DOI] [PubMed] [Google Scholar]
  9. Choi SW, Schalet BD, Cook KF, et al. : Establishing a common metric for depressive symptoms: linking the BDI-II, CES-D, and PHQ-9 to PROMIS depression. Psychol Assess. 2014;26(2):513–527. 10.1037/a0035768 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Clarkson P, Giebel CM, Challis D, et al. : Outcomes from a pilot psychological therapies service for UK military veterans. Nurs Open. 2016;3(4):227–235. 10.1002/nop2.57 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cohen ZD, DeRubeis RJ: Treatment Selection in Depression. Annu Rev Clin Psychol. 2018;14:209–236. 10.1146/annurev-clinpsy-050817-084746 [DOI] [PubMed] [Google Scholar]
  12. Cohen ZD, Kim TT, Van HL, et al. : A demonstration of a multi-method variable selection approach for treatment selection: Recommending cognitive-behavioral versus psychodynamic therapy for mild to moderate adult depression. Psychother Res. 2019;1–14. 10.1080/10503307.2018.1563312 [DOI] [PubMed] [Google Scholar]
  13. Cox JL, Holden JM, Sagovsky R: Detection of postnatal depression. Development of the 10-item Edinburgh Postnatal Depression Scale. Br J Psychiatry. 1987;150(6):782–786. 10.1192/bjp.150.6.782 [DOI] [PubMed] [Google Scholar]
  14. Delgadillo J, Huey D, Bennett H, et al. : Case complexity as a guide for psychological treatment selection. J Consult Clin Psychol. 2017;85(9):835–853. 10.1037/ccp0000231 [DOI] [PubMed] [Google Scholar]
  15. Delgadillo J, Moreea O, Lutz W: Different people respond differently to therapy: A demonstration using patient profiling and risk stratification. Behav Res Ther. 2016;79:15–22. 10.1016/j.brat.2016.02.003 [DOI] [PubMed] [Google Scholar]
  16. DeRubeis RJ, Cohen ZD, Forand NR, et al. : The Personalized Advantage Index: translating research on prediction into individualized treatment recommendations. A demonstration. PLoS One. 2014;9(1):e83875. 10.1371/journal.pone.0083875 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fisher DJ: Two-stage individual participant data meta-analysis and generalized forest plots. Stata J. 2015;15(2):369–396. 10.1177/1536867X1501500203 [DOI] [Google Scholar]
  18. Fournier JC, Derubeis RJ, Hollon SD, et al. : Antidepressant drug effects and depression severity: a patient-level meta-analysis. JAMA. 2010;303(1):47–53. 10.1001/jama.2009.1943 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fournier JC, DeRubeis RJ, Shelton RC, et al. : Prediction of response to medication and cognitive therapy in the treatment of moderate to severe depression. J Consult Clin Psychol. 2009;77(4):775–787. 10.1037/a0015401 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fried EI, Nesse RM: The impact of individual depressive symptoms on impairment of psychosocial functioning. PLoS One. 2014;9(2):e90311. 10.1371/journal.pone.0090311 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gilbody S, Littlewood E, Hewitt C, et al. : Computerised cognitive behaviour therapy (cCBT) as treatment for depression in primary care (REEACT trial): large scale pragmatic randomised controlled trial. BMJ. 2015;351:h5627. 10.1136/bmj.h5627 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Goldberg D: General Health Questionnaire (GHQ-12) NFER. Nelson Publishing Co. Ltd Windsor, UK.1992. [Google Scholar]
  23. Guyatt GH, Oxman AD, Vist GE, et al. : GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ. 2008;336(7650):924–926. 10.1136/bmj.39489.470347.AD [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hallgren M, Lundin A, Tee FY, et al. : Somebody to lean on: Social relationships predict post-treatment depression severity in adults. Psychiatry Res. 2017;249:261–267. 10.1016/j.psychres.2016.12.060 [DOI] [PubMed] [Google Scholar]
  25. Hayden JA, van der Windt DA, Cartwright JL, et al. : Assessing bias in studies of prognostic factors. Ann Intern Med. 2013;158(4):280–286. 10.7326/0003-4819-158-4-201302190-00009 [DOI] [PubMed] [Google Scholar]
  26. Herdman M, Gudex C, Lloyd A, et al. : Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res. 2011;20(10):1727–1736. 10.1007/s11136-011-9903-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Higgins JP, Thompson SG, Deeks JJ, et al. : Measuring inconsistency in meta-analyses. BMJ. 2003;327(7414):557–560. 10.1136/bmj.327.7414.557 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Holmes TH, Rahe RH: The Social Readjustment Rating Scale. J Psychosom Res. 1967;11(2):213–218. 10.1016/0022-3999(67)90010-4 [DOI] [PubMed] [Google Scholar]
  29. Horton NJ, Lipsitz SR: Multiple Imputation in Practice: Comparison of Software Packages for Regression Models With Missing Variables. Am Stat. 2001;55(3):244–254. 10.1198/000313001317098266 [DOI] [Google Scholar]
  30. Huibers MJH, Van Breukelen G, Roelofs J, et al. : Predicting response to cognitive therapy and interpersonal therapy, with or without antidepressant medication, for major depression: A pragmatic trial in routine practice. J Affect Disord. 2014;152–154:146–154. 10.1016/j.jad.2013.08.027 [DOI] [PubMed] [Google Scholar]
  31. Kendrick T, Peveler R, Longworth L, et al. : Cost-effectiveness and cost-utility of tricyclic antidepressants, selective serotonin reuptake inhibitors and lofepramine: randomised controlled trial. Br J Psychiatry. 2006;188(4):337–345. 10.1192/bjp.188.4.337 [DOI] [PubMed] [Google Scholar]
  32. Kendrick T, Simons L, Mynors-Wallis L, et al. : A trial of problem-solving by community mental health nurses for anxiety, depression and life difficulties among general practice patients. The CPN-GP study. Health Technol Assess. 2005;9(37):1–104, iii. 10.3310/hta9370 [DOI] [PubMed] [Google Scholar]
  33. Kessler D, Lewis G, Kaur S, et al. : Therapist-delivered internet psychotherapy for depression in primary care: a randomised controlled trial. Lancet. 2009;374(9690):628–634. 10.1016/S0140-6736(09)61257-5 [DOI] [PubMed] [Google Scholar]
  34. Kessler DS, Macneill SJ, Tallon D, et al. : Mirtazapine added to SSRIs or SNRIs for treatment resistant depression in primary care: phase III randomised placebo controlled trial (MIR). BMJ. 2018;363:k4218. 10.1136/bmj.k4218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kessler RC: The potential of predictive analytics to provide clinical decision support in depression treatment planning. Curr Opin Psychiatry. 2018;31(1):32–39. 10.1097/YCO.0000000000000377 [DOI] [PubMed] [Google Scholar]
  36. Kessler RC, Chiu WT, Demler O, et al. : Prevalence, Severity, and Comorbidity of 12-Month DSM-IV Disorders in the National Comorbidity Survey Replication. Arch Gen Psychiatry. 2005;62(6):617–627. 10.1001/archpsyc.62.6.617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kirsch I, Deacon BJ, Huedo-Medina TB, et al. : Initial severity and antidepressant benefits: a meta-analysis of data submitted to the Food and Drug Administration. PLoS Med. 2008;5(2):260–268, e45. 10.1371/journal.pmed.0050045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Kroenke K, Spitzer RL, Williams JB: The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. 2001;16(9):606–613. 10.1046/j.1525-1497.2001.016009606.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Leucht S, Helfer B, Gartlehner G, et al. : How effective are common medications: a perspective based on meta-analyses of major drugs. BMC Med. 2015;13(1):253–258. 10.1186/s12916-015-0494-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Leucht S, Hierl S, Kissling W, et al. : Putting the efficacy of psychiatric and general medicine medication into perspective: review of meta-analyses. Br J Psychiatry. 2012;200(2):97–106. 10.1192/bjp.bp.111.096594 [DOI] [PubMed] [Google Scholar]
  41. Lewis G, Mulligan J, Wiles N, et al. : Polymorphism of the 5-HT transporter and response to antidepressants: randomised controlled trial. Br J Psychiatry. 2011;198(6):464–471. 10.1192/bjp.bp.110.082727 [DOI] [PubMed] [Google Scholar]
  42. Lewis G, Pelosi AJ, Araya R, et al. : Measuring psychiatric disorder in the community: a standardized assessment for use by lay interviewers. Psychol Med. 1992;22(2):465–486. 10.1017/S0033291700030415 [DOI] [PubMed] [Google Scholar]
  43. Lutz W, Saunders SM, Leon SC, et al. : Empirically and Clinically Useful Decision Making in Psychotherapy: Differential Predictions With Treatment Response Models. Psychol Assess. 2006;18(2):133–141. 10.1037/1040-3590.18.2.133 [DOI] [PubMed] [Google Scholar]
  44. Morris TP, White IR, Royston P: Tuning multiple imputation by predictive mean matching and local residual draws. BMC Med Res Methodol. 2014;14:75–88. 10.1186/1471-2288-14-75 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Olfson M, Blanco C, Marcus SC: Treatment of adult depression in the United States. JAMA Intern Med. 2016;176(10):1482–1491. 10.1001/jamainternmed.2016.5057 [DOI] [PubMed] [Google Scholar]
  46. Piccinelli M, Tessari E, Bortolomasi M, et al. : Efficacy of the alcohol use disorders identification test as a screening tool for hazardous alcohol intake and related disorders in primary care: a validity study. BMJ. 1997;314(7078):420–4. 10.1136/bmj.314.7078.420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Resche-Rigon M, White IR, Bartlett JW, et al. : Multiple imputation for handling systematically missing confounders in meta-analysis of individual participant data. Stat Med. 2013;32(28):4890–4905. 10.1002/sim.5894 [DOI] [PubMed] [Google Scholar]
  48. Richards DA, Hill JJ, Gask L, et al. : Clinical effectiveness of collaborative care for depression in UK primary care (CADET): cluster randomised controlled trial. BMJ. 2013;347:f4913. 10.1136/bmj.f4913 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Rothwell PM: Treating individuals 2. Subgroup analysis in randomised controlled trials: importance, indications, and interpretation. Lancet. 2005;365(9454):176–186. 10.1016/S0140-6736(05)17709-5 [DOI] [PubMed] [Google Scholar]
  50. Royston P: Multiple imputation of missing values: Further update of ice, with an emphasis on categorical variables. Stata J. 2009;9(3):466–477. 10.1177/1536867X0900900308 [DOI] [Google Scholar]
  51. Royston P, White IR: Multiple Imputation by Chained Equations (MICE): Implementation in Stata. J Stat Softw. 2011;45(4):1–20. 10.18637/jss.v045.i04 [DOI] [Google Scholar]
  52. Rush AJ, Kraemer HC, Sackeim HA, et al. : Report by the ACNP Task Force on response and remission in major depressive disorder. Neuropsychopharmacology. 2006;31(9):1841–1853. 10.1038/sj.npp.1301131 [DOI] [PubMed] [Google Scholar]
  53. Salaminios G, Duffy L, Ades A, et al. : A randomised controlled trial assessing the severity and duration of depressive symptoms associated with a clinically significant response to sertraline versus placebo, in people presenting to primary care with depression (PANDA trial): study protocol for a randomised controlled trial. Trials. 2017;18(1):496–510. 10.1186/s13063-017-2253-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Salisbury C, O'Cathain A, Edwards L, et al. : Effectiveness of an integrated telehealth service for patients with depression: a pragmatic randomised controlled trial of a complex intervention. Lancet Psychiatry. 2016;3(6):515–525. 10.1016/S2215-0366(16)00083-3 [DOI] [PubMed] [Google Scholar]
  55. Sartorius N, Üstün TB, Lecrubier Y, et al. : Depression comorbid with anxiety: results from the WHO study on psychological disorders in primary health care. Br J Psychiatry Suppl. 1996;168(30):38–43. 10.1192/S0007125000298395 [DOI] [PubMed] [Google Scholar]
  56. Saunders R, Buckman JEJ, Cape J, et al. : Trajectories of depression and anxiety symptom change during psychological therapy. J Affect Disord. 2019;249:327–335. 10.1016/j.jad.2019.02.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Saunders R, Cape J, Fearon P, et al. : Predicting treatment outcome in psychological treatment services by identifying latent profiles of patients. J Affect Disord. 2016;197:107–115. 10.1016/j.jad.2016.03.011 [DOI] [PubMed] [Google Scholar]
  58. Shamseer L, Moher D, Clarke M, et al. : Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015: elaboration and explanation. BMJ. 2015;350:g7647. 10.1136/bmj.g7647 [DOI] [PubMed] [Google Scholar]
  59. Sharp DJ, Chew-graham C, Tylee A, et al. : A pragmatic randomised controlled trial to compare antidepressants with a community-based psychosocial intervention for the treatment of women with postnatal depression: the RESPOND trial. Health Technol Assess. 2010;14(43): iii-iv, ix-xi, 1-153. 10.3310/hta14430 [DOI] [PubMed] [Google Scholar]
  60. Sherboune CD, Stewart AL: The MOS Social Support Survey. Soc Sci Med. 1991;32(6):705–714. 10.1016/0277-9536(91)90150-B [DOI] [PubMed] [Google Scholar]
  61. StataCorp LP: Stata Base Reference Manual - Release 15. College Station, Texas: Stata Press.2017. Reference Source [Google Scholar]
  62. Thomas LJ, Abel A, Ridgway N, et al. : Cognitive behavioural therapy as an adjunct to pharmacotherapy for treatment resistant depression in primary care: The CoBalT randomised controlled trial protocol. Contemp Clin Trials. 2012;33(2):312–319. 10.1016/j.cct.2011.10.016 [DOI] [PubMed] [Google Scholar]
  63. Thomas HV, Lewis G, Watson M, et al. : Computerised patient-specific guidelines for management of common mental disorders in primary care: a randomised controlled trial. Br J Gen Pract. 2004;54(508):832–837. [PMC free article] [PubMed] [Google Scholar]
  64. Thornicroft G, Chatterji S, Evans-Lacko S, et al. : Undertreatment of people with major depressive disorder in 21 countries. Br J Psychiatry. 2017;210(2):119–124. 10.1192/bjp.bp.116.188078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. van Hout B, Janssen MF, Feng YS, et al. : Interim scoring for the EQ-5D-5L: mapping the EQ-5D-5L to EQ-5D-3L value sets. Value Health. 2012;15(5):708–715. 10.1016/j.jval.2012.02.008 [DOI] [PubMed] [Google Scholar]
  66. Weaver T, Madden P, Charles V, et al. : Comorbidity of substance misuse and mental illness in community mental health and substance misuse services. Br J Psychiatry. 2003;183:304–313. 10.1192/bjp.183.4.304 [DOI] [PubMed] [Google Scholar]
  67. Zigmond AS, Snaith RP: The hospital anxiety and depression scale. Acta Psychiatr Scand. 1983;67(6):361–370. 10.1111/j.1600-0447.1983.tb09716.x [DOI] [PubMed] [Google Scholar]
Wellcome Open Res. 2019 Nov 21. doi: 10.21956/wellcomeopenres.17017.r36881

Reviewer response for version 2

Julieta Galante 1

I thank the authors for carefully considering my comments to their first manuscript. The revised version reads clearer to me and I have no further comments to make. I look forward to the results of this interesting project.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2019 Oct 23. doi: 10.21956/wellcomeopenres.16615.r35662

Reviewer response for version 1

Wolfgang Lutz 1, Brian Schwartz 1

This manuscript proposes a set of meta-analyses of individual patient data examining prognostic factors for treatment outcome in patients with depression. Prognostic factors include depressive symptoms, several indicators of disorder severity and a number of non-severity related factors. Databases were already searched for RCTs complying with the inclusion and exclusion criteria. Thirteen eligible RCTs were found, the protocol at hand describes the data analysis of the resulting 6271 patients.

The proposed project is concerned with an important topic by aggregating the results of RCTs examining prognostic factors for treatment outcome. It addresses the problem of inconsistent findings due to different measures of disorder severity and their dependence on baseline symptom severity. This meta-analysis is a promising approach to elucidate the association between different prognostic factors and treatment outcome in depression.

The hypotheses are derived comprehensibly from an extensive theoretical background. All supposed prognostic factors are based on recent empirical evidence or are otherwise well-founded. Instead of relying on an aggregated symptom severity score only, the authors capture the construct of disorder severity in a comprehensive manner based on several indicators.

The reported methods are well described. The proposal deals with variable transformations and data preprocessing as well as sophisticated statistical analyses. To rule out confounding effects, potentially confounding variables are controlled individually.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2019 Jun 18. doi: 10.21956/wellcomeopenres.16615.r35663

Reviewer response for version 1

Julieta Galante 1

This article proposes a meta-analysis of individual participant trial data to explore prognostic factors of adult depression, and assess their influence independently of each other. The authors have already secured individual participant data for almost all of the trials to be included.

I consider this manuscript to be well-written, with clear objectives and an appropriate study design. However, regarding reproducibility, I am not clear on the implication of the review search date, which is March 2019. I have to assume that data sharing agreements predate the search, which was only conducted to make sure no eligible trials were missing – was the 13 th trial a result of the search? It would be much clearer if the authors could spell out this process.

Another confusing point is that the PROSPERO registration dated April 2019 shows Preliminary searches, Piloting of the study selection process, Formal screening of search results against eligibility criteria and Data extraction as ongoing or not started; this is inconsistent with what is stated in the manuscript. Similarly, in the Data handling and data management section the authors wrote “data from 13 studies were received…”. This is inconsistent with previous statements that data from one trial is pending.

Figure 1 shows that pilot/feasibility trials were excluded (see records excluded). If these were RCTs and used a sample that was different from the larger trials that followed, it is unclear why they were excluded. If exclusion was pre-specified and performed in a systematic fashion, it should be explained and justified in detail as part of the exclusion criteria. If exclusion was not pre-specified systematically, then these trials should be included in the meta-analysis, or at least this mentioned as a limitation.

Regarding generalisability, the discussion section reads: “findings from the Dep-GP database should be widely generalizable to adults with depression seeking treatment from their GP/family physician”. I find this too strong an assertion of generalisability - authors are only including trials from the UK, and there is cultural variability in many of the factors measured, so I would be more cautious regarding the external validity of the findings.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Wellcome Open Res. 2019 Oct 24.
Joshua Buckman 1

This article proposes a meta-analysis of individual participant trial data to explore prognostic factors of adult depression, and assess their influence independently of each other. The authors have already secured individual participant data for almost all of the trials to be included.

I consider this manuscript to be well-written, with clear objectives and an appropriate study design.

Response: We thank Dr Galante for her thorough and informative review. In light of her comments we have made a number of amendments to the manuscript as detailed below.

1. However, regarding reproducibility, I am not clear on the implication of the review search date, which is March 2019. I have to assume that data sharing agreements predate the search, which was only conducted to make sure no eligible trials were missing – was the 13th trial a result of the search? It would be much clearer if the authors could spell out this process.

Response: Thank you for raising this important point, we have clarified our review process by adding the following to the manuscript:

Under the heading: “Identification and selection of studies”:

Searches were run in several stages, first an initial scoping search of the literature in November 2015, next a full search in April 2016 to identify studies and begin the process of data collection, then finally another search in March 2019 to ensure no studies published more recently were missed.

Under the heading “Inclusion & exclusion criteria”:

Inclusion and exclusion criteria were refined in the initial stage of the scoping searches for this work. After this refining process, studies were included if they were randomised clinical trials (RCTs) that had at least one active treatment arm and used the CIS-R at baseline to assess symptoms of anxiety and depression and to determine diagnoses. The studies were included if their samples were adults (aged 16 or over), with a diagnosis unipolar depression, depressive symptoms significant enough to lead them to seek treatment from their GP, or a CIS-R ( Lewis et al., 1992) score of ≥12, and if they were recruited from primary care centres.

Under the heading “ Characteristics of the included studies”:

In total, 15 RCTs were identified as meeting inclusion criteria for the IPD, of which 12 have provided individual patient-data and one is in the process of providing these data, see Figure 1. Nine studies were identified at the initial scoping stage, with a further three found in the second stage of searching and one final further study identified as meeting inclusion criteria after the final searches. After a consensus meeting once the final searches had been run, two studies previously considered to not meet inclusion criteria were re-classified as meeting criteria, and the study authors were contacted for IPD accordingly.

2. Another confusing point is that the PROSPERO registration dated April 2019 shows Preliminary searches, Piloting of the study selection process, Formal screening of search results against eligibility criteria and Data extraction as ongoing or not started; this is inconsistent with what is stated in the manuscript.

Response: At the time of writing we had just run our final searches, we were in the process of data collection and the final formal consideration of studies for inclusion and exclusion. As a part of that process we found one newly published study which met inclusion criteria and during a consensus meeting between the three reviewers involved in this process, we determined that two previously excluded studies would also have met inclusion criteria. As a result we contacted the authors of those latter two studies after submitting the initial version of the present manuscript, and we were informed that data from those studies were no longer available. So, it was accurate to state that the process of searching, the study selection process and data extraction were ongoing at the point of writing this manuscript and at the point of registration on PROSPERO. We will update our PROSPERO record as soon these processes are completed, once data from the final study are received and cleaned.

3. Similarly, in the Data handling and data management section the authors wrote “data from 13 studies were received…”. This is inconsistent with previous statements that data from one trial is pending.

Response: Thank you for pointing out this error, we have amended it to read:

“Data from the 12 trials were received and cleaned on an individual study basis before combining all studies into a single aggregated dataset, the final Dep-GP dataset will be formed once data from the 13 th study are received and cleaned”

4. Figure 1 shows that pilot/feasibility trials were excluded (see records excluded). If these were RCTs and used a sample that was different from the larger trials that followed, it is unclear why they were excluded. If exclusion was pre-specified and performed in a systematic fashion, it should be explained and justified in detail as part of the exclusion criteria. If exclusion was not pre-specified systematically, then these trials should be included in the meta-analysis, or at least this mentioned as a limitation.

Response: We have amended Figure 1 to show that 15 studies met our final inclusion criteria and the study authors from each of them were contacted for IPD. In terms of excluding the feasibility trials, this was part of our original pre-specified inclusion/exclusion criteria and was accordingly applied systematically across all searches.

5. Regarding generalisability, the discussion section reads: “findings from the Dep-GP database should be widely generalizable to adults with depression seeking treatment from their GP/family physician”. I find this too strong an assertion of generalisability - authors are only including trials from the UK, and there is cultural variability in many of the factors measured, so I would be more cautious regarding the external validity of the findings.

Response: We thank the reviewer for highlighting this, we have amended the manuscript accordingly, under the heading “ Discussion and Conclusions”:

The consistency in setting and the variability in both the populations drawn upon in the 13 studies and the treatments received in those studies means that findings from the Dep-GP database may be generalizable to adults with depression seeking treatment from their GP/family physician.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Buckman JEJ: What factors indicate prognosis for adults with depression in primary care?2019. 10.17605/OSF.IO/UX95Q [DOI] [PMC free article] [PubMed]

    Data Availability Statement

    Underlying data

    No data are associated with this article

    Extended data

    Open Science Framework: What factors indicate prognosis for adults with depression in primary care? https://doi.org/10.17605/OSF.IO/UX95Q ( Buckman, 2019)

    This project contains the following extended data:

    • details of missing data across dep-gp studies.docx (Missing data from included studies)

    • Ethics approval and trial registration details for dep-gp studies.docx (Ethics approval and trial registration details of included studies)

    • Search results_OSF.docx (Search terms and results of searches)

    Reporting guidelines

    Open Science Framework: PRISMA-P checklist ( Shamseer et al., 2015) for “What factors indicate prognosis for adults with depression in primary care? A protocol for meta-analyses of individual patient data using the Dep-GP database”. https://doi.org/10.17605/OSF.IO/UX95Q ( Buckman, 2019)

    Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication).


    Articles from Wellcome Open Research are provided here courtesy of The Wellcome Trust

    RESOURCES