Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Mar 1.
Published in final edited form as: Neuropsychology. 2023 Mar;37(3):315–329. doi: 10.1037/neu0000850

Concurrent validity and reliability of suicide risk assessment instruments: A meta analysis of 20 instruments across 27 international cohorts

Adrian I Campos 1,2,*, Laura S Van Velzen 3,4,*, Dick J Veltman 5, Elena Pozzi 3,4, Sonia Ambrogi 6, Elizabeth D Ballard 7, Nerisa Banaj 6, Zeynep Başgöze 8, Sophie Bellow 9, Francesco Benedetti 10,11, Irene Bollettini 10,11, Katharina Brosch 12, Erick J Canales-Rodríguez 13,14,15, Emily K Clarke-Rubright 16, Lejla Colic 66,69,70, Colm G Connolly 17, Philippe Courtet 18,19, Kathryn R Cullen 8, Udo Dannlowski 20, Maria R Dauvermann 9,21, Christopher G Davey 22, Jeremy Deverdun 23, Katharina Dohm 20, Tracy Erwin-Grabner 24, Roberto Goya-Maldonado 24, Negar Fani 25, Lydia Fortea 26,27, Paola Fuentes-Claramonte 13,14, Ali Saffet Gonul 28,29,30, Ian H Gotlib 31, Dominik Grotegerd 20, Mathew A Harris 33, Ben J Harrison 32, Courtney C Haswell 16, Emma L Hawkins 33, Dawson Hill 8, Yoshiyuki Hirano 34,35, Tiffany C Ho 60,63,64, Fabrice Jollant 36,37,38,39,65, Tanja Jovanovic 40, Tilo Kircher 41, Bonnie Klimes-Dougan 42, Emmanuelle le Bars 23,43, Christine Lochner 44, Andrew M McIntosh 33, Susanne Meinert 20, Yara Mekawi 25, Elisa Melloni 10,11, Philip Mitchell 45, Rajendra A Morey 16,46, Akiko Nakagawa 34,35, Igor Nenadić 12, Emilie Olié 18,19, Fabricio Pereira 47,48, Rachel D Phillips 16, Fabrizio Piras 6, Sara Poletti 10,11, Edith Pomarol-Clotet 13,14, Joaquim Radua 26,49,68, Kerry J Ressler 25,50, Gloria Roberts 45, Elena Rodriguez-Cano 13,51, Matthew D Sacchet 52, Raymond Salvador 13,14, Anca-Larisa Sandu 53, Eiji Shimizu 34,35, Aditya Singh 24, Gianfranco Spalletta 6,54, J Douglas Steele 55, Dan J Stein 56, Frederike Stein 12, Jennifer S Stevens 25, Giana I Teresi 31,67, Aslihan Uyar-Demir 28, Nic J van der Wee 57,71, Steven J van der Werff 57, Sanne JH van Rooij 25, Daniela Vecchio 6, Norma Verdolini 58, Eduard Vieta 58, Gordon D Waiter 59, Heather Whalley 33, Sarah L Whittle 32, Tony T Yang 60, Carlos A Zarate Jr 7, Paul M Thompson 61, Neda Jahanshad 61, Anne-Laura van Harmelen 9,62, Hilary P Blumberg 69,72,73, Lianne Schmaal 3,4, Miguel E Rentería 1,2,
PMCID: PMC10132776  NIHMSID: NIHMS1867333  PMID: 37011159

Abstract

Objective

A major limitation of current suicide research is the lack of power to identify robust correlates of suicidal thoughts or behaviour. Variation in suicide risk assessment instruments used across cohorts may represent a limitation to pooling data in international consortia.

Method

Here, we examine this issue through two approaches: (i) an extensive literature search on the reliability and concurrent validity of the most commonly used instruments; and (ii) by pooling data (N~6,000 participants) from cohorts from the ENIGMA-Major Depressive Disorder (ENIGMA-MDD) and ENIGMA-Suicidal Thoughts and Behaviour (ENIGMA-STB) working groups, to assess the concurrent validity of instruments currently used for assessing suicidal thoughts or behaviour.

Results

We observed moderate-to-high correlations between measures, consistent with the wide range (kappa range: 0.15-0.97; r range: 0.21-0.94) reported in the literature. Two common multi-item instruments, the Columbia Suicide Severity Rating Scale (C-SSRS) and the Beck Scale for Suicidal Ideation (SSI) were highly correlated with each other (r=0.83). Sensitivity analyses identified sources of heterogeneity such as the time frame of the instrument and whether it relies on self-report or a clinical interview. Finally, construct-specific analyses suggest that suicide ideation items from common psychiatric questionnaires are most concordant with the suicide ideation construct of multi-item instruments.

Conclusions

Our findings suggest that multi-item instruments provide valuable information on different aspects of suicidal thoughts or behaviour but share a modest core factor with single suicidal ideation items. Retrospective, multi-site collaborations including distinct instruments should be feasible provided they harmonise across instruments or focus on specific constructs of suicidality.

Keywords: suicide, concurrent validity, harmonisation, psychometrics, instruments

Introduction

Suicide is a leading cause of death worldwide, with an estimated 800,000 deaths by suicide occurring annually, or one person dying by suicide every 40 seconds (World Health Organization, 2014). Despite national and international efforts to prevent suicide, suicide rates continue to rise in some countries around the world (Alicandro et al., 2019).

To better understand and identify demographic, environmental, psychological, cognitive and neurobiological factors associated with suicidal thoughts and behaviour, we need large samples, as individual factors most likely explain a small proportion of complex phenotypes as suicidal thoughts or behaviours. Large and diverse samples additionally provide the opportunity to study the heterogeneity in associated factors by identifying subgroups and moderation effects. Large-scale international collaborations in consortia for suicide research may provide an important step forward to increase our understanding of suicidal thoughts and behaviour (STB).

Examples of these consortia include the International Suicide Genetics Consortium (Mullins et al., 2022) and the Enhancing NeuroImaging Genetics through Meta-Analysis Suicidal Thoughts and Behaviour (ENIGMA-STB) consortium (van Velzen et al., 2021). The aim of ENIGMA-STB is to study the neural correlates of STB, by bringing together research groups around the world that have collected both neuroimaging data and assessed STB in individuals with and without mental disorders. These groups use standardised protocols to process their neuroimaging data and then pool data in analyses that have increased statistical power to detect relevant associations.

While these large-scale collaborations have many strengths, it has been challenging to harmonize the different instruments employed to assess STBs across cohorts, and the validity of the findings will depend on the quality of STB measure harmonisation. For instance, in our recent large-scale analysis of cortical morphology across 18 research groups within the ENIGMA Major Depressive Disorder (ENIGMA-MDD) consortium (Campos et al., 2020), suicide attempt was assessed using 19 different measures, including single items on STBs from depression severity questionnaires, items from clinical interviews, in addition to items from comprehensive instruments specifically focused on STBs such as the Columbia Suicide Severity Rating Scale (C-SSRS) (Posner et al., 2008, 2011).

To inform future suicide research in international consortia, it is important to examine how these different suicide measures relate to each other and whether they can be used interchangeably. Therefore, the aim of this study was to examine the correlations between the 20 different assessment instruments for STBs used across 27 ENIGMA cohorts. In the first part of this report, we provide an overview of the literature on reliability and validity of these commonly-used measures to assess STBs, and the associations between these measures (concurrent validity). In the second part, we present findings from a meta-analysis performed using the cross-sectional measures, including multi-item and single-item measures, collected across 27 cohorts within the ENIGMA-MDD and ENIGMA-STB working groups.

Methods

Literature search

A literature search was conducted in PubMed (https://pubmed.ncbi.nlm.nih.gov) for articles published before December 2021, using the following search terms: suicid* AND (questionnaire* OR interview OR measures) AND (validity OR convergent validity OR discriminant validity OR reliability OR psychometric*), using ‘English’ and ‘Human’ as additional filters.

2,549 abstracts were screened by investigator LvV to identify studies which used psychometric measures to assess suicidal ideation and/or suicidal behaviour that were also collected by the ENIGMA research groups. These measures included: the Beck Depression Inventory (BDI) suicidal ideation item (Beck et al., 1961, 1996), Scale for Suicidal Ideation (SSI) (Beck et al., 1979, 1988), Children’s Depression Rating Scale (CDRS) suicidal ideation item (Poznanski & Mokros, 1996), Composite International Diagnostic Interview (CIDI) items on suicidal ideation and behaviour (WHO, 1997), Columbia Suicide Severity Rating Scale (C-SSRS) (Posner et al., 2011), Diagnostic Interview for Genetics Studies (DIGS) items on suicidal ideation and behaviour (Nurnberger et al., 1994), Hamilton Depression Rating Scale (HAM-D) item on suicidal ideation (Hamilton, 1960), Inventory of Depression and Anxiety Symptoms (IDAS-II) suicide subscale (Watson et al., 2012), (Quick) Inventory of Depressive Symptomatology (IDS/QIDS) suicidal ideation item (Rush et al., 1986, 2003), Kiddie Schedule for Affective Disorders and Schizophrenia (KSADS) suicide items (Kaufman et al., 1997), Montgomery-Asberg Depression Rating Scale (MADRS) suicidal ideation item (Montgomery & Asberg, 1979), Mini International Neuropsychiatric Interview (MINI) suicidality module (Sheehan et al., 1998), Revised Children’s Anxiety and Depression Scale (RCADS) suicidal ideation item (Chorpita et al., 2000), Structured Clinical Interview for DSM Disorders (SCID) suicide questions (First, 1997), Suicidal Ideation Questionnaire (SIQ) (Reynolds, 1987), Beck’s Suicide Intent Scale (SIS) (Beck et al., 1974), Self-Injurious Thoughts and Behaviours Interview (SITBI) (Nock et al., 2007), Suicide Score Scale (SSS) (Innamorati et al., 2008), Youth Self-Report suicide item (YSR) (Achenbach et al., 1991), Suicidal Ideation Questionnaire-Junior (SIQ-JR) (Reynolds, 1987).

A total of 225 studies were identified and screened for information on the reliability (inter-rater reliability, internal consistency or test-retest reliability) or validity (correlation with an established instrument e.g., concurrent validity or predictive validity) of those measures. For concurrent validity, we included only associations between measures that were collected by the ENIGMA working groups and mentioned above. Additional studies were identified by cross-referencing.

Data dimensionality reduction strategy

Our study comprised both multiple-item and single-item suicide risk assessment instruments. Importantly, only cross-sectional data was included. Single-item instruments, such as questions from depression severity rating scales or psychiatric interviews normally assess recent suicidal ideation. Multi-item instruments typically capture other dimensions such as control over suicidal thoughts, protective factors and, in the case of suicide attempt, degree of intent to die. By extracting common factor scores for the multi-item instruments, we are able to obtain a score of the underlying suicidal risk being measured by the instruments while reducing the need to adjust for slight differences between versions. The choice of dimensionality reduction approach, common factor scores using full-information maximum likelihood (FIML), was motivated by two reasons; (i) this approach deals with missing data, which is common in these questionnaires, using FIML and (ii) we obtain a single factor score capturing the main liability measured by the instrument, as opposed to other approaches (e.g., PCA) that require non-missing data and output several new variables. Data dimensionality reduction for the SSI, KSADS, IDAS, SIQ, DIGS, C-SSRS, MINI and SSS was performed by extracting common factor scores using FIML factor analysis as implemented in the structural equation modelling package umx in R (Bates, 2018; Bates et al., 2016). Notably dimensionality reduction included all items available for each cohort (with any variance) of the multi-item instruments. For instance, the CSSR-S instrument includes items on severity of suicidal ideation, intensity of suicidal ideation, and suicidal behaviours. We did not focus on a specific section of these items in our main analyses.

Main analysis on ENIGMA data

We implemented a sample size-weighted meta-analysis of correlations between suicide-risk assessment instruments across 27 international cohorts from the ENIGMA Major Depressive Disorder (MDD) and Suicidal Thoughts and Behaviour (STB) working groups. Cohorts shared de-identified individual level response data to STBs assessment instruments or items on STBs from depression symptom severity questionnaires or clinical interviews. All participants provided informed consent and all projects were approved by their respective relevant ethics committees. Our initial analysis consisted of three steps: i) data dimensionality reduction for multi-item suicide risk assessment instruments (see above); ii) within-cohort unadjusted correlations for all possible pairs of instruments; and iii) a sample-size weighted meta-analysis, combining the correlation coefficients for pairs of instruments for which data was available across multiple cohorts while accounting for cohort sample size differences. Within-cohort correlations, and the sample-size weighted meta-analysis were calculated in python using the scipy (Virtanen et al., 2020), numpy (Harris et al., 2020) and pandas (McKinney et al. 2010) libraries. Data was visualised using undirected graphs with varying node and edge sizes according to the number of cohorts and cohort pairs supporting each correlation. These were generated from the data using python and the networkx library (Hagberg et al., 2008). We analysed data from two working groups of the ENIGMA consortium, including 22 instruments across 27 cohorts worldwide. Individual level responses for 6,716 participants were included in our study (Table 1). Participants were included across a range of diagnoses including: major depressive disorder, psychotic disorders, anxiety disorders, obsessive-compulsive disorders, posttraumatic stress disorder, and bipolar disorder, along with data from healthy controls.

Table 1.

Cohorts included, sample size and instruments used to assess suicidal thoughts and/or behaviours

Cohort name Sample size Instruments
AFFDIS 29 Hamilton Depression Rating Scale (past week), Beck Depression Inventory (past two weeks), Montgomery-Asberg Depression Rating Scale (past week)
Chiba University 117 Beck Depression Inventory-I (past week), Beck Depression Inventory-II (past two weeks), Mini International Neuropsychiatric Interview suicidality module (past month, question 6 refers to lifetime attempt)
Duke/Durham VA 190 Beck Depression Inventory (past two weeks), Beck Scale for Suicidal Ideation (past week)
EPISCA (Leiden adolescents) 71 Children's Depression Inventory (past two weeks), Youth Self-report (lifetime), Revised Children’s Anxiety and Depression Scale (lifetime)
ETPB-STB 60 Hamilton Depression Rating Scale (past week), Scale for Suicidal Ideation (past week), Beck Depression Inventory (past week), Montgomery-Asberg Depression Rating Scale (past week)
FIDMAG 284 Hamilton Depression Rating Scale (past week), Montgomery-Asberg Depression Rating Scale (past week)
FOR2107 Muenster 424 Hamilton Depression Rating Scale (past week), Beck Depression Inventory (past week)
FOR2107 Marburg 792 Hamilton Depression Rating Scale (past week), Beck Depression Inventory (past week)
Grady Trauma Project Emory University 123 Beck Depression Inventory (past two weeks), Columbia Suicide Severity Rating Scale (past month)
The University of Melbourne 283 Quick Inventory of Depressive Symptoms (past week), Suicidal Ideation Questionnaire (lifetime), Columbia Suicide Severity Rating Scale (past month and lifetime), Montgomery-Asberg Depression Rating Scale (past week)
University of Minnesota Adolescent MDD 110 Children’s Depression Rating Scale (past week), Beck Depression Inventory (past two weeks), Kiddie Schedule for Affective Disorders and Schizophrenia (lifetime), Inventory of Depression and Anxiety Symptoms (past two weeks)
CHU Montpellier BICS study 66 Inventory of Depressive Symptoms - Clinician rated (past week), Beck Depression Inventory (current), Columbia Suicide Severity Rating Scale (lifetime & last month)
CHU Montpellier IMPACT study 40 Quick Inventory of Depressive Symptoms (past week), Inventory of Depressive Symptoms - Clinician rated (past week), Columbia Suicide Severity Rating Scale (last month)
CHU Montpellier Servier Study 120 Hamilton Depression Rating Scale (past week), Beck Depression Inventory (past week), Scale for Suicidal Ideation (past week)
McGill University 103 Beck Depression Inventory (past two weeks), Suicide Intent Scale (most recent attempt and most severe attempt), Scale for Suicidal Ideation (today and in the past two weeks), Quick Inventory of Depressive Symptoms (past week), Montgomery-Asberg Depression Rating Scale (past week), Hamilton Depression Rating Scale (past week), Columbia Suicide Severity Rating Scale (past month and lifetime)
Moral Dilemma 62 Quick Inventory of Depressive Symptoms (past week), Montgomery-Asberg Depression Rating Scale (past week)
Muenster Neuroimaging Cohort 1064 Hamilton Depression Rating Scale (past week), Beck Depression Inventory (past week), Montgomery-Asberg Depression Rating Scale (past week)
Fondazione Santa Lucia 288 Suicide Score Scale (past year), Mini International Neuropsychiatric Interview suicidality module (past month)
San Raffaele Hospital 447 Hamilton Depression Rating Scale (past week), Beck Depression Inventory (past two weeks), Beck Scale for Suicidal Ideation (day of assessment)
SoCAT 179 Hamilton Depression Inventory (past week), Beck Depression Inventory (past two weeks)
South Africa 117 Diagnostic Interview for Genetic Studies (lifetime), Montgomery-Asberg Depression Rating Scale (past week)
Stanford University adolescent MDD TIGER 49 Columbia Suicide Severity Rating Scale (lifetime & current (ideation in the past week, attempt in the past month), Self-Injurious Thoughts and Behaviours Interview (lifetime)
Stanford University AGG/FAA 56 Structured Clinical Interview for DSM Disorders (lifetime), Hamilton Depression Rating Scale (past week)
STRADL 1188 Structured Clinical Interview for DSM DIsorders (past month), Quick Inventory of Depressive Symptoms (past week)
Sydney Bipolar Risk Study 225 Kiddie Schedule for Affective Disorders and Schizophrenia (lifetime), Diagnostic Interview for Genetic Studies (lifetime), Montgomery-Asberg Depression Rating Scale (lifetime)
UCSF Adolescent MDD 161 Beck Depression Inventory (two weeks), Columbia Suicide Severity Scale (lifetime and current (ideation in the past week, attempt in the past two weeks))
Yale School of Medicine 178 Beck Scale for Suicidal Ideation (past month and lifetime), Suicide Intent Scale (most recent attempt, most lethal attempt), Columbia Suicide Severity Rating Scale (lifetime & past month), Hamilton Depression Rating Scale (past week), Child Depression Rating Scale (past week)

Sensitivity analyses

We performed a series of sensitivity analyses to identify sources of heterogeneity that could explain our results. These analyses consisted of repeating our framework but on subsets of the data. We mainly assessed (i) whether removing items measuring protective factors affected our results; (ii) compared lifetime and recent suicide risk assessment as well as (iii) interview-based and self report suicide risk assessment; and (iv) perform construct specific analyses to assess the degree of concordance of construct factors (i.e. suicide attempt, suicide ideation and suicide intent) derived from items in multi-item instruments and whether single-item instruments show a higher concordance with a specific construct. For aim iv, items of multi-item instruments were mapped to one of three possible constructs, suicide ideation, suicide attempt and suicide intent (Supplementary Table S1). Dimensionality reduction was then carried out for all the items mapped to each of these constructs and the analyses were repeated using these construct factors instead of the previous common factors. The following constructs were assessed: suicide Intent - defined as the seriousness or intensity of the wish of a person to end their life; suicidal ideation - defined as thoughts, ideas, and the desire to die by suicide, which can vary in severity and intensity; and suicide attempt - defined as a behaviour or act of self-harm related to an individual intentionally seeking to end their life.

Results

Literature review

An overview of the reliability and validity measures for the different suicide scales and items derived from our literature review is presented in Supplementary Table S2 and S3, respectively. Information on the reliability (inter-rater reliability, internal consistency and test-retest reliability) and validity (concurrent and predictive validity) of these measures wer es most often available for instruments specifically focused on STBs (e.g., SIS, SSI, C-SSRS), followed by items on suicidal ideation from questionnaires assessing the severity of depressive symptoms (e.g., BDI, MADRS, HAM-D). No reliability or validity measures were identified for suicide questions from diagnostic interviews (e.g., CIDI, SCID). The lowest concurrent validity measure identified was between the SIS and C-SSRS scales. Overall, mostly moderate to high correlation or concurrent validity scores (kappa range: 0.15-0.97; r range: 0.21-0.94) between instruments were identified (Figure 1a). When examining the concurrent validity between self-report measures the correlation ranged between 0.41 and 0.69, and kappa varied between 0.15 and 0.45, while for clinician-administered measures the correlation varied between 0.21 and 0.94 and the kappa varied between 0.42 and 0.97 (Figure 1a).

Figure 1. Overview of results.

Figure 1.

a) Literature review results. Reported instrument correlations are shown using an undirected graph. Nodes represent the instruments studied. Edges are coloured based on the average correlation across cohorts, edge width increases with the number of studies from which the correlations were extracted. b) ENIGMA correlation results. Each node represents one of the instruments included in the study. Edge color represents the sample-size weighted average correlation coefficient between two instruments. The thickness of the edge increases with the number of cohorts contributing to estimate the correlation. Generally speaking the thicker the edge the more confidence in the correlation estimate. c) Overview of the instruments used on the enigma analyses and the amount of overlap between them. BDI:Beck Depression Inventory suicidal ideation item; SSI: Scale for Suicidal Ideation; CDRS: Children’s Depression Rating Scale suicidal ideation item; CIDI: CIDI items on suicidal ideation and behaviour; C-SSRS: Columbia Suicide Severity Rating Scale; DIGS: Diagnostic Interview for Genetics Studies items on suicidal ideation and behaviour; HAM-D: Hamilton Depression Rating Scale item on suicidal ideation; IDAS-II: Inventory of Depression and Anxiety Symptoms suicide subscale; QIDS: Quick Inventory of Depressive Symptomatology suicidal ideation item; K-SADS: Kiddie Schedule for Affective Disorders and Schizophrenia suicide items; MADRS: Montgomery-Asberg Depression Rating Scale suicidal ideation item; MINI: Mini International Neuropsychiatric Interview suicidality module; RCADS: Revised Children’s Anxiety and Depression Scale suicidal ideation item; SCID: Structured Clinical Interview for DSM Disorders suicide questions; SIQ: Suicidal Ideation Questionnaire; SIS: Beck’s Suicide Intent Scale; SITBI: Self-Injurous Thouhts and Behaviours Interview; SSS: Suicide Score Scale; YSR: Youth Self-Report suicide item; SIQ-JR: Suicidal Ideation Questionnaire-Junior.

With regard to reliability, as expected, the strongest test-retest reliability was seen for instruments that assess lifetime or longer term suicidal ideation or behaviour (SIQ, SIQ-JR; r=0.72-0.91 and 0.89 respectively), compared to varying test-retest reliability in measures that assess short-term suicidal ideation (such as the BDI or HDRS suicidal ideation items; r=0.35 and 0.64 respectively) (see Table S1). In addition, test-retest reliability was lower for more ambiguous constructs such as suicidal gestures (kappa=0.25) in the SITBI. In general, inter-rater reliability for clinician-administered measures was strong for specialized multi-item instruments such as the C-SSRS, SIS, SSI and SITBI, and for clinical interviews such as the KSADS, but moderate for single items on recent suicidal ideation from depression severity rating scales (such as the CDRS). Finally, internal consistency was strong for multi-item specialized instruments that focus on one construct (such as recent or lifetime ideation), but varied more across studies for the C-SSRS intensity of ideation subscale (see Table S1).

The lowest concurrent validity measure identified was between the SIS and C-SSRS scale. Overall, mostly moderate to high correlation or concurrent validity scores (kappa range: 0.15-0.97; r range: 0.21-0.94) between instruments were identified (Figure 1a). When examining the concurrent validity between self-report measures the correlation ranged between 0.41 and 0.69, and kappa varied between 0.15 and 0.45, while for clinician-administered measures the correlation varied between 0.21 and 0.94 and the kappa varied between 0.42 and 0.97 (Figure 1a).

ENIGMA meta-analysis

Sample description and dimensionality reduction

The average age across cohorts was 39 years (SD=16.3). Cohorts had on average 40% male participants. The most commonly available instruments were the suicidal ideation items from the MADRS, HAM-D and BDI questionnaires. Multi-item items, such as the C-SSRS, QIDS, SSI, and SCID, were available for 32.4% of the participants. The majority of instruments were administered by a clinician or trained interviewer, but some self-reported measures were used (Table 2). For multi-item instruments dimensionality reduction was carried out by extracting common factor scores using factor analysis (see methods). Fit statistics of these models for each multi-item instrument within each cohort are summarised in Supplementary Table 4.

Table 2.

Instruments used to assess suicidal thoughts and/or behaviour by the different cohorts: self-report and clinician/interviewer administered measures are presented separately.

Cohort name Self-report Clinician/Interviewer administered
AFFDIS BDI-II suicidal ideation item MADRS and HAM-D suicidal ideation item
Chiba University BDI-I suicidal ideation item, BDI-II suicidal ideation item MINI suicide module
Duke/Durham VA BDI suicidal ideation item, Beck Scale for Suicidal Ideation
EPISCA (Leiden adolescents) CDI suicidal ideation item, RCADS suicidal ideation item, YSR suicidal ideation item
ETPB-STB BDI suicidal ideation item HAM-D suicidal ideation item, MADRS suicidal ideation item, Beck Scale for Suicidal Ideation
FIDMAG HAM-D suicidal ideation item, MADRS suicidal ideation item
FOR2107 Muenster BDI suicidal ideation item HAM-D suicidal ideation item
FOR2107 Marburg BDI suicidal ideation item HAM-D suicidal ideation item
Grady Trauma Project Emory University BDI suicidal ideation item, Columbia Suicide Severity Rating Scale
The University of Melbourne QIDS suicidal ideation item, Suicidal Ideation Questionnaire Columbia Suicide Severity Rathing Scale, MADRS suicidal ideation item
University of Minnesota Adolescent MDD IDAS, BDI suicidal ideation item Children’s Depression Rating Scale, KSADS suicide items
CHU Montpellier BICS study BDI suicidal ideation item IDS suicidal ideation item, Columbia Suicide Severity Rating Scale
CHU Montpellier IMPACT study QIDS suicidal ideation item IDS suicidal ideation item, Columbia Suicide Severity Rating Scale
CHU Montpellier Servier Study BDI suicidal ideation item HAM-D suicidal ideation item, Beck Scale for Suicidal Ideation
McGill University BDI suicidal ideation item, QIDS suicidal ideation item, Scale for Suicidal Ideation MADRS suicidal ideation item, HAM-D suicidal ideation item, Suicide Intent Scale, Columbia Suicide Severity Rating Scale
Moral Dilemma QIDS suicidal ideation item MADRS suicidal ideation item
Muenster Neuroimaging Cohort BDI suicidal ideation item HAM-D suicidal ideation item, MADRS suicidal ideation item
Fondazione Santa Lucia Suicide Score Scale, Mini International Neuropsychiatric Interview suicidality module
San Raffaele Hospital BDI suicidal ideation item, Beck Scale for Suicidal Ideation HAM-D suicidal ideation item
SoCAT BDI suicidal ideation item HAM-D suicidal ideation item
South Africa MADRS suicidal ideation item, DIGS suicide items
Stanford University adolescent MDD TIGER C-SSRS, SITBI
Stanford University AGG/FAA SCID suicide items, HAM-D suicidal ideation item
STRADL QIDS suicidal ideation item SCID suicide items
Sydney Bipolar Risk Study MADRS suicide item, DIGS suicide items, KSADS suicide items
UCSF Adolescent MDD BDI-II suicidal ideation item Columbia Suicide Severity Rating Scale
Yale School of Medicine Beck Scale for Suicidal Ideation, Suicide Intent Scale, Columbia Suicide Severity Rating Scale, Hamilton Depression Rating Scale, Child Depression Rating Scale

These multi-item instruments typically measure more than one suicidal construct. For example, the C-SSRS and SSI include sections on protective factors. Thus, a single common factor might not represent the best model underlying the latent structure of these instruments, but it serves our purpose of dimensionality reduction while capturing the main underlying latent factor related to suicidality which these instruments assess.

Correlation analyses

Results of our correlation analyses are summarised in Figure 1b. Full results are provided in Supplementary Table S5. As explained in the methods section, multi-item instruments were summarised using factor analysis for dimensionality reduction. From now on, when referring to the multi-item instruments listed in the methods, we are referring to the common factor score obtained by the dimensionality reduction approach. Overall moderate to high correlations (median r=0.44) were observed among all the studied instruments (including single-item and common-factor of multi-item instruments). Nonetheless, the common factor of the SIS showed poor correlations (median r~0.20) with most of the instruments tested. This result is not unexpected; the SIS was applied by a single cohort (N=16) and assesses suicide intent during a suicide attempt and no suicidal ideation or behaviour as the other instruments do (see discussion).

The instrument with the highest consistency (i.e., highest median weighted correlations with other instruments) was the IDS-Clinician rated measure (median r=0.76). However, few pairs of cohorts had data for this instrument. The C-SSRS and SSI instrument showed a very high concordance (r=0.83; N=191) with each other. In addition, there was a strong correlation between the HAM-D suicidal ideation item and the same item in the version of this questionnaire for children, the CDRS, but this was supported by a single cohort (r=0.94, N=20). The MADRS suicidal ideation item showed a high correlation with the HAM-D (r=0.67, N=1,087) and BDI (r=0.74, N=844) suicidal ideation items and with the SSI instrument common factor (r=0.67, N=119). The HAM-D and BDI suicide items showed only a moderate correlation (r=0.41, N=2,555) between them. Both of these measures were moderately correlated with the SSI (r=0.38, N=429 and r=0.36, N=350 respectively). Moderate to low correlations were observed for the group comprising child scales (YSR, RCADS and CDI), but these were only collected by one cohort. The MINI and SSS common factors showed a low correlation (r=0.12), which was also supported by a single cohort only (N=64). A secondary analysis excluding suicide deterrents or protective factors before dimensionality reduction showed practically the same results (Supplementary Figure 1).

Recent versus lifetime STB

Cohorts applied different instruments assessing STB with different time frames. For example, the C-SSRS could be used to assess lifetime, time since last assessment, and recent (past two weeks) suicidal behaviour information, whereas other instruments might be worded around the past month, two weeks, week or even at the time of assessment. This is a potential source of heterogeneity for studies wishing to compare these measures. For this reason, we repeated the analyses only focusing on measures applied to a recent (within the past month) time frame and compared them to the results shown above. These analyses showed similar correlations overall. Notably, concordance between the C-SSRS and the HAM-D, as well as C-SSRS and BDI, were higher when focusing only on recent instruments (Figure 2a-b).

Figure 2. Sensitivity results.

Figure 2

Undirected acyclic graph shows the results for the meta-analysis of correlations of suicide risk assessment instruments across ENIGMA cohorts for the complete results (a); using only measures assessing recent (up to past month) suicidal behaviour (b); instruments administered via self-report (c) or using interviewer administered instruments (d). Each node represents one of the instruments included in the study. Each edge color represents the sample-size weighted average correlation coefficient between two instruments. The thickness of the edge increases with the number of cohorts contributing to estimate the correlation. Generally speaking the thicker the edge the more confidence in the correlation estimate. *For multi-item instruments dimensionality reduction was carried out by extracting common factor scores.

Comparing clinician and self-report scales

We gathered information on whether the distinct instruments were administered by a clinical interview or by self-report (Table 2), and the majority of instruments were administered by a clinician or trained interviewer. Within interviewer-administered scales, high correlations (r>0.7) were observed between the SSI, C-SSRS, MADRS and QIDS instruments. A similar result was observed between HAM-D and MADRS (r=0.67). The SIS still showed a lower consistency with most other instruments (Figure 2c). For the self-report based instruments, less data was available. Among the self-reported instruments, the BDI and IDAS as well as the SSI and QIDS instruments showed a high concordance (Figure 2d). For interviewer-based measures, we were able to replicate the pattern from the main analyses: high correlations between single-item measures and measures assessing recent suicidal ideation (HAM-D, MADRS), strong correlations between detailed measures of STB (C-SSRS and SSI) and low correlations between the SIS and other measures. Nonetheless, we identified pairs of instruments such as the HAM-D and BDI whose low correlation in the main analysis might be explained by differences in administration (i.e., self-report vs interviewer).

Construct specific analyses

Overall, given the small number of cohorts with multi-item instruments, there were few instances where a comparison between construct factors was possible. Results were consistent with the full data analyses, for example the SSI and C-SSRS suicide ideation constructs showed a high correlation (r=0.84) (Supplementary Figure 2). The MADRS single item showed a high correlation with the KSADS suicide attempt construct, but not with the C-SSRS suicide attempt construct. Single-item instruments, on average, showed a higher correlation with the ideation constructs of multi-item instruments compared to the other constructs. Consistent with our previous results, single-item showed a very poor correlation with suicide intent constructs and moderate to low correlations with suicide attempt constructs (Supplementary Figure 2).

Discussion

Our study is a comprehensive assessment of how well different suicide risk assesment instruments relate to each other and the extent to which they can be used interchangeably. Harmonization reduces heterogeneity and increases power for discovery analyses, but also enables the assessment of the generalisability of studies and opens up the opportunity to investigate other aspects such as interactions and individual variation analysis (van Harmelen et al., 2020). Identification of correlates of suicide risk may be improved by increasing sample sizes, and by pooling data across studies to detect small effect sizes, which may result from large variance in underlying mechanisms. Our study aimed to examine the concurrent validity of instruments commonly used to assess STBs. To this end, we compared individual level responses across questionnaires by pooling data from 27 cohorts belonging to the ENIGMA-MDD and ENIGMA-STB working groups. We compared our results to a systematic literature search across 225 studies.

A potential source of variance is the heterogeneity introduced by using different suicide risk assessment instruments that each measure slightly different underlying phenomena. Both the results of our analysis and our literature search identified moderate to high correlations between the most commonly used instruments to assess suicidal ideation including the BDI, SSI, HAM-D and MADRS, and between multi-item instruments (such as the C-SSRS and SSI). These findings are consistent with another study which showed strong correlations between the SSI, BDI and HAM-D (Desseilles et al., 2012). Nonetheless, our results were consistently more conservative than the literature (e.g., showing a lower degree of correlation). This could be explained in several ways. This might be evidence for publication bias whereby only positive and expected associations are published. Heterogeneity arising from the way these instruments are administered (i.e. time frames for suicidal behaviours and self-report vs interview based) might also affect the results. We performed sensitivity analyses testing for these factors. For example, the low correlations between BDI and HAM-D may be explained by the fact that the BDI is a self-report measure whereas the HAM-D is typically administered via a clinician interview. There are studies showing that participants are more likely to disclose suicidal ideation in self-reported measures compared to clinical interviews (Kaplan et al., 1994; Yigletu et al., 2004). It is unclear whether a similar phenomenon underlies our results, given that the discrepancy is seen across cohorts rather than in individuals. Another example includes the correlation between the BDI and C-SSRS common factor, which became higher when focusing on the interview based cohort only. Nonetheless, this explanation does not always hold true, for example, the MADRS (also interview-based) correlated well with the BDI.

Our unique methodology was chosen to integrate slightly different versions of instruments (i.e., self-reported vs. interview administered). By extracting common factor scores for the multi-item instruments, we were able to obtain a measure of the underlying suicidal risk being measured by the instruments while reducing the need to adjust for slightly different wording or items between versions. This approach might be more conservative than those used in previous studies as it is focused on measuring suicide risk rather than a specific construct such as attempt or ideation. As such, instruments that mainly focused on a specific aspect of STB such as suicidal ideation (for example the SIQ) are expected to show a lower correlation with more broad instruments that assess a range of relevant behaviors, such as the C-SSRS.

It is worth noting that suicidal constructs may share a common liability, but they may also have partially independent aetiologies. The Suicide Intent Scale (SIS) was the only measure with overall low measures of consistency in both our analyses and the literature search. Our findings do not necessarily imply that the SIS (or any of the measures studied) lack utility, but rather that there is a lack of concordance with other measures which could be due to assessment of different constructs or other sources of heterogeneity. As such, future studies specifically focused on harmonisation across suicidal measures should account for the design differences present in the SIS or single item instruments, and assess whether their inclusion provides valuable information. However, caution is warranted in interpreting the SIS findings as they were based on two cohorts only. Although our literature search identified a low concordance between the SIS and both the SIQ and the C-SSRS, a high concordance was reported between SIS and the K-SADS. These results are likely explained by the fact that the SIS is a questionnaire mainly focused on assessing intent of a past suicide attempt. This is further complicated by the intricate relationship between suicidal ideation, attempt, and actual suicide intent. Participants might engage in a suicide attempt with a relatively low intent to die. In fact, previous studies have identified that combining the SIS with other scales increases sensitivity and specificity for predicting suicide deaths (Stefansson et al., 2015). In a secondary analysis focusing on the suicide intent construct of multi-item instruments we observed a very poor correlation between all single-item instruments and the suicide intent factors.

Our study represents a comprehensive approach to assess the concordance and reliability of commonly used suicide risk assessment instruments. Nonetheless, some limitations need to be considered when interpreting our findings. Our literature search was as exhaustive and systematic as was practicable; however, we cannot rule out the possibility that some relevant studies were excluded because they are not indexed in the databases we searched. Language and cultural differences between cohorts might also affect whether two instruments are concordant. Our study comprised predominantly English speaking participants, but some cohorts included French, Dutch and German speaking participants. Limited research is available on whether language affects reporting of psychiatric symptomatology (Erkoreka et al., 2020). Participants might have undergone evaluation at different points in time in relation to the timing of suicidal behaviors or thoughts, and recall bias could lower the concordance between instruments. In fact, our sensitivity analyses showed that the time frame and administration of the scale used can affect how different measures compare with each other and that most single-item instruments seem to correlate better with suicide ideation constructs rather than attempt or intent constructs. When focusing on multi-item instruments, we performed dimensionality reduction using factor scores derived from full-information maximum likelihood. Our approach was based on the fact that some instruments have multiple versions. Thus, our approach is not an exploratory or confirmatory factor analysis of the multi-item instruments used here. Performing such a study is outside the scope of this manuscript as it would require complete harmonisation of the questionnaires across cohorts and to focus solely on the multi-item instruments. Finally, we did not compare how single-item instruments on suicidality relate to broad depression or psychiatric interviews, future studies could tackle questions of convergence with broader constructs such as depression, and of temporal stability over time, using a similar strategy as the one we employed here.

While our sensitivity analysis only distinguished between lifetime and recent time frames, it is possible that these effects exist even within recent time frames such as instruments assessing current vs. past two week behaviours. These limitations might explain the lower concordance identified by our analysis compared to the literature synthesis, which studied a compendium of smaller albeit less heterogeneous studies. Additionally, while this was a large study, many sites had a distinct combination of measures collected (max N~ 2,500); therefore, we did not have the power to perform additional sensitivity analyses in adults or adolescents only. Future studies should focus on addressing this, as there are clear factors associated with suicidality (e.g., mood reactivity) that are more prevalent during adolescence (Armey et al., 2015). We were also unable to stratify our analyses by type of psychiatric diagnosis, while prior work found that the correlation between self-report and clinician-reported suicidal thoughts may differ across disorders (Gao et al., 2015; Kaplan et al., 1994). In addition, most sites in the analysis included single items from depression severity rating scales, and detailed information from multi-item instruments was only available for one-third of the participants. Finally, in this study we have only included cross-sectional data, which means we are unable to shed light on how different measures assess changes in ideation or suicidal behaviour over time. A recent study for instance has shown that the suicide item of the Ham-D was more strongly associated with suicidal ideation at follow-up, compared to the SSI and C-SSRS (McCall et al., 2021). Existing longitudinal studies suggest that suicidal ideation and suicidal behavior, measured here, are significant, weak predictors of death by suicide. Thus, this work may not shed light on the best tools to identify high risk individuals and states and instead is focused on convergence of existing tools.

Overall, our results suggest that the most commonly used instruments show a moderate to high concordance. Use of different measures of suicidality, might increase heterogeneity depending on the distinct dimensions and constructs assessed by each instrument. For research studies pooling data from multiple studies and interested in a broad suicidal risk construct, a strategy similar to ours could enable the implementation of composite scores by weighting more concordant measures heavily and penalising less concordant measures. This would enable the inclusion of data from cohorts which have only weakly related instruments to increase power, but at the same time prevent bias by weighing these measures accordingly. In the absence of such an approach, large-scale research collaborations could focus on strictly defined suicide constructs such as suicidal ideation, attempt, and intent that are preferentially defined using the most common instruments such as - amongst others - the HAM-D, MADRS, SSI, C-SSRS. In the absence of a common instrument, or when using an instrument that we found to have low concordance, sensitivity analyses could be performed to assess whether significant results are robust, or at least consistent, after excluding cohorts using the least common instruments. We strongly recommend that future studies that plan to collect data on suicidal thoughts and behaviours include one or more of the instruments that showed strong correlations with instruments such as the MADRS, SSI, and C-SSRS.

The moderate to high concordance between measures suggests that it is possible to harmonize across instruments and pool data from different research studies. However, it is important to acknowledge that the choice for a specific measure in a clinical setting depends on the construct that is of interest, as various instruments assess distinct constructs, for instance intent, ideation, or attempt. Therefore, these measures cannot be blindly used interchangeably. These instruments may additionally differ in terms of detail and administration time, which may also impact which measures could be preferred in a clinical setting. For research settings our results suggest that all of these factors will likely increase heterogeneity, and statistical adjustment or sensitivity analyses, along with increasing sample sizes, will be necessary for gaining insights into the aetiology of suicide.

Supplementary Material

Supplemental Material

Key points:

  • Question: To inform future suicide research in multi-site international consortia, it is important to examine how different suicide measures relate to each other and whether they can be used interchangeably.

  • Findings: Findings suggest detailed instruments (such as the Columbia Suicide Severity Rating Scale and Beck Scale for Suicidal Ideation) provide valuable information on suicidal thoughts and behaviour, and share a core factor with items on suicidal ideation from depression severity rating scale (such as the Hamilton Depression Rating Scale or the Beck Depression Inventory).

  • Importance: Results from international collaborations can mitigate biases by harmonising distinct suicide risk assessment instruments and items related to suicide.

  • Next steps: Pooling data within international suicide research consortia may reveal novel clinical, biological and cognitive correlates of suicidal thoughts and/or behaviour.

Acknowledgements:

MER received support from the Australian National Health and Medical Research Council (NHMRC) Centre for Research Excellence on Suicide Prevention (CRESP) [GNT1042580]. LS, LvV, LC, HPB, ALvH, SB and MD were supported by the MQ Brighter Futures Award MQBFC/2. LvV, LS and NJ were supported by the National Institute of Mental Health of the National Institutes of Health under Award Number R01MH117601. LS is supported by a NHMRC Career Development Fellowship (1140764). HPB was additionally supported by: R61MH111929RC1MH088366, R01MH070902, R01MH069747, American Foundation for Suicide Prevention, International Bipolar Foundation, Brain and Behavior Research Foundation, For the Love of Travis Foundation and Women’s Health Research at Yale. LC was additionally supported by Interdisziplinäres Zentrum für Klinische Forschung, UKJ. PMT was supported in part by NIH grant R01 MH116147. FJ would like to thank dr. S. Richard-Devantoy for his assistance during data collection. This work was supported by the CIBERSAM and the Catalonian Government (2014-SGR-1573 and 2017-SGR-1271 to FIDMAG). PF-C is funded by the Instituto de Salud Carlos III, co-funded by European Union (ERDF/ESF, “Investing in your future”): Sara Borrell contract (CD19/00149). The researchers based in Milan were supported by an Italian Ministry of Health grant RF-2011-02349921. ALvH was funded through the Leiden University Social Safety and Resilience Programme. The NIMH-ETPB team was supported by the Intramural Research Program at the National Institute of Mental Health, National Institutes of Health (IRP-NIMH-NIH; ZIAMH002927). MS is supported by the Phyllis and Jerome Lyle Rappaport Foundation, Ad Astra Chandaria Foundation, BIAL Foundation, Brain and Behavior Research Foundation, Anonymous donors, and the Center for Depression, Anxiety, and Stress Research at McLean Hospital. The team based at the University of Minnesota was supported by the National Institute of Mental Health (K23MH090421), the National Alliance for Research on Schizophrenia and Depression, the University of Minnesota Graduate School, the Minnesota Medical Foundation, and the Biotechnology Research Center (P41 RR008079 to the Center for Magnetic Resonance Research), University of Minnesota, and the Deborah E. Powell Center for Women’s Health Seed Grant. IHG was supported by the National Institute of Mental Health grant R37MH101495. EV thanks the support of the Spanish Ministry of Science and Innovation (PI15/00283, PI18/00805) integrated into the Plan Nacional de I+D+I and co-financed by the ISCIII-Subdirección General de Evaluación and the Fondo Europeo de Desarrollo Regional (FEDER); the Instituto de Salud Carlos III; the CIBER of Mental Health (CIBERSAM); the Secretaria d’Universitats i Recerca del Departament d’Economia i Coneixement (2017 SGR 1365), the CERCA Programme, and the Departament de Salut de la Generalitat de Catalunya for the PERIS grant SLT006/17/00357. NV thanks the support of a BITRECS project that has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 754550 and from “La Caixa” Foundation (ID 100010434), under the agreement LCF/PR/GN18/50310006. For the team in Nimes, CINES grants access to HPC facilities (A0100311413). The team based in Rome was supported by an Italian Ministry of Health grant RC17-18-19-20-21/A. Support for the TIGER study includes the Klingenstein Third Generation Foundation, the National Institute of Mental Health (K01MH117442), the Stanford Maternal Child Health Research Institute, and the Stanford Center for Cognitive and Neurobiological Imaging. TCH receives partial support from the Ray and Dagmar Dolby Family Fund. The UCSF site was supported by the National Center for Complementary and Integrative Health (NCCIH) R21AT009173 and R61AT009864 to TTY; by the National Center for Advancing Translational Sciences (CTSI), National Institutes of Health, through UCSF-CTSI UL1TR001872 to TTY; by the American Foundation for Suicide Prevention (AFSP) SRG-1-141-18 to TTY; by UCSF Research Evaluation and Allocation Committee (REAC) and J. Jacobson Fund to TTY; by the National Institute of Mental Health (NIMH) R01MH085734 and the Brain and Behavior Research Foundation (formerly NARSAD) to TTY. The team at Duke University/Durham VA medical center were supported by the VA Mid-Atlantic Mental Illness Research Education and Clinical Center (MIRECC). The Moral Dilemma study was supported by the Brain & Behavior Research Foundation, and by the National Health and Medical Research Council (ID 1125504 to SLW). The FOR2107 Marburg site was supported by the German Research Foundation (DFG, grant FOR2107 KI588/14-1 and FOR2107 KI588/14-2 to Tilo Kircher, Marburg). The FOR2107 Muenster site was supported by the German Research Foundation (DFG, grant FOR2107 DA1151/5-1 and DA1151/5-2 to UD; SFB-TRR58, Projects C09 and Z02 to UD) and the Interdisciplinary Center for Clinical Research (IZKF) of the medical faculty of Münster (grant Dan3/012/17 to UD). CGD and BJH were supported by the National Health and Medical Research Council of Australia (NHMRC) Project Grants (1064643 and 1024570). NF was supported by the National Institute of Mental Health (MH111671) and National Center for Complementary and Integrative Health (R01AT011267). TJ was supported by the National Institutes Health Project Grants (MH098212) to TJ. JR and LF thank the support of the Spanish Ministry of Science and Innovation (PI19/00394 and CPII19/00009) integrated into the Plan Nacional de I+D+I and co-financed by the ISCIII-Subdirección General de Evaluación and the Fondo Europeo de Desarrollo Regional (FEDER); the Instituto de Salud Carlos III; the CIBER of Mental Health (CIBERSAM). YH, AN, ES were supported by AMED Brain/MINDS Beyond program Grant No. 20dm0307002, YH was supported by JSPS KAKENHI Grants No. 19K03309. AFFDIS was funded by the University Medical Center Goettingen (UMG Startfoerderung), TEG and RGM were supported by German Federal Ministry of Education and Research (Bundesministerium fuer Bildung und 664 Forschung, BMBF: 01 ZX 1507, “PreNeSt - e:Med”).

Footnotes

Disclosures: Eduard Vieta has received grants and served as consultant, advisor or CME speaker for the following entities: AB-Biotics, Abbott, AbbVie, Angelini, Boehringer-Ingelheim, Dainippon Sumitomo Pharma, Ferrer, Gedeon Richter, GH Research, Janssen, Lundbeck, Novartis, Otsuka, Sage, Sanofi-Aventis, Sunovion, and Takeda, outside the submitted work. Carlos Zarate is a full-time U.S. government employee. He is listed as a co-inventor on a patent for the use of ketamine and its metabolites in major depression and suicidal ideation. Dr. Zarate has assigned his patent rights to the U.S. government but will share a percentage of any royalties that may be received by the government. Neda Jahanshad and Paul M. Thompson received partial grant support from Biogen, Inc. (Boston, USA) for research unrelated to this manuscript. Norma Verdolini has received financial support for CME activities and travel funds from the following entities (unrelated to the present work): Angelini, Janssen, Lundbeck, Otsuka. Emilie Olié has received consultancy honoraria from Janssen Cilag. Kerry J. Ressler serves on advisory boards for Takeda, Janssen, and Verily, and he has received sponsored research support from Alkermes, Alto Neuroscience, and Brainsway, and he receives funding from NIH and the Brain and Behavior Research Fund. J. Douglas Steele has received funding via an honorarium associated with a lecture for Wyeth and funding from Indivior for a study on opioid dependency. Dan J. Stein has received research grants and/or consultancy honoraria from Lundbeck, Johnson & Johnson, Servier, and Takeda. All other co-authors have indicated no conflict of interest.

References

  1. Achenbach Thomas, Achenbach TM, Achenbach TM, and Achenbach T. 1991. “Manual for the Youth Self-Report and 1991 Profile.” https://www.scienceopen.com/document?vid=2d9fe2e3-04c5-4983-a979-2ed002bf63b7. [Google Scholar]
  2. Alicandro Gianfranco, Malvezzi Matteo, Gallus Silvano, La Vecchia Carlo, Negri Eva, and Bertuccio Paola. 2019. “Worldwide Trends in Suicide Mortality from 1990 to 2015 with a Focus on the Global Recession Time Frame.” International Journal of Public Health 64 (5): 785–95. [DOI] [PubMed] [Google Scholar]
  3. Armey Michael F., Schatten Heather T., Haradhvala Natasha, and Miller Ivan W.. 2015. “Ecological Momentary Assessment (EMA) of Depression-Related Phenomena.” Current Opinion in Psychology 4 (August): 21–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bates TC 2018. “Umx: A Helper Package for Structural Equation Modeling in OpenMx.” University of Edinburgh, Edinburgh, UK. doi. [Google Scholar]
  5. Bates TC, Neale MC, and Maes HH. 2016. “Umx: A Library for Structural Equation and Twin Modelling in R.” Journal of Statistical Software. [Google Scholar]
  6. Campos Adrian I., Thompson Paul M., Veltman Dick J., Pozzi Elena, van Veltzen Laura S., Jahanshad Neda, Baune Bernhard T., et al. 2020. “Brain Correlates of Suicide Attempt in 18,925 Participants across 18 International Cohorts.” medRxiv. https://www.medrxiv.org/content/10.1101/2020.05.06.20090191v1.full. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chorpita BF, Yim L, Moffitt C, Umemoto LA, and Francis SE. 2000. “Assessment of Symptoms of DSM-IV Anxiety and Depression in Children: A Revised Child Anxiety and Depression Scale.” Behaviour Research and Therapy 38 (8): 835–55. [DOI] [PubMed] [Google Scholar]
  8. Desseilles Martin, Perroud Nader, Guillaume Sébastien, Jaussent Isabelle, Genty Catherine, Malafosse Alain, and Courtet Philippe. 2012. “Is It Valid to Measure Suicidal Ideation by Depression Rating Scales?” Journal of Affective Disorders 136 (3): 398–404. [DOI] [PubMed] [Google Scholar]
  9. Erkoreka Leire, Ozamiz-Etxebarria Naiara, Ruiz Onintze, and Ballesteros Javier. 2020. “Assessment of Psychiatric Symptomatology in Bilingual Psychotic Patients: A Systematic Review and Meta-Analysis.” International Journal of Environmental Research and Public Health 17 (11). 10.3390/ijerph17114137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. First Michael B. 1997. Structured Clinical Interview for DSM-IV Axis I Disorders SCID-I: Clinician Version, Scoresheet. American Psychiatric Press. [Google Scholar]
  11. Gao Keming, Wu Renrong, Wang Zuowei, Ren Ming, Kemp David E., Chan Philip K., Conroy Carla M., Serrano Mary Beth, Ganocy Stephen J., and Calabrese Joseph R.. 2015. “Disagreement between Self-Reported and Clinician-Ascertained Suicidal Ideation and Its Correlation with Depression and Anxiety Severity in Patients with Major Depressive Disorder or Bipolar Disorder.” Journal of Psychiatric Research 60 (January): 117–24. [DOI] [PubMed] [Google Scholar]
  12. Hamilton M 1960. “A Rating Scale for Depression.” Journal of Neurology, Neurosurgery, and Psychiatry 23 (February): 56–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Harmelen Anne-Laura van, Schmaal Lianne, and Blumberg Hilary P.. 2020. “Imaging Suicidal Thoughts and Behavior: The Promise of Computational Models.” Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology 46 (1): 219–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Harris Charles R., Millman K. Jarrod, van der Walt Stéfan J., Gommers Ralf, Virtanen Pauli, Cournapeau David, Wieser Eric, et al. 2020. “Array Programming with NumPy.” Nature 585 (7825): 357–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Innamorati Marco, Pompili Maurizio, Lester David, Tatarelli Roberto, and Girardi Paolo. 2008. “Recreational Drug Use and Suicidality among Italian Young Adults.” Journal of Addictive Diseases 27 (4): 51–59. [DOI] [PubMed] [Google Scholar]
  16. Kaplan ML, Asnis GM, Sanderson WC, Keswani L, De Lecuona JM, and Joseph S. 1994. “Suicide Assessment: Clinical Interview vs. Self-Report.” Journal of Clinical Psychology 50 (2): 294–98. [DOI] [PubMed] [Google Scholar]
  17. Kaufman Joan, Birmaher Boris, Brent David, Rao Uma, Flynn Cynthia, Moreci Paula, Williamson Douglas, and Ryan Neal. 1997. “Schedule for Affective Disorders and Schizophrenia for School-Age Children-Present and Lifetime Version (K-SADS-PL): Initial Reliability and Validity Data.” Journal of the American Academy of Child & Adolescent Psychiatry. 10.1097/00004583-199707000-00021. [DOI] [PubMed] [Google Scholar]
  18. McCall WV, Porter B, Pate AR, Bolstad CJ, Drapeau CW, Krystal AD, … & Nadorff MR (2021). Examining suicide assessment measures for research use: Using item response theory to optimize psychometric assessment for research on suicidal ideation in major depressive disorder. Suicide and Life-Threatening Behavior, 51(6), 1086–1094. [DOI] [PubMed] [Google Scholar]
  19. Montgomery SA, and Asberg M. 1979. “A New Depression Scale Designed to Be Sensitive to Change.” The British Journal of Psychiatry: The Journal of Mental Science 134 (April): 382–89. [DOI] [PubMed] [Google Scholar]
  20. Mullins Niamh, Kang Jooeun, Campos Adrian I., Coleman Jonathan R. I., Edwards Alexis C., Galfalvy Hanga, Levey Daniel F., et al. 2022. “Dissecting the Shared Genetic Architecture of Suicide Attempt, Psychiatric Disorders, and Known Risk Factors.” Biological Psychiatry 91 (3): 313–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Nock Matthew K., Holmberg Elizabeth B., Photos Valerie I., and Michel Bethany D.. 2007. “Self-Injurious Thoughts and Behaviors Interview: Development, Reliability, and Validity in an Adolescent Sample.” Psychological Assessment 19 (3): 309–17. [DOI] [PubMed] [Google Scholar]
  22. Nurnberger John I., Blehar Mary C., Kaufmann Charles A., York-Cooler Carolyn, Simpson Sylvia G., Harkavy-Friedman Jill, Severe Joanne B., Malaspina Dolores, and Reich Theodore. 1994. “Diagnostic Interview for Genetic Studies: Rationale, Unique Features, and Training.” Archives of General Psychiatry 51 (11): 849–59. [DOI] [PubMed] [Google Scholar]
  23. Poznanski Elva O., and Mokros Hartmut B.. 1996. Children’s Depression Rating Scale, Revised (CDRS-R). Western Psychological Services Los Angeles. [Google Scholar]
  24. Reynolds William M. 1987. “Suicidal Ideation Questionnaire (SIQ).” Odessa, FL: Psychological Assessment Resources. http://www.v-psyche.com/doc/Clinical%20Test/Suicidal%20Ideation%20Questionnaire.doc. [Google Scholar]
  25. Sheehan David V., Lecrubier Yves, Sheehan K. Harnett, Amorim Patricia, Janavs Juris, Weiller Emmanuelle, Hergueta Thierry, Baker Roxy, and Dunbar Geoffrey C.. 1998. “The Mini-International Neuropsychiatric Interview (M.I.N.I): The Development and Validation of a Structured Diagnostic Psychiatric Interview for DSM-IV and ICD-10.” The Journal of Clinical Psychiatry 59 (20): 22–33. [PubMed] [Google Scholar]
  26. Stefansson J, Nordström P, Runeson B, Åsberg M, and Jokinen J. 2015. “Combining the Suicide Intent Scale and the Karolinska Interpersonal Violence Scale in Suicide Risk Assessments.” BMC Psychiatry 15 (September): 226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Suzuki Masahiro, Dallaspezia Sara, Locatelli Clara, Uchiyama Makoto, Colombo Cristina, and Benedetti Francesco. 2016. “Discrepancy between Subjective and Objective Severity as a Predictor of Response to Chronotherapeutics in Bipolar Depression.” Journal of Affective Disorders 204 (November): 48–53. [DOI] [PubMed] [Google Scholar]
  28. Velzen Laura S. van, Dauvermann Maria R., Colic Lejla, Villa Luca M., Savage Hannah S., Toenders Yara J., Zhu Alyssa H., et al. 2021. “Structural Brain Alterations Associated with Suicidal Thoughts and Behaviors in Young People: Results across 21 International Studies from the ENIGMA Suicidal Thoughts and Behaviours Consortium.” medRxiv, September, 2021.09.27.21264068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Virtanen Pauli, Gommers Ralf, Oliphant Travis E., Haberland Matt, Reddy Tyler, Cournapeau David, Burovski Evgeni, et al. 2020. “SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python.” Nature Methods 17 (3): 261–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Watson David, O’Hara Michael W., Naragon-Gainey Kristin, Koffel Erin, Chmielewski Michael, Kotov Roman, Stasik Sara M., and Ruggero Camilo J.. 2012. “Development and Validation of New Anxiety and Bipolar Symptom Scales for an Expanded Version of the IDAS (the IDAS-II).” Assessment 19 (4): 399–420. [DOI] [PubMed] [Google Scholar]
  31. WHO. 1997. “CIDI-Auto Version 2.1: Administrator’s Guide and Reference.” [Google Scholar]
  32. World Health Organization. 2014. Preventing Suicide: A Global Imperative. World Health Organization. [Google Scholar]
  33. Yigletu H, Tucker S, Harris M, & Hatlevig J (2004). Assessing Suicide Ideation:Comparing Self-Report Versus Clinician Report. In Journal of the American Psychiatric Nurses Association (Vol. 10, Issue 1, pp. 9–15). 10.1177/1078390303262655 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

RESOURCES