Skip to main content
General Psychiatry logoLink to General Psychiatry
. 2025 Sep 14;38(5):e101957. doi: 10.1136/gpsych-2024-101957

Predictive value of biomarker signatures for suicide risk in hospitalised patients with major depressive disorders: a multicentre study in Shanghai

Enzhao Zhu 1,0,0,0, Jiayi Wang 1,0,0, Zheya Cai 1, Guoquan Zhou 2,0, Chunbo Li 3,4,1, Fazhan Chen 1,5,1, Kang Ju 6, Liangliang Chen 6, Yichao Yin 6, Yi Chen 7,8, Yanping Zhang 9, Siqi Liu 2, Xu Zhang 1, Jianmeng Dai 1, Qianyi Yu 1, Jianping Qiu 2, Hui Wang 2, Weizhong Shi 10, Feng Wang 5, Dong Wang 5, Zhihao Chen 11, Jiaojiao Hou 12, Hui Li 3,4,*,1, Zisheng Ai 5,13,14,✉,1
PMCID: PMC12434745  PMID: 40959771

Abstract

Background

Biomarkers for predicting suicide risk in hospitalised patients with mental disorders have been understudied. Currently, suicide risk assessment tools based on objective indicators are limited in China.

Aims

To examine the value of various biomarkers in suicide risk prediction and develop a risk assessment model with clinical utility using machine learning.

Methods

This cohort study analysed patients with major depressive disorder (MDD) who were hospitalised for the first time between January 2016 and March 2023 from four specialised mental health institutions. A total of 139 features, including biomarker measurements, medical orders and psychological scales, were assessed for analysis. Their suicide risk was evaluated by qualified nurses using Nurse’s Global Assessment of Suicide Risk within 1 week after admission. Five machine learning models were trained with 10-fold cross-validation across three hospitals and were externally validated in an independent cohort. The primary performance was assessed using the area under the receiver operating characteristic curve (AUROC). The model was interpreted using the SHapley Additive exPlanations (SHAP) analysis. Biomarker importance was evaluated by comparing model performance with and without these biomarkers.

Results

Of 3143 patients with MDD included in this study, the incidence of high suicide risk within 1 week after first admission was 660 (21.0%). Among all models, the Extreme Gradient Boosting can more effectively predict future risks, with an AUROC higher than 0.8 (p<0.001). The SHAP values identified the 10 most important features, including five biomarkers. After clustering analysis, electroconvulsive therapy, physical restraint, β2-microglobulin and triiodothyronine were found to have heterogeneous effects on suicide risk. Combining biomarkers with other data from electronic health records significantly improved the performance and clinical utility of machine learning models based on demographics, diagnosis, laboratory tests, medical orders and psychological scales.

Conclusions

This study demonstrates the potential for a biomarker-based suicide risk assessment for patients with MDD, emphasising the interaction between biomarkers and therapeutic interventions.

Keywords: Depressive Disorder, Major; Suicide; Models, Statistical; Psychiatry


WHAT IS ALREADY KNOWN ON THIS TOPIC

  • Existing suicide risk assessment tools have certain limitations. Biomarkers that can predict suicide risk have been rarely studied.

WHAT THIS STUDY ADDS

  • Machine learning models based on biomarkers and other patient data can effectively predict future risk of suicide. Incorporating biomarker measurements into prognostic models significantly improved clinical utility in scenarios when symptom data were limited.

HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

  • Biomarkers have the potential to improve the predictive performance of suicide risk in clinically relevant scenarios.

Introduction

Suicide is a leading public health problem worldwide, accounting for more than 1.4% of the total mortality globally.1 Many individuals who died by suicide suffered from mental disorders, with major depressive disorder (MDD) present in half to two-thirds of these cases.2 In patients diagnosed with MDD, suicidal ideation and attempts are often the driving factors necessitating hospitalisation.3 During their hospitalisation, the prevalence of suicide attempts was 17.3% (95% confidence interval (CI) 12.4% to 23.7%), whereas the incidence of suicidal ideation during an acute episode within 1 month was 27.7% (95% CI 15.4% to 44.5%).4 Furthermore, compared with patients with MDD without a suicide attempt, those with a suicide attempt had a higher likelihood of hospitalisation within 3 years.5 Therefore, identifying individuals with MDD in hospitalisation who are at high risk of suicide represents a significant public health challenge and carries important clinical implications.

In China, the most common risk assessment tools for psychiatric inpatients are the Tool for Assessment of Suicide Risk and the Nurse’s Global Assessment of Suicide Risk (NGASR), which effectively assist professionals in identifying individuals at risk of suicide. Several risk factors have been examined, including psychological, familial, genetic and other biological factors.6 However, suicide assessments based on self-report or other-rated scales in clinical practice may be subject to inconsistency or unreliability of the evaluation.7 8 More importantly, these scales can only assess a patient’s current suicide risk. The biomarkers predicting potential future suicide risk were rarely studied.6 9 Therefore, the construction of a reliable and accurate screening model for future suicide risk based on objective data can compensate for the shortcomings of existing clinical assessment tools and provide evidence for the predictive value of biomarkers. This study aimed to examine the clinical value of various routinely tested biomarkers in suicide risk prediction and develop a prognostic model with clinical utility using machine learning.

Methods

This study followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis reporting guideline.10

Study participants and data source

Patient data were collected from four specialised mental health institutions (Shanghai Mental Health Center, Tongji University Mental Health Center, Shanghai Putuo District Mental Health Center and Shanghai Changning District Mental Health Center). This database includes admission information, diagnoses, scale tests, medical orders, laboratory tests and risk assessments for more than 20 000 patients. As a multicentre database, with Shanghai Mental Health Center being one of the four National Medical Centers for Mental Diseases, the data are highly representative and broadly reflect the inpatient depression population in China.

This study included detailed and comprehensive data on patients with MDD first hospitalised from 2016 to 2023. The aim of this study was to use machine learning to assess the predictive value of laboratory tests for the future risk of suicide in patients with MDD based on biomarkers and clinical data measured at or prior to admission. The eligibility criteria (figure 1A) were (1) 18–65 years old, (2) primary diagnosis of MDD on admission according to the International Classification of Diseases 10th Revision (ICD-10), (3) first hospitalisation and (4) resident in China. Exclusion criteria were (1) missing demographic information, (2) excessive missing of hospital records, (3) hospitalisation shorter than 7 days, (4) diagnosis of mental retardation, personality disorder or organic brain disorder, (5) severe physical illness, (6) long-term psychotropic drug use, (7) pregnancy or lactation and (8) no suicide risk assessment conducted. We specifically excluded patients who were already assessed as high risk for suicide at admission to ensure that the biomarkers were evaluated prior to any escalation in suicide risk.

Figure 1. The flowchart of the participants’ selection. ICD-10, International Classification of Diseases 10th Revision; MDD, major depressive disorder; NGASR, Nurse’s Global Assessment of Suicide Risk.

Figure 1

Procedure and outcome assessment

This retrospective cohort study simulated the prospective time sequence of the research procedures post hoc (figure 1B). Each patient was assigned a unique hospitalisation code. The patient’s first admission was identified by this code across the database. Demographic information included age and sex at first admission. We extracted whether the patient had concomitant psychotic symptoms based on ICD-10. From the medical order data, we extracted commonly used drug usage records, immediate enforcement of physical restraint, electroconvulsive therapy (ECT) and repetitive transcranial magnetic stimulation (TMS). The psychiatric scales included Zung Self-rating Depression Scale (SDS), Zung Self-rating Anxiety Scale (SAS), Life Event Scale (LES), and Symptom Checklist 90 (SCL-90), which are widely used in the study hospitals. The SCL-90 was recorded as factor scores and the other scales were recorded as total scores. Commonly used laboratory and other auxiliary examinations were collected, including complete blood count, liver function, renal function, blood lipid and glucose, coagulation function, hepatitis virus, immunological function, electrolyte, routine urinalysis, anaemia index, vitamin, thyroid function and myocardial infarction index. A total of 139 features, including 108 biomarkers, were assessed for analysis. See online supplemental table 1 for the detailed definition and explanation of each included variable.

The time of the included tests was recorded on the day of admission or within the previous week, as some patients may have had these tests conducted in outpatient clinics prior to admission. Considering that psychiatric drugs require time to take effect, medical order record data were extracted from 1 month before admission to the day of admission. As some patients may seek medical treatment in more than one hospital, medical order records of all patients were obtained across the study hospitals during that time period to reduce the impact of prescriptions from multiple places. The procedure to confirm the time frame of the data was as follows: we first extracted the testing/execution times for each variable, and then identified each patient’s unique hospitalisation number and admission date. We only included data collected on or prior to admission, ensuring a consistent time window across all variables. Furthermore, to ensure that the collected biomarkers and clinical data were assessed before any escalation in suicide risk, patients who were already identified as high risk for suicide at admission were excluded from the analysis. The timing of these measurements ensured that the biomarkers and other variables were assessed before any escalation in suicide risk, reinforcing their potential predictive role.

The outcome variable of this study was the risk of suicide assessed by the NGASR recorded in risk assessment records. The NGASR consists of 15 questions, and a total score of ≥9 indicates high suicide risk.11 12 The time of the outcome variables was set to be within 7 days of hospitalisation. According to institutional requirements and clinical pathways, the risk assessments were conducted weekly by qualified nurses.

Models

Models were built in Python V.3.8 with code packages of scikit-learn V.1.1.3, pytorch V.2.0.0, xgboost V.1.7.2, pykan V.0.0.5 and lightgbm V.3.3.3. Five machine learning and deep learning models were developed and compared, which included the logistic regression model (LRM), Extreme Gradient Boosting (XGBoost),13 Light Gradient Boosting Machine (LightGBM),14 multilayer perceptron (MLP) and Kolmogorov-Arnold Network (KAN).15 The LRM, XGBoost, LightGBM and MLP were commonly used in medical prediction modelling and achieved acceptable performance in previous studies.16 17 KAN is a recently proposed deep learning model and has received a lot of attention due to its superior performance over MLP in several tests.18 19 The Shanghai Mental Health Center was randomly chosen as an external testing cohort, while the remaining data source hospitals constituted a cross-validated cohort. The models were trained based on 10-fold cross-validation repeated 10 times in the cross-validated cohort for the best replicability.20 Random parameter search was used to turn the hyperparameters during cross-validation. Tuned hyperparameters for MLP and KAN included the number of layers and nodes, learning rate, the percentage of dropout and the mini-batch size, while the number of trees, tree depth and learning rate were tuned for XGBoost and LightGBM. To account for label imbalance, the training samples were oversampled to ensure that the negative and positive samples were equally distributed. Patients with more than 20% of missing values were excluded. For models that do not allow missing values, missing information was filled with the median within the cohort.

The model was interpreted using the SHapley Additive exPlanations (SHAP) analysis,21 which can explain the output of any machine learning model with a game theoretic approach. In this study, the average SHAP value for each feature is greater than zero, indicating that the presence or increase of this feature results in an added positive probability (predicted suicide risk probability) that the model outputs.

Statistical analyses

Initial data processing was conducted in PostgreSQL V.4.2. Model performance metrics were calculated in Python V.3.8 with the following packages: scikit-learn, pandas and numpy. Further data description and analysis were conducted in R V.4.1.3. Continuous variables were reported as mean (standard deviation (SD)) or median (interquartile range (IQRs)), depending on the normality of their distribution. The Anderson-Darling test was used to assess the normality of distributions. Categorical variables were presented as frequencies/numbers and percentages (%). Models were evaluated in 10-fold cross-validation with averaged performance and verified for generalisability in the external testing cohort. Primary model performance was evaluated using the area under the receiver operating characteristic curve (AUROC). Secondary metrics included area under the precision-recall curve (AUPRC), accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value and F1 score. Reported values are averages and 95% CI obtained from bootstrapping with 1000 iterations. Feature importance was assessed by measuring gain, that is, the average increase in F1 score brought by a feature to the branches it appeared on, and cumulative importance was calculated as the sum of individual feature importance within feature categories. The impact of features on model predictions was determined using SHAP values. For better visualisation and interpretation, the K-means clustering algorithm is used to downscale the SHAP values. Silhouette score was used to evaluate the clustering effect. Decision curve analysis (DCA) was used to estimate the clinical utility (net benefit) of predictive models with and without biomarkers at varying diagnostic thresholds. The blank model did not include any features except for baseline prevalence. Model metrics were compared using Nadeau and Bengio’s corrected resampled t-test,22 which adjusts for the non-independence of resampled statistics. The association of biomarker measurements with patient symptom data was evaluated using Pearson correlation. The p values less than 0.05 were considered statistically significant. Holm-Bonferroni adjustment was used to adjust for multiple testing.

Results

Patients

A total of 3143 first-admitted patients with MDD who met the inclusion criteria were included in this study. Of these, 1912 were assigned to the cross-validation cohort, and 1231 were assigned to the external testing cohort. The mean (SD) age was 42 (12.818) years; 1574 (50.1%) patients were male, and 456 (14.5%) had concomitant psychotic symptoms. The overall incidence of high suicide risk within 1 week after first admission was 660 (21.0%). See online supplemental table 2 in the supplementary file for a detailed overview of patient characteristics.

Model performance

Detailed model performance metrics in the cross-validation and the external testing cohorts are presented in table 1 and online supplemental table 3 in the supplementary file. The right-sided p values test the hypothesis that the AUROC of the model is better than that of the blank model. The AUROCs of all trained models were significantly superior to a random guess in both the cross-validated cohort and the external testing cohort (p<0.001). In the external testing cohort, XGBoost achieved the best AUROC (0.828, 95% CI 0.824 to 0.833), AUPRC (0.605, 95% CI 0.595 to 0.617), accuracy (0.819, 95% CI 0.814 to 0.823), PPV (0.660, 95% CI 0.649 to 0.674) and F1 score (0.552, 95% CI 0.541 to 0.561), consistent with its performance in the cross-validated cohort. Thus, XGBoost was determined as the best model in this study based on the primary performance metric.

Table 1. Summary of model performance in the external testing cohort.

Model AUROC (95% CI) AUPRC (95% CI) F1 score (95% CI) P value
LRM 0.484 (0.476 to 0.490) 0.223 (0.216 to 0.229) 0.371 (0.365 to 0.377) 0.960
LightGBM 0.774 (0.769 to 0.780) 0.530 (0.517 to 0.543) 0.390 (0.379 to 0.400) <0.001***
XGBoost 0.828 (0.824 to 0.833) 0.605 (0.595 to 0.617) 0.552 (0.541 to 0.561) <0.001***
MLP 0.517 (0.511 to 0.521) 0.241 (0.235 to 0.246) 0.384 (0.378 to 0.388) 0.036*
KAN 0.505 (0.504 to 0.506) 0.236 (0.232 to 0.240) 0.381 (0.375 to 0.386) <0.001***

This table summarises several core performance metrics of different models in the external testing cohort, with the reported values representing point estimates and their corresponding 95% CIs.

*p<0.05; ***p<0.001.

The 95% CI was calculated using bootstrap with 1000 iterations. The p value was calculated using right-sided Nadeau and Bengio’s corrected resampled t-test, testing the hypothesis that the AUROC is better than the blank model.

AUPRC, area under the precision-recall curve; AUROC, area under the receiver operating characteristic curve; CI, confidence interval; KAN, Kolmogorov-Arnold network; LightGBM, Light Gradient Boosting Machine; LRM, logistic regression model; MLP, multilayer perceptron; XGBoost, Extreme Gradient Boosting.

The ROC curves are presented in figure 2A, 2B, indicating good classifier performance (p<0.001).23 The calibration curves are demonstrated in figure 2C, 2D, with a mean (SD) Brier score loss of 0.139 (0.008) in the external testing cohort, significantly lower than that of the blank model (p<0.001). The DCA is shown in Figure 2E, 2F, with the optimal cut-off threshold determined by Youden’s index ranging from 7.685% to 9.753%.

Figure 2. Model performance and clinical utility. (A) The receiver operating characteristic curve in the cross-validated cohort. (B) The receiver operating characteristic curve in the external testing cohort. (C) The calibration curve in the cross-validated cohort. (D) The calibration curve in the external testing cohort. (E) Decision curve analysis in the cross-validated cohort. (F) Decision curve analysis in the external testing cohort. SD in the bootstrapping process. The p value was calculated using right-sided Nadeau and Bengio’s corrected resampled t-test, testing the hypothesis that the score is better than the blank model. None: no patients are treated, assuming no one is at risk. All: all patients are treated, assuming everyone is at risk. XGBoost, Extreme Gradient Boosting. These lines show the outcomes of treating none or all patients, helping to see if the model adds benefit. AUROC, area under the receiver operating characteristic curve; SD, standard deviation.

Figure 2

Feature importance

The ten most important variables determined by the average gain of the cross-validated cohort within XGBoost model included TMS, urine glucose positive, mean corpuscular haemoglobin, SDS, psychotic symptom, ECT, urine glucose negative, mean corpuscular haemoglobin concentration, SAS and age (online supplemental figure 1). Subsequently, in the external testing cohort, a SHAP summary plot (online supplemental figure 2) identified the ten features that have the greatest impact on the model outputs. The ranking according to importance was ECT, olanzapine (OLA), thyroid-stimulating hormone (TSH), physical restraint, age, β2-microglobulin (BMG), psychotic symptoms, uric acid (UA), cystatin C (CYS-C), and triiodothyronine (T3).

The individual correlation between each feature and SHAP values with linear interpolation is presented in online supplemental figure 3, showing the contribution of each feature to the outcome. The SHAP feature interaction plot illustrates the interaction of each feature with the one exhibiting the strongest potential interaction (online supplemental figure 4). Based on the SHAP values, ECT had the strongest interaction with five characteristics (TSH, UA, physical restraint, age and psychotic symptoms), including two biomarkers. BMG interacted with age and OLA. T3 and CYS-C interacted with the psychotic symptoms.

To provide a group-level feature contribution while considering the potential interaction, individual SHAP values of each feature were clustered using the K-means clustering algorithm. The optimal number of clusters was two groups, determined by the silhouette score (online supplemental figure 5). The clustered SHAP heatmap is presented in figure 3, which demonstrates a quantitative heterogeneous response of ECT, physical restraint, BMG and T3 to the predicted suicide risk. The clustered SHAP decision curves (online supplemental figure 6) showed the average contribution of each feature on model outputs within clusters. The Pearson correlation between the five most important biomarkers with psychopathology (online supplemental figure 7) indicated that UA was positively associated with paranoid and psychotic symptoms; T3 was related to somatisation symptoms, psychotic symptoms and the total score of SCL-90.

Figure 3. Clustered SHAP values heatmap. BGM, β2-microglobulin; CYS-C, cystatin C; ECT, electroconvulsive therapy; OLA, olanzapine; SHAP, SHapley Additive exPlanations; T3, triiodothyronine; TSH, thyroid-stimulating hormone; UA, uric acid.

Figure 3

Added benefits of biomarkers

Combining biomarkers with treatment data (0.039, 95% CI 0.021 to 0.058, p<0.001), SAS (0.054, 95% CI 0.035 to 0.074, p<0.001), SDS (0.066, 95% CI 0.044 to 0.085, p<0.001), SCL-90 (0.074, 95% CI 0.056 to 0.091, p<0.001) and LES (0.053, 95% CI 0.036 to 0.069, p<0.001) led to significant improvements in the AUROC of prognostic models (figure 4A). Regarding clinical utility under optimal decision threshold (figure 4B), biomarkers significantly enhanced performance within seven categories (demographic: 0.408, 95% CI 0.390 to 0.425, p<0.001; psychotic symptom: 0.315, 95% CI 0.297 to 0.331, p<0.001; treatment: 0.102, 95% CI 0.088 to 0.117, p<0.001; SAS: 0.225, 95% CI 0.212 to 0.237, p<0.001; SDS: 0.200, 95% CI 0.186 to 0.214, p<0.001; SCL-90: 0.126, 95% CI 0.118 to 0.135, p<0.001; LES: 0.134, 95% CI 0.119 to 0.143, p<0.001).

Figure 4. Added benefits of biomarker measurements. (A) Added predictive value of biomarker measurements; (B) Added clinical utility of biomarker measurements. The p value was calculated using right-sided Nadeau and Bengio’s corrected resampled t-test, testing the hypothesis that the score is better than blank model. Blank, blank model; LES, Life Event Scale; SAS, Zung Self-rating Anxiety Scale; SCL-90, Symptom Checklist 90; SDS, Zung Self-rating Depression Scale. ***p < 0.001.

Figure 4

Discussion

Main findings

Our findings suggest that incorporating biomarker measurements into prognostic models based on self-reported, demographic or symptomatic data enhances their ability to predict future suicide risk and may lead to clinically relevant improvements. The improved clinical utility was particularly evident in scenarios where information on psychiatric symptoms or treatment variables was unavailable, suggesting that biomarker tests may be especially beneficial to patients who do not accurately report their symptoms or whose treatment histories are unclear.

Among our models, XGBoost has the best predictive accuracy and stability. The AUROC ranged from 0.828 to 0.861, indicating a good classifier performance23 and significantly better than chance. The calibration analysis24 and DCA analysis25 also indicate a significant clinical utility of XGBoost in the screening of future suicide risk in psychiatry.

Several important features are identified through average gain in cross-validation and SHAP values. The average gain is a measure of the variable’s contribution to accuracy, while the SHAP value evaluates the impact on the predicted probability of suicide risk. We found that the 10 most important variables included five biomarkers: TSH, BMG, UA, CYS-C and T3. Our findings reaffirm the significant associations between TSH,26 UA,27 T39 and CYS-C28 levels and suicide risk, consistent with prior research. Notably, we also identify BMG as an additional biomarker, providing novel insights beyond existing studies, which largely focused on individuals with depression. These results underscore the potential utility of a broader biomarker profile, particularly thyroid and renal function indicators, in understanding and assessing suicide risk across diverse populations.

Moreover, the K-means algorithm identifies two clusters. Most clustered SHAP values showed similar trends to the univariate values. However, within one of the clusters, ECT, physical restraint, BMG and T3 showed opposite effects. As evidenced by the interaction plot, these features with heterogeneous effects typically interacted with others. Our results reveal a strong interaction between ECT and the biomarkers TSH and UA, consistent with the significant roles of TSH and UA in suicide risk prediction discussed earlier. A previous study found that the hypothalamic–pituitary–thyroid axis stands out in the effect of the ECT mechanism. An acute increase in serum TSH levels and a decrease in the TSH response to thyroid-releasing hormone after ECT treatment have been reported.29 The relationship between ECT and UA is indirect but mediated through metabolic and endocrine pathways. Research suggested that in males with normal thyroid function, higher thyroid hormone was associated with a lower risk of hyperuricaemia.30 Additionally, patients with bipolar disorder exhibit reduced adenosine triphosphate and adenosine levels, leading to increased UA.31 Given that ECT can influence both thyroid hormones and adenosine availability, it may indirectly modulate UA levels.

The impact of ECT on suicide risk itself has been widely studied, though findings are mixed: while most studies suggest ECT reduces suicide risk,32 33 some indicate no effect34 and a few suggest an increased risk in certain populations.35 In our study, the cluster-based SHAP analysis indicates that ECT may elevate suicide risk. The inconsistency in previous findings on the association between ECT and suicide risk may be attributed to differences in depression severity or subtype (such as mild depression, MDD or treatment-resistant depression), treatment course and the timing of suicide risk assessment.33 Given the complex interactions between ECT and multiple variables, further research is warranted to clarify these effects.

Strengths and limitations

The analysis benefited from the extensively characterised, multicentred and chronological real-world cohort. This study therefore provides real-world evidence based on a large sample size and rigorous design. Limitations include that the data comes from only hospitalised patients in a single city, which may lead to potential selection bias. Missing information, as well as the reliability and consistency of self-reported data, is also likely to introduce confounders.

Conclusions

This cohort study identifies a reproducible biomarker signature in patients with MDD that effectively predicts future risk of suicide and enhances the predictive value and clinical utility of prognostic models based on self-reported and other commonly accessible patient information. Our results suggest potential interactions between features, which may present heterogeneous effects on suicide risk.

Supplementary material

online supplemental figure 1
gpsych-38-5-s001.docx (1.6MB, docx)
DOI: 10.1136/gpsych-2024-101957

Acknowledgements

The authors of this study thank the hospital information department staff and directors including Zhihao Chen (East China University of Science and Technology), Dong Wang (Tongji University Mental Health Center), Feng Wang (Tongji University Mental Health Center), Yi Gu (Shanghai Putuo District Mental Health Center), Yu Mei (Shanghai Mental Health Center), Yifan Liu (Shanghai Mental Health Center) and Yichao Yin (Shanghai Changning Mental Health Center) for their work on data collection, management and curation; hospital and institutional directors including Hua Wang (Shanghai Putuo District Mental Health Center), Fazhan Chen (Tongji University Mental Health Center), Dianxu Feng (Shanghai Putuo District Health Committee), Guoquan Zhou (Shanghai Putuo District Mental Health Center), Weizhong Shi (Shanghai Hospital Development Center), Liangliang Chen (Shanghai Changning District Mental Health Center), Hui Li (Shanghai Mental Health Center), Chunbo Li (Shanghai Mental Health Center), Hong Qiu (Shanghai Mental Health Center), Lei Wang (Tongji University), Gang Zhu (Shanghai Municipal Finance Bureau), Guozhen Lin (Shanghai Ruijin Hospital), Yanping Zhang (Shanghai Jinshan District Mental Health Center), Xuehui Li (Shanghai Ruijin Hospital) for their work on project administration. They also thank other staff affiliated with the data source hospitals and colleges for their work on patient assessment, data collection and technical guidance.

Biography

Enzhao Zhu graduated from Tongji University School of Medicine, China in 2021 with a Bachelor of Science degree. He has been pursuing a PhD in the Department of Medical Statistics at Tongji University since 2021. His research focuses on digital mental health, leveraging advanced methodologies in medical statistics, deep learning, and large language models to enhance diagnostic and prognostic capabilities. His supervisor is Professor Zisheng Ai. Enzhao Zhu is adept in individual treatment effect estimation, causal inference techniques, general psychiatry-specific large language models, and automatic mental examination systems. His work aims to integrate artificial intelligence with clinical applications to optimise decision support systems in mental healthcare.

graphic file with name gpsych-38-5-g001.gif

Footnotes

Funding: This work was supported by projects from Shanghai Putuo District Municipal Health Committee (ptkwws202413); Shanghai Municipal Health Commission (202340018); Shanghai Hospital Development Center (Data Sharing and Emulation of Clinical Trials, CCS-DASET: SHDC2024CRI008); Shanghai Changning District Municipal Commission of Health (CNWJXY026); School of Innovation and Entrepreneurship, Tongji University (S202310247388, X2024085 and X2024048).

Provenance and peer review: Not commissioned; externally peer reviewed.

Patient consent for publication: Not applicable.

Ethics approval: This study was approved by the Ethics Committee of Tongji University Mental Health Center (Grant number: PDJW-IIT-2023-017CS), Shanghai Changning District Mental Health Center (Grant number: MR-31-24-043670), Shanghai Putuo District Mental Health Center (Grant number: M202409), and Shanghai Mental Health Center (Grant number: IRB00002733). Participants gave informed consent to participate in the study before taking part.

Data availability free text: The data analysed in this study are not available to the public in accordance with national legislation (Mental Health Law of the People’s Republic of China). Requests for data should be made through the corresponding author upon reasonable cause, subject to data license agreements with School of Medicine of Tongji University and related data source hospitals.

EZ and GZ are joint senior authors.

Data availability statement

Data are available upon reasonable request.

References

  • 1.Lovero KL, Dos Santos PF, Come AX, et al. Suicide in global mental health. Curr Psychiatry Rep. 2023;25:255–62. doi: 10.1007/s11920-023-01423-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Klonsky ED, May AM, Saffer BY. Suicide, suicide attempts, and suicidal ideation. Annu Rev Clin Psychol. 2016;12:307–30. doi: 10.1146/annurev-clinpsy-021815-093204. [DOI] [PubMed] [Google Scholar]
  • 3.Citrome L, Jain R, Tung A, et al. Prevalence, treatment patterns, and stay characteristics associated with hospitalizations for major depressive disorder. J Affect Disord. 2019;249:378–84. doi: 10.1016/j.jad.2019.01.044. [DOI] [PubMed] [Google Scholar]
  • 4.Dong M, Wang S-B, Li Y, et al. Prevalence of suicidal behaviors in patients with major depressive disorder in China: a comprehensive meta-analysis. J Affect Disord. 2018;225:32–9. doi: 10.1016/j.jad.2017.07.043. [DOI] [PubMed] [Google Scholar]
  • 5.Chartrand H, Robinson J, Bolton JM. A longitudinal population-based study exploring treatment utilization and suicidal ideation and behavior in major depressive disorder. J Affect Disord. 2012;141:237–45. doi: 10.1016/j.jad.2012.03.040. [DOI] [PubMed] [Google Scholar]
  • 6.Fazel S, Runeson B. Suicide. N Engl J Med. 2020;382:266–74. doi: 10.1056/NEJMra1902944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Su YA, Ye C, Xin Q, et al. Neuroimaging studies in major depressive disorder with suicidal ideation or behaviour among Chinese patients: implications for neural mechanisms and imaging signatures. Gen Psychiatr . 2024;37:e101649. doi: 10.1136/gpsych-2024-101649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Xu S, Ju Y, Wei X, et al. Network analysis of suicide ideation and depression-anxiety symptoms among Chinese adolescents. Gen Psychiatr . 2024;37:e101225. doi: 10.1136/gpsych-2023-101225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Li X-Y, Tabarak S, Su X-R, et al. Identifying clinical risk factors correlate with suicide attempts in patients with first episode major depressive disorder. J Affect Disord. 2021;295:264–70. doi: 10.1016/j.jad.2021.08.028. [DOI] [PubMed] [Google Scholar]
  • 10.Collins GS, Reitsma JB, Altman DG, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594. doi: 10.1136/bmj.g7594. [DOI] [PubMed] [Google Scholar]
  • 11.Li X, Ge H, Zhou D, et al. Reduced serum VGF levels are linked with suicide risk in Chinese Han patients with major depressive disorder. BMC Psychiatry. 2020;20:225. doi: 10.1186/s12888-020-02634-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cutcliffe JR, Barker P. The nurses’ global assessment of suicide risk (NGASR): developing a tool for clinical practice. Psychiatric Ment Health Nurs. 2004;11:393–400. doi: 10.1111/j.1365-2850.2003.00721.x. [DOI] [PubMed] [Google Scholar]
  • 13.Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2016. [DOI] [Google Scholar]
  • 14.Ke G, Meng Q, Finley T, et al. LightGBM: a highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems (NeurIPS) 2017;30:3146–54. [Google Scholar]
  • 15.Liu Z, Wang Y, Vaidya S, et al. KAN: Kolmogorov-Arnold Networks. ArXiv. 2024 [Google Scholar]
  • 16.Greener JG, Kandathil SM, Moffat L, et al. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. 2022;23:40–55. doi: 10.1038/s41580-021-00407-0. [DOI] [PubMed] [Google Scholar]
  • 17.Zhu E, Chen Z, Ai P, et al. Analyzing and predicting the risk of death in stroke patients using machine learning. Front Neurol. 2023;14:1096153. doi: 10.3389/fneur.2023.1096153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Wang Y, Sun J, Bai J, et al. Kolmogorov–Arnold-Informed neural network: a physics-informed deep learning framework for solving pdes based on Kolmogorov–Arnold Networks. SSRN. 2024 doi: 10.2139/ssrn.4868150. Preprint. [DOI]
  • 19.Seydi ST. Exploring the potential of polynomial basis functions in kolmogorov-arnold networks: a comparative study of different groups of polynomials. ArXiv. 2024 doi: 10.48550/arXiv.2406.02583. [DOI] [Google Scholar]
  • 20.Bouckaert RR, Frank E. PAKDD; 2004. Evaluating the replicability of significance tests for comparing learning algorithms. [Google Scholar]
  • 21.Lundberg SM, Lee S-I. Advances in neural information processing systems. NIPS; 2017. A unified approach to interpreting model predictions. [Google Scholar]
  • 22.Bengio Y, Nadeau C. Inference for the generalization error. Mach Learn. 2003;52:239–81. [Google Scholar]
  • 23.Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean J Anesthesiol. 2022;75:25–36. doi: 10.4097/kja.21209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kvamme H, Borgan Ø. The brier score under administrative censoring: problems and solutions. ArXiv. 2019 [Google Scholar]
  • 25.Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26:565–74. doi: 10.1177/0272989X06295361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Nuguru SP, Rachakonda S, Sripathi S, et al. Hypothyroidism and depression: a narrative review. Cureus. 2022;14:e28201. doi: 10.7759/cureus.28201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chen J-X, Feng J-H, Zhang L-G, et al. Association of serum uric acid levels with suicide risk in female patients with major depressive disorder: a comparative cross-sectional study. BMC Psychiatry. 2020;20:477. doi: 10.1186/s12888-020-02891-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sun T, Chen Q, Li Y. Associations of serum cystatin C with depressive symptoms and suicidal ideation in major depressive disorder. BMC Psychiatry. 2021;21:576. doi: 10.1186/s12888-021-03509-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Rojas M, Ariza D, Ortega Á, et al. Electroconvulsive therapy in psychiatric disorders: a narrative review exploring neuroendocrine–immune therapeutic mechanisms and clinical implications. Int J Mol Sci. 2022;23:6918. doi: 10.3390/ijms23136918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gu Y, Meng G, Zhang Q, et al. Predictive value of thyroid hormones for incident hyperuricemia in euthyroid subjects: the Tianjin chronic low-grade systemic inflammation and health cohort study. Endocr Pract. 2021;27:291–7. doi: 10.1016/j.eprac.2020.10.009. [DOI] [PubMed] [Google Scholar]
  • 31.Daniels SD, Boison D. Bipolar mania and epilepsy pathophysiology and treatment may converge in purine metabolism: a new perspective on available evidence. Neuropharmacology. 2023;241:S0028-3908(23)00346-5. doi: 10.1016/j.neuropharm.2023.109756. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kaster TS, Blumberger DM, Gomes T, et al. Risk of suicide death following electroconvulsive therapy treatment for depression: a propensity score-weighted, retrospective cohort study in Canada. Lancet Psychiatry. 2022;9:435–46. doi: 10.1016/S2215-0366(22)00077-3. [DOI] [PubMed] [Google Scholar]
  • 33.Rönnqvist I, Nilsson FK, Nordenskjöld A. Electroconvulsive therapy and the risk of suicide in hospitalized patients with major depressive disorder. JAMA Netw Open. 2021;4:e2116589. doi: 10.1001/jamanetworkopen.2021.16589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Watts BV, Peltzman T, Shiner B. Electroconvulsive therapy and death by suicide. J Clin Psychiatry. 2022;83:21m13886. doi: 10.4088/JCP.21m13886. [DOI] [PubMed] [Google Scholar]
  • 35.Spanggård A, Rohde C, Østergaard SD. Risk factors for suicide among patients having received treatment with electroconvulsive therapy: a nationwide study of 11,780 patients. Acta Psychiatr Scand. 2023;147:333–44. doi: 10.1111/acps.13536. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

online supplemental figure 1
gpsych-38-5-s001.docx (1.6MB, docx)
DOI: 10.1136/gpsych-2024-101957

Data Availability Statement

Data are available upon reasonable request.


Articles from General Psychiatry are provided here courtesy of Shanghai Mental Health Center

RESOURCES