Applying contrastive pre-training for depression and anxiety risk prediction in type 2 diabetes patients based on heterogeneous electronic health records: a primary healthcare case study

Wei Feng; Honghan Wu; Hui Ma; Zhenhuan Tao; Mengdie Xu; Xin Zhang; Shan Lu; Cheng Wan; Yun Liu

doi:10.1093/jamia/ocad228

. 2023 Dec 7;31(2):445–455. doi: 10.1093/jamia/ocad228

Applying contrastive pre-training for depression and anxiety risk prediction in type 2 diabetes patients based on heterogeneous electronic health records: a primary healthcare case study

Wei Feng ¹, Honghan Wu ^2,³, Hui Ma ⁴, Zhenhuan Tao ⁵, Mengdie Xu ⁶, Xin Zhang ^7,⁸, Shan Lu ^9,¹⁰, Cheng Wan ^11,^✉, Yun Liu ^12,^13,^✉

¹ Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China

² Institute of Health Informatics, University College London, London, WC1E 6BT, United Kingdom

³ The Alan Turing Institute, London, NW1 2DB, United Kingdom

⁴ Department of Medical Psychology, Nanjing Brain Hospital affiliated with Nanjing Medical University, Nanjing, Jiangsu, 210024, China

⁵ Department of Planning, Nanjing Health Information Center, Nanjing, Jiangsu, 210003, China

⁶ Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China

⁷ Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China

⁸ Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China

⁹ Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China

¹⁰ Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China

¹¹ Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China

¹² Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China

¹³ Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China

^✉

Corresponding authors: Yun Liu, PhD, Department of Information, The First Affiliated Hospital, Nanjing Medical University, No. 300 Rd. Guangzhou, Nanjing, Jiangsu, 210029, China (yun_liu@njmu.edu.cn). Cheng Wan, MS, Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, No.101 Rd. Longmian, Nanjing, Jiangsu, 210009, China (chengwan@njmu.edu.cn).

PMCID: PMC10797279 PMID: 38062850

Abstract

Objective

Due to heterogeneity and limited medical data in primary healthcare services (PHS), assessing the psychological risk of type 2 diabetes mellitus (T2DM) patients in PHS is difficult. Using unsupervised contrastive pre-training, we proposed a deep learning framework named depression and anxiety prediction (DAP) to predict depression and anxiety in T2DM patients.

Materials and Methods

The DAP model consists of two sub-models. Firstly, the pre-trained model of DAP used unlabeled discharge records of 85 085 T2DM patients from the First Affiliated Hospital of Nanjing Medical University for unsupervised contrastive learning on heterogeneous electronic health records (EHRs). Secondly, the fine-tuned model of DAP used case–control cohorts (17 491 patients) selected from 149 596 T2DM patients’ EHRs in the Nanjing Health Information Platform (NHIP). The DAP model was validated in 1028 patients from PHS in NHIP. Evaluation included receiver operating characteristic area under the curve (ROC-AUC) and precision-recall area under the curve (PR-AUC), and decision curve analysis (DCA).

Results

The pre-training step allowed the DAP model to converge at a faster rate. The fine-tuned DAP model significantly outperformed the baseline models (logistic regression, extreme gradient boosting, and random forest) with ROC-AUC of 0.91±0.028 and PR-AUC of 0.80±0.067 in 10-fold internal validation, and with ROC-AUC of 0.75 ± 0.045 and PR-AUC of 0.47 ± 0.081 in external validation. The DCA indicate the clinical potential of the DAP model.

Conclusion

The DAP model effectively predicted post-discharge depression and anxiety in T2DM patients from PHS, reducing data fragmentation and limitations. This study highlights the DAP model’s potential for early detection and intervention in depression and anxiety, improving outcomes for diabetes patients.

Keywords: EHR pre-trained model, type 2 diabetes mellitus, depression and anxiety, regional EHRs, deep learning

Introduction

The prevalence of depression and anxiety in patients with diabetes is twice that of the non-diabetic population.¹ These psychological disorders affect the self-management, glycemic control adversely,² and are associated with increased risk of cardiovascular complications and dementia,^3–5 more healthcare resources consumption,⁶ higher healthcare costs,⁷^,⁸ and increased risk of all-cause hospitalization for patients with diabetes.⁹ Annual mean total healthcare costs were higher for diabetes patients with comorbid depression (EUR 5629 [95% CI 4987-6407]) than without (EUR 3252 [95% CI 2976-3675]).¹⁰ Predicting depression or anxiety in patients with diabetes is critical for optimizing glycemic control and reducing the cost.

Primary healthcare services (PHS) are crucial for meeting the needs of alleviating the burden of the disease, improving diabetes management,¹¹ and also the primary sources of mental healthcare.¹² The recent proposals outlined in the “Healthy China 2030” suggested that the physicians of PHS in China will need to become a pivotal role for mental health therapies in the future.¹³ Therefore, PHS are expected to become an important mental health identification approach for patients with diabetes.¹⁴

The commonly used methods for assessing depression and anxiety present practical challenges in PHS. To assess the severity of depression or anxiety in diabetic patients, it primarily relies on the total symptom scores reported by screening tools, including Geriatric Depression Scale (GDS),¹⁵^,¹⁶ the Patient Health Questionnaire-9 (PHQ-9) scale,¹⁷ Generalized Anxiety Disorder Scale-7 (GAD-7),¹⁷ and etc. However, the aforementioned tools are prone to fluctuations in symptom perception and can be influenced by recent events.¹⁸ Primary healthcare services often lack the involvement of mental health specialists. This has resulted in a delay in alerting to the risk of depression in diabetic patients and has consequently impacted early intervention efforts.

Electronic health records (EHRs) contain demographic information, symptoms, medical treatment processes, medication, medical history, images, laboratory tests, and other data from patients’ previous follow-up visits. In addition, patients’ medical history and reported symptoms are documented in an unstructured form within free-text notes. However, despite the existence of numerous models identifying the risk of depression or anxiety by leveraging patients’ EHRs,^19–21 there are two main reasons that hinder the identification of depression and anxiety among diabetes patients in PHS.

Firstly, existing models struggle to comprehensively capture the heterogeneous nature of EHR data when exclusively utilizing structured information, leading to a decrease in their effectiveness. The majority of current EHR-based models for predicting mental health rely on structured data with extracted topics²⁰ or symptoms²¹ from text, and necessitate manual conversion of unstructured data into structured formats before training. However, the performances of existing prediction models^19–21 are far from satisfactory. Our hypothesis is that the step of defining on what to be extracted from free-text limits the ability of data driven approaches (eg, deep learning models) for identifying unknown associations with depression/anxiety complications.

Furthermore, there is a two-faceted technical challenge of deriving models directly from Chinese PHS settings. First, in China, the PHS is significantly underused²² compared to those in the United States and the United Kingdom. This leads to the fact that data from primary care is not as widely available. Second, primary care settings have limited personnel qualified to conduct psychotherapy and antidepressants standardly unavailable. Patients can only receive psychological diagnosis or mental health services through referrals.²³ The two combined lead to scarcity and fragmentation of depression-related diagnostic data for PHS patients. Consequently, these obstacles hinder the performance of (directly PHS derived) predictive models for depression in patients with diabetes.²⁴

Large-scale labeled data on EHRs is scarce, while EHR data itself is huge in volume. Recently, contrastive learning, making the model associate similar and dissociate dissimilar samples, is becoming a major form of self-supervised pre-training in the first phase.²⁵ By using large unlabeled datasets to pre-train machine-learning models, self-supervised learning improves the performance of downstream tasks.²⁶ Models with contrastive self-supervised pre-training have required fewer labelled examples to reach the same performance than models trained only through supervised learning.²⁶ Through fine-tuning in next phase, the pre-trained model can be applied to various specific tasks related to diabetes patients.

This study addressed the aforementioned challenges of predicting depression and anxiety in patients with diabetes by developing an EHR-based model called the Depression and Anxiety Prediction (DAP) model. The model utilized an unsupervised pre-training approach using discharge records from T2DM patients in multiple healthcare services and validated it in PHS. This study highlights the effectiveness of the DAP model in early detection and intervention for mental health conditions in diabetes patients, contributing to improved healthcare outcomes.

Materials and methods

Our study was conducted in two steps (Figure 1), inspired by clinical event forecasting model.²⁵ In the first step, we collected and processed the hospital records of T2DM patients from the First Affiliated Hospital of Nanjing Medical University (FAHNMU). Using unsupervised learning, we developed a contrastive pre-training model for EHRs on this dataset. In the second step, we constructed cohorts for the occurrence of depression or anxiety in post-discharged patients with T2DM from the Nanjing Health Information Platform (NHIP). Then, we fine-tuned our EHR pre-trained model to predict the risk of depression or anxiety in T2DM patients during multiple periods after discharge. This approach has been used before²⁷^,²⁸ and enables modeling of changes in risk over time (new predicted risk for each included admission) within patients. Ethical approval (2020-SR-163) for the study was received from the Ethics Committee of the First Affiliated Hospital, Nanjing Medical University, Jiangsu, China.

Data collection and cohorts construction

Discharged EHR dataset of T2DM patients from FAHNMU

First, we constructed a discharged EHR dataset for T2DM diabetic patients (169 058 participants) who visited the FAHNMU between January 1, 2016 and October 31, 2022. Patients with diabetes were included if they met the following screening criteria: hemoglobin A1C (HbA1c) $\geq$ 48 mmol/mol, or 6.5%, or use of anti-diabetic medications, or presence of a diabetes diagnosis or medical history. No exclusion criteria based on age or diagnosed disorder were applied. To ensure the use of high-quality data during the pre-training process, descriptions unrelated to the disease or symptoms (eg, physical examination, consultation, medication, etc) and non-informative chief complaint descriptions (eg, unclear, unknown, etc) were removed. This produced a total of 183 662 electronic records from 85 085 patients in the database used for pre-training process.

Each discharge document comprised structured and unstructured data. The structured data comprised demographic information (age, sex, marital status) and laboratory values. The general demographics of one patient in different hospitalizations were treated as different samples following.²⁸ Outlier detection was performed on numeric values using the IsolationForest algorithm,²⁹ followed by normalization and division into discrete intervals. The unstructured data consisted of diagnosis labels and medical notes containing patient complaints, history of the present illness, past illnesses, and family medical history. To prepare the data for the unsupervised pre-training process, we aggregated general demographics, diagnoses of discharge, laboratory values during the hospitalization, and notes of medical history for each hospitalization.

Case–control cohorts construction from NHIP

Nanjing Health Information Platform is an integrated medical information platform containing the EHRs from most healthcare services in Nanjing, Jiangsu, China. Inpatients from NHIP who had already been diagnosed with diabetes and discharged between January 1, 2020 and December 31, 2021 were selected. To avoid data leaking caused by the pre-training process, the discharge EHRs from FAHNMU and PHS were excluded. The discharge EHR dataset for T2DM patients in NHIP was constructed the same as section discharged EHR dataset of T2DM patients from FAHNMU.

The case cohort was defined as the discharge records of T2DM patients from NHIP who experienced depression or anxiety events within one year after discharge. Depression or anxiety events refer to the presence of depression or anxiety diagnoses or the use of antidepressant or anxiolytic medications. Depression refers to diagnoses with the International Classification of Diseases (ICD)-10 codes F31-F34, F39, F06.3, or the use of antidepressant medications mentioned in the Anatomical Therapeutic Chemical (ATC) code N06A (https://www.whocc.no/atc_ddd_index/?code=N06A). Anxiety refers to diagnoses with ICD-10 codes F40-F43, F06.4, or the use of anxiolytics with the ATC code N05B (https://www.whocc.no/atc_ddd_index/?code=N05B). To avoid the influence of short-term depression or anxiety in patients, we excluded hospitalizations that had occurrences of depression or anxiety within six months prior to discharge diagnosis. Additionally, we merged repeated EHR records of T2DM patients within one week and selected the latest record.

For the control-cohort, we followed the propensity score matching (PSM) approach from Lau Raket et al³⁰ by selecting hospitalizations with similar discharge times and similar outcome event times (in this case, number of days between discharge dates and last date within the time window). We constructed datasets with case–control record ratios of 1:3 to simulate the rate of depression in patients with T2DM (25%³¹^,³²) (Appendix S2, Figure SA1). Similarly, the process of constructing case–control cohorts for depression or anxiety prediction tasks within 30 and 180 days after discharge followed the same procedure.

Primary healthcare services are essential for the management and support of diabetic patients. Therefore, to validate the performance of our model in such services, we selected 78 PHS (including community medical service centers, community hospitals, etc) from NHIP as sub-cohorts and constructed a case–control sub-cohort using the same PSM method. Also, we selected a large general healthcare service ( ${GHS}_{L}$ ), a medium general healthcare service ( ${GHS}_{M}$ ), and a Traditional Chinese Medicine healthcare service (TCM-HS). These sub-cohorts (1:3) were constructed using the same PSM method.

DAP model development

Pre-training process on discharged EHR dataset of T2DM patients

We established a contrastive pre-training model ( $D A P_{C P}$ , Figure 2) based on discharged EHR dataset from section Discharged EHR dataset of T2DM patients from FAHNMU. The objective of the model was to minimize the internal distance between inpatient medical text, personal information, laboratory test data, and discharge diagnosis in each record via contrastive learning. We constructed separate encoders for unstructured and structured data due to the heterogeneous nature of EHR data.

For structured data, specifically demographic information and laboratory test results within a single record, we concatenated embedding of discretized data and the representation of feature names from a large-scale language model (LLM)³³ before feeding it into the structured encoder $ψ_{E}$ , which was a general transformer structure.³⁴ For unstructured textual data, such as medical history, diagnosis names, medication names, and laboratory test names, we first utilized LLM for static encoding to obtain their representation vectors as inputs to our model. This step can be considered as part of our data pre-processing. For the patient’s medical history, diagnostic text, and medication record text were encoded using text encoders with the same structure, denoted as $f_{X}$ , $f_{D}$ , and $f_{M}$ , respectively.

In order to fusion the unstructured and structured of data (the patient’s medical history and the laboratory test results during hospitalization), we applied Fast Linear Attention with a Single Head (FLASH) model as the fusion encoding model, denoted as $f_{X}$ that can support a length of over 2500.³⁵ The result E from $ψ_{E}$ was concatenated with an encoded medical record N from LLM as input. The algorithmic process of the fusion function $f_{X}$ is illustrated in Algorithm 1, where W denotes the learn-able variables, and S denotes the length of the input.

graphic file with name ocad228ilf1.jpg

We hypothesized a potential connection between a patient’s medical history, laboratory test data, and the diagnoses and medications prescribed by doctors, based on common sense. To model this connection, we employed contrastive learning—a discriminative technique that enhances semantic similarity among predefined instances within the same class while reducing semantic similarity between different instances.³⁶ The objective loss function is defined as eqn (1) where $τ$ is a temperature hyper-parameter, and q is similar to its positive key $k_{+}$ and dissimilar to all other keys considered negative keys for q in one batch K.

L^{(q, k)} = - log \frac{exp (q \cdot k_{+} / τ)}{\sum_{i = 0}^{K} exp (q \cdot k_{i} / τ)}

(1)

We raised a paired contrastive loss $L$ (eqn (2)) to enhance the latent relation between the patient’s information representation and the representation of diagnosis or medications, which was inspired by the contrastive loss function of the contrastive language-image pre-training (CLIP).³⁷ The outputs x, m, d were get via three fully connected networks $g_{x}$ , $g_{m}$ , and $g_{d}$ which accepted X, M, D as inputs. Then, we optimized the x similarity with corresponding discharge diagnoses d and medication m through the $L_{C P}$ .

L_{C P} = L^{(x, m)} + L^{(m, x)} + L^{(x, d)} + L^{(d, x)}

(2)

Fine-tuning process for depression and anxiety prediction

To predict the occurrence of depression or anxiety within one year after discharge in T2DM patients from case–control cohorts, we conducted a fine-tuning model ( $D A P_{F T}$ ) on the $D A P_{C P}$ model obtained from section pre-training process on discharged EHR dataset of T2DM patients. Our optimization objective function ( $L_{F T}$ , eqn (3), where y represents the ground truth label and u represents the predicted probability) was to perform prediction tasks on multiple time intervals after discharge. Firstly, we encoded EHR data from case–control cohorts using the pre-trained model, obtaining the latent representations x, m, d of each EHR record. Then, x, m, d were concentrated and inputted into a single-layer neural network as $ϕ_{U}$ to classify whether depression or anxiety existed in the following time intervals.

L_{F T} = \sum_{i} - [y_{i} \cdot log (u_{i}) + (1 - y_{i}) \cdot log (1 - u_{i})]

(3)

Evaluations and interpretation of models

The demographic characteristics of the records in the case–control cohorts were compared and tested for potential differences at a significance level of 0.05. Categorical variables were analyzed using $χ^{2}$ tests, while continuous variables were evaluated using Wilcoxon tests, all for descriptive purposes.

We employed machine learning models, such as logistic regression (LR), extreme gradient boosting (XGB), and random forest (RF), as baseline models to compare the performance of DAP model. The LR, XGB, and RF had been selected as comparator baseline models in various researches and proved effectiveness in predicting depression on EHRs.^38–41 Missing values in the structured data were filled using mean imputation, while the unstructured data were vectorized using the term frequency-inverse document frequency (TF-IDF).⁴² The two types of vectors for each record were concatenated and input into the baseline models. Additionally, we also validated the Latent Dirichlet allocation (LDA) method as vectorization of unstructured data⁴³ (see Appendix Tables SA6, SA7). In the overall NHIP cohort, after excluding PHS and FAHNMU, the $D A P_{F T}$ model was subjected to 10-fold cross-validation on both NHIP and PHS. Additionally, NHIP and three sub-cohorts ( ${GHS}_{L}$ , ${GHS}_{M}$ , TCM-HS) were used as training data for external validation on PHS.

The mean value of receiver operating characteristic area under the curve (ROC-AUC) and precision-recall area under the curve (PR-AUC) were evaluated as metrics. The t-test is used to compare the differences between two groups of indicators (10-fold ROC-AUC and PR-AUC). Furthermore, decision curve analyses (DCAs)⁴⁴ were utilized to assess the clinical utility of various prediction models by considering the balance between the benefits and harms associated with different decision thresholds. During DCA, the net benefit of each model is evaluated across a spectrum of threshold probabilities that reflect the probability at which a clinician or patient would take action based on the prediction. The model that provides the greatest net benefit over the complete range of threshold probabilities is deemed to have the highest clinical utility. In the DCA figure, the solid lines and shaded areas correspond to the means and standard deviations of the net benefit of each model in 10-fold validation.

To interpret the DAP model and find the most influential features for depression and anxiety prediction, we applied the integrated gradients (IG) method⁴⁵ using the Python Captum library⁴⁶ released by Facebook to calculate the mean attribute scores of input features. The idea behind IG is to compute the gradients of the model’s prediction with respect to the input features while integrating these gradients along a path from a reference input $x^{'}$ to the actual input x. The IG score along the $i^{t h}$ dimension for an input x is defined in eqn (4), where $α$ is the scaling coefficient, $F (x)$ represents our DAP model, and $\frac{\partial F (x)}{\partial x_{i}}$ is the gradient of DAP model $F (x)$ along the $i^{t h}$ dimension. To calculate the IG score for each feature, we summed up all the dimensions included in each feature (diagnosis, laboratory tests, medications). This produced the IG score for each feature, and we listed the top 20 feature names.

I G_{i} (x) : : = (x_{i} - x_{i}^{'}) \times \int_{α = 0}^{1} \frac{\partial F (x^{'} + α \times (x - x^{'}))}{\partial x_{i}} d α

(4)

Implementation details

The PSM method for constructing the cohorts was performed using the PsmPy package provided by Owens-Gary et al.⁴⁷ The deep learning model was implemented in PyTorch version 1.11.0. Max length of text input was 2500, layer number was 24, batch size was 6, epoch length was 10, and learning rate was 2e−5. We used 4 NVIDIA GeForce RTX 3090 GPU of 24GB graphics memory capacity. Machine learning models were implemented in scikit-learn package version 1.0.2 and Pycare 3.0. We used the default hyper-parameters for each model.

Results

Descriptive results of FAHNMU and NHIP

We employed a cohort of 85 085 hospitalized patients (183 662 discharge records) from FAHNMU to construct the $D A P_{C P}$ model for EHR (Appendix S1, Table SA1). Among these patients, males accounted for 58.94%, and the median age was 65 years. Notably, 5.76% of patients experienced depressive or anxious episodes within one year after discharge, with a median duration of 91 days post-discharge (IQR: 173.42). In the NHIP cohort, after excluding hospitalization records from FAHNMU, a total of 149 596 hospitalized patients (251 361 discharge records) remained. Males constituted 56.56% of this cohort, and the median age mirrored that of FAHNMU. Remarkably, 28.21 cases of depression or anxiety occurred for every 1000 person-year in NHIP, which was twice compared with FAHNMU. As for PHS, the prevalence of depression or anxiety was 22.11 for every 1000 person-year.

We derived five pairs of case–control cohorts from the discharge records of T2DM patients in NHIP, focusing on the occurrence of depression or anxiety within three specific post-discharge time intervals. Taking 365 days post discharge as an example (Table 1), discharge records indicating depression or anxiety within this period were classified into the case cohort. In NHIP, the case cohort contained 5445 discharge records and exhibited a higher proportion of females (51.69%) and older age of admission (median age: 68 years), in contrast to the gender and age distribution observed in the control cohort. As in PHS, 286 records showed occurrence of depression or anxiety within 365 days post discharge. The data description tables for ${GHS}_{M}$ and ${GHS}_{L}$ can be found in Table SA2 of Appendix S1.

Table 1.

Descriptive result of the case–control cohorts from NHIP and PHS.

	NHIP^a			PHS
	Case (n = 5445)	Control (n = 16 335)	P-value	Case (n = 286)	Control (n = 858)	P-value
Patients, N	3678	13 813		203	825
Gender, N (%)
Female	1901 (51.69)	5764 (41.73)	.000	119 (58.62)	437 (52.97)	.171
Male	1777 (48.31)	8049 (58.27)	.000	84 (41.38)	388 (47.03)	.171
Age, year (IQR)	68 (17)	66 (18)	.000	69 (12)	70 (12)	.814
Examination, median (IQR)
Temperature, °C	36.50 (0.30)	36.50 (0.30)	.072	36.50 (0.40)	36.50 (0.30)	.110
Pulse, times	78.00 (14.00)	78.00 (14.00)	.000	76.00 (15.00)	76.00 (14.00)	.695
SBP, mm Hg	133.00 (29.00)	133.00 (26.00)	.277	133.00 (27.00)	138.00 (21.00)	.001
DBP, mm Hg	79.00 (16.00)	80.00 (16.00)	.061	80.00 (13.00)	80.00 (13.00)	.812
GLU, mmol/L	6.33 (3.08)	6.84 (3.51)	.000	7.23 (3.24)	7.95 (4.02)	.341
HbA1c, %	6.90 (1.90)	7.30 (2.40)	.000	7.50 (2.20)	7.35 (2.62)	.839

Open in a new tab

The records in PHS were removed.

Abbreviations: DBP, diastolic blood pressure; GLU, glucose; HbA1c, glycosylated hemoglobin; NHIP, Nanjing Health Information Platform; PHS, primary healthcare services; SBP, systolic blood pressure.

Performance of fine-tuning model for depression and anxiety

Internal validation for NHIP and PHS

We conducted an ablation experiment to compare the performance of the DAP model without pre-training and the DAP on the NHIP dataset, as shown in Figure 3, with 10-fold internal validation results for each epoch. The DAP model achieved stable ROC-AUC and PR-AUC performance as early as the second epoch. This suggested that we do not actually need to train for 10 epochs to achieve optimal performance, thus saving computational resources.

Figure 3. — During the training process of the depression and anxiety prediction (DAP) model within 365 days on the Nanjing Health Information Platform (NHIP) case–control cohorts, the changes of various indicators on 10-folder validation were compared between the use of pre-training models and the absence of pre-training models. The metrics include training loss, ROC-AUC, and PR-AUC on the each fold of test dataset.

We compared our fine-tuned pre-trained model with baseline models (including XGB, RF, and LR) to evaluate the performance in predicting the occurrence of depression or anxiety risk within 365 days of the post-discharge time interval. In the 10-fold internal validation on the PHS, as shown in Table 2, the DAP model significantly outperformed the baseline models (ROC-AUC: 0.91 ± 0.028, PR-AUC: 0.80 ± 0.067, P-value $<$ .000). Furthermore, it also demonstrated excellent predictive performance on the overall NHIP dataset. For the tasks of predicting depression and anxiety at 30 days and 180 days after discharge, performance metrics can be found in the tables in Appendix S2. The predictive performance at 30 days was not significantly different from baselines.

Table 2.

Performance of DAP model and baseline models within 365 days after discharge on 10-fold internal validation for cohorts from NHIP and PHS.

Models	ROC-AUC	P-value	PR-AUC	P-value
NHIP^a
LR	0.60 ( $\pm$ 0.011)	.000	0.32 ( $\pm$ 0.009)	.000
RF	0.73 ( $\pm$ 0.011)	.000	0.51 ( $\pm$ 0.015)	.000
XGB	0.72 ( $\pm$ 0.012)	.000	0.46 ( $\pm$ 0.026)	.000
DAP	0.80 ( $\pm$ 0.010)	ref	0.61 ( $\pm$ 0.018)	ref
PHS
LR	0.60 ( $\pm$ 0.072)	.000	0.41 ( $\pm$ 0.101)	.000
RF	0.60 ( $\pm$ 0.057)	.000	0.34 ( $\pm$ 0.056)	.000
XGB	0.65 ( $\pm$ 0.072)	.000	0.41 ( $\pm$ 0.072)	.000
DAP	0.91 ( $\pm$ 0.028)	ref	0.80 ( $\pm$ 0.067)	ref

Open in a new tab

The records from PHS were removed.

The numbers in parentheses are the standard deviation. The bolded part indicates the best performance of the corresponding data under the respective metric.

Evaluation metrics included ROC-AUC and PR-AUC. We conducted a t-test to compare the differences in results between the two groups generated from the 10-fold data.

Abbreviations: DAP, depression and anxiety prediction; LR, logistic regression; NHIP, Nanjing Health Information Platform; PHS, primary healthcare services; RF, random forest; XGB, extreme gradient boosting.

External validation on PHS for sub-cohorts

We used PHS as validation data for 10-folds validations to compare the predictive performance using different types of healthcare service institution data. As shown in Table 3, using NHIP (excluding records from PHS) as the training data, we validated the performance of the DAP model in predicting the occurrence of depression and anxiety after 365 days of discharge for diabetic patients on PHS. The DAP model achieved significant advantages (ROC–AUC: 0.75±0.045, P $<$ .000; PR-AUC: 0.47±0.081, P $<$ .000). Among the sub-cohorts of the three types of healthcare services, the DAP model demonstrated significant superiority over the baselines in the TCM-HS cohorts (ROC-AUC: 0.74±0.035, P $<$ .000; PR-AUC: 0.46±0.073, P $<$ .000). However, the DAP model did not show significant performance advantages on GHS data, regardless of the scale of the general healthcare service (GHS) data. For the tasks of predicting depression and anxiety at 30 days and 180 days after discharge, performance metrics can be found in the tables in Appendix S2.

Table 3.

Performance of DAP model and baseline models within 365 days after discharge on 10-fold external validation on PHS for cohorts from NHIP, GHS, and TCM-HS.

Models	ROC-AUC	P-value	PR-AUC	P-value
NHIP^a
LR	0.51 (±0.076)	.000	0.30 (±0.074)	.000
RF	0.52 (±0.061)	.000	0.28 (±0.058)	.000
XGB	0.53 (±0.065)	.000	0.26 (±0.039)	.000
DAP	0.75 (±0.045)	ref	0.47 (±0.081)	ref
GHS_L
LR	0.50 (±0.055)	.000	0.28 (±0.060)	.009
RF	0.50 (±0.061)	.000	0.27 (±0.037)	.000
XGB	0.61 (±0.051)	.476	0.37 (±0.053)	.804
DAP	0.62 (±0.059)	ref	0.36 (±0.055)	ref
GHS_M
LR	0.40 (±0.069)	.005	0.23 (±0.042)	.281
RF	0.50 (±0.059)	.737	0.27 (±0.050)	.453
XGB	0.49 (±0.063)	.848	0.27 (±0.037)	.473
DAP	0.49 (±0.060)	ref	0.26 (±0.053)	ref
TCM-HS
LR	0.56 (±0.058)	.000	0.33 (±0.053)	.000
RF	0.55 (±0.056)	.000	0.30 (±0.036)	.000
XGB	0.56 (±0.066)	.000	0.31 (±0.066)	.000
DAP	0.74 (±0.035)	ref	0.46 (±0.073)	ref

Open in a new tab

The records from PHS were removed.

The numbers in parentheses are the standard deviation. The bolded part indicates the best performance of the corresponding data under the respective metric. Evaluation metrics included ROC-AUC and PR-AUC. We conducted a t-test to compare the differences in results between the two groups generated from the 10-fold data.

The DCA curves illustrated that the net benefit of the DAP model trained on NHIP, TCM-HS, and PHS surpassed that of the baseline models within the threshold range of 0.3 to 0.5 in 10-fold validation (Figures 4 and 5). This observation suggested that the fine-tuned pre-trained model was more adept at striking a balance between accurately identifying true positives and minimizing false positives.

Figure 4. — Comparison of decision curve analysis between depression and anxiety prediction (DAP) model and baseline models (logistic regression [LR], random forest [RF], extreme gradient boosting [XGB]) for depression or anxiety risk prediction in 365 days for cohorts from Nanjing Health Information Platform (NHIP) and primary healthcare services (PHS). The net benefit of the decision curve analysis (DCA) curve is calculated based on the 10-fold internal validation.

Figure 5. — Comparison of decision curve analysis between depression and anxiety prediction (DAP) model and baseline models (logistic regression [LR], random forest [RF], extreme gradient boosting [XGB]) for depression or anxiety risk prediction in 365 days. The net benefit of the decision curve analysis (DCA) curve is calculated based on the 10-fold validation on the primary healthcare services (PHS).

Interpretation of DAP model

To explain the model’s prediction for depression and anxiety, we summed up the input features across dimensions and obtained the IG score for each feature. The features included diagnosis, medication names, laboratory tests, and patient disease information. The diagnosis adopted the ICD-10 coding system. For medications, we removed the descriptions of dosage forms to merge similar medications.

The top 20 mean IG scores of the feature texts from captum are shown in Table 4. The top three feature items with the highest positive predictive contribution for depression or anxiety are postherpetic neuralgia diagnosis, finasteride medication, and burn corrosion.

Table 4.

Based on the top 20 feature attribution scores provided by Captum, a detail column is presented, indicating the corresponding ICD codes in relation to the textual content of the original discharge diagnosis.

Feature name	Type	IG Score
Sequelae of other and unspecified infectious and parasitic diseases, B94	Diagnosis	4.09
Finasteride	Drug	4.04
Burn and corrosion, body region unspecified, T30	Diagnosis	3.02
Malignant neoplasm of uterus, part unspecified, C56	Diagnosis	2.98
Telmisartan	Drug	2.69
Malaise and fatigue, R53	Diagnosis	2.52
Other disorders of pancreatic internal secretion, E16	Diagnosis	1.85
Malignant neoplasm of vulva, C51	Diagnosis	1.83
Other inflammatory liver diseases, K75	Diagnosis	1.81
Abnormal results of function studies, R94	Diagnosis	1.77
Omeprazole sodium	Drug	1.765
Acute myocardial infarction, I21	Diagnosis	1.70
Subarachnoid hemorrhage, I60	Diagnosis	1.67
Recombinant lysine-protein zinc insulin	Drug	1.67
Propranolol hydrochloride	Drug	1.67
Nao Xin Qing	Drug	1.63
Recombinant human insulin zinc	Drug	1.57
Edaravone	Drug	1.52
Nonorganic sleep disorders, F51	Diagnosis	1.51

Open in a new tab

Discussion

In this study, we constructed the DAP model to provide risk prediction for the occurrence of depression or anxiety in multiple time periods for patients with T2DM, and addressed the issues of data heterogeneity and scarcity in the records from PHS. The DAP model is followed by two phases of training (self-supervised pre-training called $D A P_{C P}$ and supervised fine-tuning called $D A P_{F T}$ ) on EHRs. Through validation on a large NHIP cohort and a small PHS cohort, DAP model demonstrated significantly superior ROC-AUC and PR-AUC metrics compared to the baseline models. Furthermore, to explore the clinical utility of our model, we conducted DCA, revealing its stable advantage at specific thresholds on the PHS.

The DAP model can assist PHS in identifying the risks of depression or anxiety in T2DM patients via EHR. In recent years, artificial intelligence algorithms have been widely applied in the field of depression or anxiety prediction. Previous researches have mostly been based on scale tools⁴⁸ or structured data from EHR.²⁰^,⁴⁹ Gettings et al⁴⁸ developed Patient Health Questionnaire for Adolescents (PHQ-A) to increase the sensitivity of depression screening for youth with diabetes. Hochman et al²⁰ used XGB model predicting postpartum depression among women by analyzing demographics, medication prescriptions, laboratory measurements, and other EHR data. Song et al⁴⁹ identified eight bio-markers from EHR to predict depression in diabetes mellitus using support vector machine. However, PHS lack specialized training in providing psychological health services.²⁴ In addition, scarcity of features in trainable data and low data completeness in PHS could have obstacles in prediction depression or anxiety in patients with diabetes. The DAP model achieved the best performance (0.91±0.028 of ROC-AUC and 0.80±0.067 of PR-AUC) compared to the XGB model, when using PHS data only. Also, it is worth noting that although TCM-HS have less than 20% of the NHIP training data, the performance in validating the PHS (ROC: 0.74±0.035) is close to that of using the NHIP data (ROC: 0.75±0.045). The annual incidence rate of depression among inpatients in TCM-HS is 29 per thousand persons, which is close to the rate in NHIP of 28.21, indicating that it is unrelated to the occurrence of depression. We speculate that this may be related to the content of medical record text written by doctors in TCM-HS, which requires further analysis using natural language processing tools in the future.

The initial component of the DAP model, $D A P_{C P}$ model, effectively captured the heterogeneity of EHR data, including textual data, structured laboratory tests, and demographic information. This $D A P_{C P}$ is based on long-term and high-quality FAHNMU data, serving as the foundation to the challenges predicting depression and anxiety among diabetes patients using records from PHS. In contrast to Zhang et al’s approach,²⁵ which employed contrastive learning on temporally close medical records for the same patient, we do not follow this method due to the long time span and potential dissimilarities within our inpatient cohort for T2DM in our data. Consequently, we validated our hypothesis that there exists inherent similarity among various sections (eg, medical history, examinations, medications, and diagnoses) within discharge documents. By employing our $D A P_{C P}$ pre-training model and subsequently fine-tuning it on NHIP data, we have observed a significant enhancement in the prediction model for depression and anxiety. This improvement is particularly evident when compared to models without pre-training, as our approach demonstrates faster convergence. These findings affirm the reliability of our pre-training model construction method and highlight its potential for future fine-tuning tasks targeting other diseases in T2DM patients.

We used the IG score to demonstrate the factors that contribute to the prediction of depression and anxiety in patients with diabetes. We calculate the IG score for each feature by summing the input features across dimensions. Laboratory tests have a relatively low impact on predicting the occurrence of depression and anxiety after discharge in T2DM patients. The main influential features are derived from patients’ discharge diagnosis and medication. We found that herpes zoster, burn, and finasteride are the top three features with the highest IG scores, and these three factors have been reported related to the occurrence of depression.^50–52 Additionally, we found that telmisartan is also an important feature for predicting the occurrence of depression or anxiety. Telmisartan can induce central angiotensin type 1 receptor blockade and has the potential to be an oral antidepressant.⁵³ However, studies have also shown that high doses of telmisartan can induce depressive symptoms in diabetes-induced depression rat.⁵⁴ Therefore, the antidepressant effect of this medication still need further discussion in the future.

Our study has the following limitations. First, the DAP model directly uses diagnostic text for contrastive learning in the pre-training process of comparing discharge documents, which may lack the correlation between different diagnoses. For example, diabetes and thyroiditis both belong to endocrine system diseases. In the future, this limitation could be addressed by adding hierarchical structure task prediction between diagnoses to establish associations. Second, there is a possibility of false positives among the control group in each cohort, which is also a common issue in many prediction models related to mental health. When constructing the case–control training data, for the control group of negative T2DM patients, the determination is based on the presence of medical records and the absence of depression or anxiety events. In the future, we will enhance and improve our model by integrating with the medical data platform of Jiangsu Province. Third, the inclusion of hospitalized patients could introduce a potential source of bias, and patients may differ in significant ways from those who do not require hospitalization. In future research, we could explore these differences in more detail and strength our model’s generalizability. Finally, the performance of predicting the occurrence risk of depression or anxiety after discharge for T2DM patients should be confirmed through prospective clinical experiments. Previous studies have conducted prospective clinical validation of prediction models for short-term mental health crises after healthcare visits⁴⁰ and affirmed their clinical value. Diabetes is a chronic disease, so it is crucial to predict the long-term mental status of diabetic patients and conduct prospective validation. Additional research is needed in the future to not only evaluate the model’s performance but also assess whether it can provide benefits for glycemic management.

Conclusion

Overall, our study validates the feasibility of constructing a T2DM patient EHR contrastive pre-training model on FAHNMU and using it to fine-tune the risk of depression or anxiety in discharged patients across multiple time periods in regional EHR data, especially in PHS. Through data validation across multiple institutions, the model has demonstrated potential for clinical applications. Population-based validation and addressing challenges that may arise in clinical practice should be included in future considerations.

Supplementary Material

ocad228_Supplementary_Data

Click here for additional data file.^{(1.2MB, zip)}

Contributor Information

Wei Feng, Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China.

Honghan Wu, Institute of Health Informatics, University College London, London, WC1E 6BT, United Kingdom; The Alan Turing Institute, London, NW1 2DB, United Kingdom.

Hui Ma, Department of Medical Psychology, Nanjing Brain Hospital affiliated with Nanjing Medical University, Nanjing, Jiangsu, 210024, China.

Zhenhuan Tao, Department of Planning, Nanjing Health Information Center, Nanjing, Jiangsu, 210003, China.

Mengdie Xu, Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China.

Xin Zhang, Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China; Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China.

Shan Lu, Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China; Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China.

Cheng Wan, Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China.

Yun Liu, Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China; Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China.

Author contributions

WF, HHW, and HM conceived the experiments; ZHT, XZ, and SL collected and preprocessed the data; WF, MDX, and CW analyzed the results; WF, CW and YL wrote and reviewed the manuscript; WF, CW, and HHW contributed to manuscript drafting and revision.

Supplementary material

Supplementary material is available at Journal of the American Medical Informatics Association online.

Funding

This work was supported by Nanjing Life and Health Technology Special Project “Cooperative research, development and transformation of active intelligent health management platform for diabetes mellitus” (202205053); Nanjing City Health Science and Technology Development Special Fund in 2023 (Grant no. YKK23197); Jiangsu Provincial Health Commission’s medical research project (2023/288) 2023. This work was also supported by UK’s Medical Research Council (MR/S004149/1, MR/X030075/1); National Institute for Health Research (NIHR202639); British Council (UCL-NMU-SEU International Collaboration On Artificial Intelligence In Medicine: Tackling Challenges Of Low Generalisability And Health Inequality); HW’s role in this research was partially funded by the Legal & General Group (research grant to establish the independent Advanced Care Research Centre at University of Edinburgh). The funders had no role in conduct of the study, interpretation, or the decision to submit for publication. The views expressed are those of the authors and not necessarily those of Legal & General.

Conflict of interest

No competing interest is declared.

Data availability

The data underlying this article cannot be shared publicly due to the potential identifying nature of the records. The data will be shared on reasonable request to the corresponding author. The code of this study is available in https://github.com/inseptember/DAP.

Ethical approval

The studies involving human participants were reviewed and approved by the Ethics Committee of the First Affiliated Hospital, Nanjing Medical University (2020-SR-163).

References

1. Tomic D, Shaw JE, Magliano DJ.. The burden and risks of emerging complications of diabetes mellitus. Nat Rev Endocrinol. 2022;18(9):525-539. [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Tabák AG, Akbaraly TN, Batty GD, Kivimäki M. Depression and type 2 diabetes: a causal association? Lancet Diabetes Endocrinol. 2014;2(3):236-245. [DOI] [PubMed] [Google Scholar]
3. Boehmer K, Lakkad M, Johnson C, Painter JT. Depression and diabetes distress in patients with diabetes. Prim Care Diabetes. 2023;17(1):105-108. [DOI] [PubMed] [Google Scholar]
4. Ascher-Svanum H, Zagar A, Jiang D, et al. Associations between glycemic control, depressed mood, clinical depression, and diabetes distress before and after insulin initiation: an exploratory, post hoc analysis. Diabetes Ther. 2015;6(3):303-316. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Katon W, Pedersen HS, Ribe AR, et al. Effect of depression and diabetes mellitus on the risk for dementia: a national population-based cohort study. JAMA Psychiatry. 2015;72(6):612-619. [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Hsieh H-M, Lin C-H, Weng S-F, Lin P-C, Liu T-L, Huang C-J. Health-related quality of life, medical resource use and physical function in patients with diabetes mellitus and depression: a cross-sectional analysis from the National Health and Nutrition Examination Survey. J Affect Disord. 2023;327:93-100. [DOI] [PubMed] [Google Scholar]
7. Iturralde E, Chi FW, Grant RW, et al. Association of anxiety with high-cost health care use among individuals with type 2 diabetes. Diabetes Care. 2019;42(9):1669-1674. [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Wang H-I, Han L, Jacobs R, et al. Healthcare resource use and costs for people with type 2 diabetes mellitus with and without severe mental illness in England: longitudinal matched-cohort study using the Clinical Practice Research Datalink. Br J Psychiatry. 2022;221(1):402-409. [DOI] [PubMed] [Google Scholar]
9. Tardif I, Guenette L, Zongo A, Demers E, Lunghi C. Depression and the risk of hospitalization in type 2 diabetes patients: a nested case-control study accounting for non-persistence to antidiabetic treatment. Diabetes Metab. 2022;48(4):101334. [DOI] [PubMed] [Google Scholar]
10. Brüne M, Linnenkamp U, Andrich S, et al. Health care use and costs in individuals with diabetes with and without comorbid depression in Germany: results of the cross-sectional DiaDec study. Diabetes Care. 2021;44(2):407-415. [DOI] [PubMed] [Google Scholar]
11. Wang X, Song K, Zhu P, Valentijn P, Huang Y, Birch S. How do type 2 diabetes patients value urban integrated primary care in China? Results of a discrete choice experiment. Int J Environ Res Public Health. 2020;17(1):117. [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Franco MI, Staab EM, Zhu M, et al. Pragmatic clinical trial of population health, portal-based depression screening: the PORTAL-depression study. J Gen Intern Med. 2023;38(4):857-864. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Li X, Lu J, Hu S, et al. The primary health-care system in China. Lancet. 2017;390(10112):2584-2594. [DOI] [PubMed] [Google Scholar]
14. Dibato J, Montvida O, Ling J, Koye D, Polonsky WH, Paul SK. Temporal trends in the prevalence and incidence of depression and the interplay of comorbidities in patients with young- and usual-onset type 2 diabetes from the USA and the UK. Diabetologia. 2022;65(12):2066-2077. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. de Souza Moreira B, Maria da Cruz Dos Anjos D, Pereira DS, et al. The geriatric depression scale and the timed up and go test predict fear of falling in community-dwelling elderly women with type 2 diabetes mellitus: a cross-sectional study. BMC Geriatr. 2016;16(1):56. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Lavie I, Schnaider Beeri M, Schwartz Y, et al. Decrease in gait speed over time is associated with increase in number of depression symptoms in older adults with type 2 diabetes. J Gerontol A Biol Sci Med Sci. 2023;78(8):1504-1512. [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Cherry MG, Brown SL, Purewal R, Fisher PL. Do metacognitive beliefs predict rumination and psychological distress independently of illness representations in adults with diabetes mellitus? A prospective mediation study. Br J Health Psychol. 2023;28(3):814-828. [DOI] [PubMed] [Google Scholar]
18. Robinson J, Khan N, Fusco L, Malpass A, Lewis G, Dowrick C. Why are there discrepancies between depressed patients’ global rating of change and scores on the Patient Health Questionnaire depression module? A qualitative study of primary care in England. BMJ Open. 2017;7(4):e014519. [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Jackson RG, Patel R, Jayatilleke N, et al. Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project. BMJ Open. 2017;7(1):e012012. [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Hochman E, Feldman B, Weizman A, et al. Development and validation of a machine learning-based postpartum depression prediction model: a nationwide cohort study. Depress Anxiety. 2021;38(4):400-411. [DOI] [PubMed] [Google Scholar]
21. Irving J, Patel R, Oliver D, et al. Using natural language processing on electronic health records to enhance detection and prediction of psychosis risk. Schizophr Bull. 2021;47(2):405-414. [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Wu D, Lam TP.. Underuse of primary care in China: the scale, causes, and solutions. J Am Board Fam Med. 2016;29(2):240-247. [DOI] [PubMed] [Google Scholar]
23. Searle K, Blashki G, Kakuma R, Yang H, Zhao Y, Minas H. Current needs for the improved management of depressive disorder in community healthcare centres, Shenzhen, China: a view from primary care medical leaders. Int J Ment Health Syst. 2019;13(1):47. [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Grazier KL, Smiley ML, Bondalapati KS.. Overcoming barriers to integrating behavioral health and primary care services. J Prim Care Community Health. 2016;7(4):242-248. [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Zhang Z, Yan C, Zhang X, Nyemba SL, Malin BA. Forecasting the future clinical events of a patient through contrastive learning. J Am Med Inform Assoc. 2022;29(9):1584-1592. [DOI] [PMC free article] [PubMed] [Google Scholar]
26. Krishnan R, Rajpurkar P, Topol EJ.. Self-supervised learning in medicine and healthcare. Nat Biomed Eng. 2022;6(12):1346-1352. [DOI] [PubMed] [Google Scholar]
27. Simon GE, Johnson E, Lawrence JM, et al. Predicting suicide attempts and suicide deaths following outpatient visits using electronic health records. Am J Psychiatry. 2018;175(10):951-960. [DOI] [PMC free article] [PubMed] [Google Scholar]
28. Seymour CW, Kennedy JN, Wang S, et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA. 2019;321(20):2003-2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
29. Liu FT, Ting KM, Zhou Z-H.. Isolation-based anomaly detection. ACM Trans Knowl Discov Data. 2012;6(1):1-39. [Google Scholar]
30. Lau Raket L, Jaskolowski J, Kinon BJ, et al. Dynamic ElecTronic hEalth reCord deTection (DETECT) of individuals at risk of a first episode of psychosis: a case–control development and validation study. Lancet Digit Health. 2020;2(5):e229-e239. [DOI] [PubMed] [Google Scholar]
31. Maimaitituerxun R, Chen W, Xiang J, et al. Prevalence of comorbid depression and associated factors among hospitalized patients with type 2 diabetes mellitus in Hunan, China. BMC Psychiatry. 2023;23(1):158. [DOI] [PMC free article] [PubMed] [Google Scholar]
32. Liu X, Li Y, Guan L, et al. A systematic review and meta-analysis of the prevalence and risk factors of depression in type 2 diabetes patients in China. Front Med (Lausanne). 2022;9:759499. [DOI] [PMC free article] [PubMed] [Google Scholar]
33. Su J, Lu Y, Pan S, Wen B, Liu Y. Roformer: enhanced transformer with rotary position embedding, 2021.
34. Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17. Curran Associates Inc.; 2017:6000–6010.
35. Hua W, Dai Z, Liu H, Le Q. Transformer quality in linear time. In: Proceedings of the 39th International Conference on Machine Learning. PMLR, 2022:9099–9117.
36. Saunshi N, Plevrakis O, Arora S, Khodak M, Khandeparkar H. A theoretical analysis of contrastive unsupervised representation learning. In: Proceedings of the 36th International Conference on Machine Learning. PMLR, 2019:5628–5637.
37. Radford A, Kim JW, Hallacy C, et al. Learning transferable visual models from natural language supervision. In: Proceedings of the 38th International Conference on Machine Learning. PMLR, 2021:8748–8763.
38. Jin H, Wu S, Di Capua P.. Development of a clinical forecasting model to predict comorbid depression among diabetes patients and an application in depression screening policy making. Prev Chronic Dis. 2015;12:E142. [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Shin D, Lee KJ, Adeluwa T, Hur J. Machine learning-based predictive modeling of postpartum depression. J Clin Med. 2020;9(9):2899. [DOI] [PMC free article] [PubMed] [Google Scholar]
40. Garriga R, Mas J, Abraha S, et al. Machine learning model to predict mental health crises from electronic health records. Nat Med. 2022;28(6):1240-1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
41. Wei Z, Wang X, Ren L, et al. Using machine learning approach to predict depression and anxiety among patients with epilepsy in China: a cross-sectional study. J Affect Disord. 2023;336:1-8. [DOI] [PubMed] [Google Scholar]
42. Salton G, Buckley C.. Term-weighting approaches in automatic text retrieval. Inform Process Manage. 1988;24(5):513-523. [Google Scholar]
43. Meng Y, Speier W, Ong M, Arnold CW. HCET: hierarchical clinical embedding with topic modeling on electronic health records for predicting future depression. IEEE J Biomed Health Inform. 2021;25(4):1265-1272. [DOI] [PMC free article] [PubMed] [Google Scholar]
44. Vickers AJ, Elkin EB.. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26(6):565-574. [DOI] [PMC free article] [PubMed] [Google Scholar]
45. Sundararajan M, Taly A, Yan Q. Axiomatic attribution for deep networks. In: International Conference on Machine Learning. PMLR, 2017:3319–3328.
46. Kokhlikyan N, Miglani V, Martin M, et al. Captum: a unified and generic model interpretability library for PyTorch, 2020.
47. Owens-Gary MD, Zhang X, Jawanda S, Bullard KM, Allweiss P, Smith BD. The importance of addressing depression and diabetes distress in adults with type 2 diabetes. J Gen Intern Med. 2019;34(2):320-324. [DOI] [PMC free article] [PubMed] [Google Scholar]
48. Gettings J, Patil O, Vajravelu ME, et al. 871-P: integrating the PHQ-A depression screening tool within the electronic health record increases detection of depression symptoms. Diabetes. 2021;70(Suppl 1):871–P. [Google Scholar]
49. Song X, Zheng Q, Zhang R, et al. Potential biomarkers for predicting depression in diabetes mellitus. Front Psychiatry. 2021;12:731220. [DOI] [PMC free article] [PubMed] [Google Scholar]
50. Irwig MS. Depressive symptoms and suicidal thoughts among former users of finasteride with persistent sexual side effects. J Clin Psychiatry. 2012;73(9):1220-1223. [DOI] [PubMed] [Google Scholar]
51. Wiechman S, Kalpakjian CZ, Johnson KL.. Measuring depression in adults with burn injury: a systematic review. J Burn Care Res. 2016;37(5):e415-e426. [DOI] [PubMed] [Google Scholar]
52. Chen M-H, Wei H-T, Su T-P, et al. Risk of depressive disorder among patients with Herpes Zoster: a nationwide population-based prospective study. Psychosom Med. 2014;76(4):285-291. [DOI] [PubMed] [Google Scholar]
53. Wang JM, Tan J, Leenen FHH.. Central nervous system blockade by peripheral administration of AT1 receptor blockers. J Cardiovasc Pharmacol. 2003;41(4):593-599. [DOI] [PubMed] [Google Scholar]
54. Aswar U, Chepurwar S, Shintre S, Aswar M. Telmisartan attenuates diabetes induced depression in rats. Pharmacol Rep. 2017;69(2):358-364. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ocad228_Supplementary_Data

Click here for additional data file.^{(1.2MB, zip)}

Data Availability Statement

[ocad228-B1] 1. Tomic D, Shaw JE, Magliano DJ.. The burden and risks of emerging complications of diabetes mellitus. Nat Rev Endocrinol. 2022;18(9):525-539. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B2] 2. Tabák AG, Akbaraly TN, Batty GD, Kivimäki M. Depression and type 2 diabetes: a causal association? Lancet Diabetes Endocrinol. 2014;2(3):236-245. [DOI] [PubMed] [Google Scholar]

[ocad228-B3] 3. Boehmer K, Lakkad M, Johnson C, Painter JT. Depression and diabetes distress in patients with diabetes. Prim Care Diabetes. 2023;17(1):105-108. [DOI] [PubMed] [Google Scholar]

[ocad228-B4] 4. Ascher-Svanum H, Zagar A, Jiang D, et al. Associations between glycemic control, depressed mood, clinical depression, and diabetes distress before and after insulin initiation: an exploratory, post hoc analysis. Diabetes Ther. 2015;6(3):303-316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B5] 5. Katon W, Pedersen HS, Ribe AR, et al. Effect of depression and diabetes mellitus on the risk for dementia: a national population-based cohort study. JAMA Psychiatry. 2015;72(6):612-619. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B6] 6. Hsieh H-M, Lin C-H, Weng S-F, Lin P-C, Liu T-L, Huang C-J. Health-related quality of life, medical resource use and physical function in patients with diabetes mellitus and depression: a cross-sectional analysis from the National Health and Nutrition Examination Survey. J Affect Disord. 2023;327:93-100. [DOI] [PubMed] [Google Scholar]

[ocad228-B7] 7. Iturralde E, Chi FW, Grant RW, et al. Association of anxiety with high-cost health care use among individuals with type 2 diabetes. Diabetes Care. 2019;42(9):1669-1674. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B8] 8. Wang H-I, Han L, Jacobs R, et al. Healthcare resource use and costs for people with type 2 diabetes mellitus with and without severe mental illness in England: longitudinal matched-cohort study using the Clinical Practice Research Datalink. Br J Psychiatry. 2022;221(1):402-409. [DOI] [PubMed] [Google Scholar]

[ocad228-B9] 9. Tardif I, Guenette L, Zongo A, Demers E, Lunghi C. Depression and the risk of hospitalization in type 2 diabetes patients: a nested case-control study accounting for non-persistence to antidiabetic treatment. Diabetes Metab. 2022;48(4):101334. [DOI] [PubMed] [Google Scholar]

[ocad228-B10] 10. Brüne M, Linnenkamp U, Andrich S, et al. Health care use and costs in individuals with diabetes with and without comorbid depression in Germany: results of the cross-sectional DiaDec study. Diabetes Care. 2021;44(2):407-415. [DOI] [PubMed] [Google Scholar]

[ocad228-B11] 11. Wang X, Song K, Zhu P, Valentijn P, Huang Y, Birch S. How do type 2 diabetes patients value urban integrated primary care in China? Results of a discrete choice experiment. Int J Environ Res Public Health. 2020;17(1):117. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B12] 12. Franco MI, Staab EM, Zhu M, et al. Pragmatic clinical trial of population health, portal-based depression screening: the PORTAL-depression study. J Gen Intern Med. 2023;38(4):857-864. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B13] 13. Li X, Lu J, Hu S, et al. The primary health-care system in China. Lancet. 2017;390(10112):2584-2594. [DOI] [PubMed] [Google Scholar]

[ocad228-B14] 14. Dibato J, Montvida O, Ling J, Koye D, Polonsky WH, Paul SK. Temporal trends in the prevalence and incidence of depression and the interplay of comorbidities in patients with young- and usual-onset type 2 diabetes from the USA and the UK. Diabetologia. 2022;65(12):2066-2077. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B15] 15. de Souza Moreira B, Maria da Cruz Dos Anjos D, Pereira DS, et al. The geriatric depression scale and the timed up and go test predict fear of falling in community-dwelling elderly women with type 2 diabetes mellitus: a cross-sectional study. BMC Geriatr. 2016;16(1):56. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B16] 16. Lavie I, Schnaider Beeri M, Schwartz Y, et al. Decrease in gait speed over time is associated with increase in number of depression symptoms in older adults with type 2 diabetes. J Gerontol A Biol Sci Med Sci. 2023;78(8):1504-1512. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B17] 17. Cherry MG, Brown SL, Purewal R, Fisher PL. Do metacognitive beliefs predict rumination and psychological distress independently of illness representations in adults with diabetes mellitus? A prospective mediation study. Br J Health Psychol. 2023;28(3):814-828. [DOI] [PubMed] [Google Scholar]

[ocad228-B18] 18. Robinson J, Khan N, Fusco L, Malpass A, Lewis G, Dowrick C. Why are there discrepancies between depressed patients’ global rating of change and scores on the Patient Health Questionnaire depression module? A qualitative study of primary care in England. BMJ Open. 2017;7(4):e014519. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B19] 19. Jackson RG, Patel R, Jayatilleke N, et al. Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project. BMJ Open. 2017;7(1):e012012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B20] 20. Hochman E, Feldman B, Weizman A, et al. Development and validation of a machine learning-based postpartum depression prediction model: a nationwide cohort study. Depress Anxiety. 2021;38(4):400-411. [DOI] [PubMed] [Google Scholar]

[ocad228-B21] 21. Irving J, Patel R, Oliver D, et al. Using natural language processing on electronic health records to enhance detection and prediction of psychosis risk. Schizophr Bull. 2021;47(2):405-414. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B22] 22. Wu D, Lam TP.. Underuse of primary care in China: the scale, causes, and solutions. J Am Board Fam Med. 2016;29(2):240-247. [DOI] [PubMed] [Google Scholar]

[ocad228-B23] 23. Searle K, Blashki G, Kakuma R, Yang H, Zhao Y, Minas H. Current needs for the improved management of depressive disorder in community healthcare centres, Shenzhen, China: a view from primary care medical leaders. Int J Ment Health Syst. 2019;13(1):47. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B24] 24. Grazier KL, Smiley ML, Bondalapati KS.. Overcoming barriers to integrating behavioral health and primary care services. J Prim Care Community Health. 2016;7(4):242-248. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B25] 25. Zhang Z, Yan C, Zhang X, Nyemba SL, Malin BA. Forecasting the future clinical events of a patient through contrastive learning. J Am Med Inform Assoc. 2022;29(9):1584-1592. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B26] 26. Krishnan R, Rajpurkar P, Topol EJ.. Self-supervised learning in medicine and healthcare. Nat Biomed Eng. 2022;6(12):1346-1352. [DOI] [PubMed] [Google Scholar]

[ocad228-B27] 27. Simon GE, Johnson E, Lawrence JM, et al. Predicting suicide attempts and suicide deaths following outpatient visits using electronic health records. Am J Psychiatry. 2018;175(10):951-960. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B28] 28. Seymour CW, Kennedy JN, Wang S, et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA. 2019;321(20):2003-2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B29] 29. Liu FT, Ting KM, Zhou Z-H.. Isolation-based anomaly detection. ACM Trans Knowl Discov Data. 2012;6(1):1-39. [Google Scholar]

[ocad228-B30] 30. Lau Raket L, Jaskolowski J, Kinon BJ, et al. Dynamic ElecTronic hEalth reCord deTection (DETECT) of individuals at risk of a first episode of psychosis: a case–control development and validation study. Lancet Digit Health. 2020;2(5):e229-e239. [DOI] [PubMed] [Google Scholar]

[ocad228-B31] 31. Maimaitituerxun R, Chen W, Xiang J, et al. Prevalence of comorbid depression and associated factors among hospitalized patients with type 2 diabetes mellitus in Hunan, China. BMC Psychiatry. 2023;23(1):158. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B32] 32. Liu X, Li Y, Guan L, et al. A systematic review and meta-analysis of the prevalence and risk factors of depression in type 2 diabetes patients in China. Front Med (Lausanne). 2022;9:759499. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B33] 33. Su J, Lu Y, Pan S, Wen B, Liu Y. Roformer: enhanced transformer with rotary position embedding, 2021.

[ocad228-B34] 34. Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17. Curran Associates Inc.; 2017:6000–6010.

[ocad228-B35] 35. Hua W, Dai Z, Liu H, Le Q. Transformer quality in linear time. In: Proceedings of the 39th International Conference on Machine Learning. PMLR, 2022:9099–9117.

[ocad228-B36] 36. Saunshi N, Plevrakis O, Arora S, Khodak M, Khandeparkar H. A theoretical analysis of contrastive unsupervised representation learning. In: Proceedings of the 36th International Conference on Machine Learning. PMLR, 2019:5628–5637.

[ocad228-B37] 37. Radford A, Kim JW, Hallacy C, et al. Learning transferable visual models from natural language supervision. In: Proceedings of the 38th International Conference on Machine Learning. PMLR, 2021:8748–8763.

[ocad228-B38] 38. Jin H, Wu S, Di Capua P.. Development of a clinical forecasting model to predict comorbid depression among diabetes patients and an application in depression screening policy making. Prev Chronic Dis. 2015;12:E142. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B39] 39. Shin D, Lee KJ, Adeluwa T, Hur J. Machine learning-based predictive modeling of postpartum depression. J Clin Med. 2020;9(9):2899. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B40] 40. Garriga R, Mas J, Abraha S, et al. Machine learning model to predict mental health crises from electronic health records. Nat Med. 2022;28(6):1240-1248. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B41] 41. Wei Z, Wang X, Ren L, et al. Using machine learning approach to predict depression and anxiety among patients with epilepsy in China: a cross-sectional study. J Affect Disord. 2023;336:1-8. [DOI] [PubMed] [Google Scholar]

[ocad228-B42] 42. Salton G, Buckley C.. Term-weighting approaches in automatic text retrieval. Inform Process Manage. 1988;24(5):513-523. [Google Scholar]

[ocad228-B43] 43. Meng Y, Speier W, Ong M, Arnold CW. HCET: hierarchical clinical embedding with topic modeling on electronic health records for predicting future depression. IEEE J Biomed Health Inform. 2021;25(4):1265-1272. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B44] 44. Vickers AJ, Elkin EB.. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26(6):565-574. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B45] 45. Sundararajan M, Taly A, Yan Q. Axiomatic attribution for deep networks. In: International Conference on Machine Learning. PMLR, 2017:3319–3328.

[ocad228-B46] 46. Kokhlikyan N, Miglani V, Martin M, et al. Captum: a unified and generic model interpretability library for PyTorch, 2020.

[ocad228-B47] 47. Owens-Gary MD, Zhang X, Jawanda S, Bullard KM, Allweiss P, Smith BD. The importance of addressing depression and diabetes distress in adults with type 2 diabetes. J Gen Intern Med. 2019;34(2):320-324. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B48] 48. Gettings J, Patil O, Vajravelu ME, et al. 871-P: integrating the PHQ-A depression screening tool within the electronic health record increases detection of depression symptoms. Diabetes. 2021;70(Suppl 1):871–P. [Google Scholar]

[ocad228-B49] 49. Song X, Zheng Q, Zhang R, et al. Potential biomarkers for predicting depression in diabetes mellitus. Front Psychiatry. 2021;12:731220. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ocad228-B50] 50. Irwig MS. Depressive symptoms and suicidal thoughts among former users of finasteride with persistent sexual side effects. J Clin Psychiatry. 2012;73(9):1220-1223. [DOI] [PubMed] [Google Scholar]

[ocad228-B51] 51. Wiechman S, Kalpakjian CZ, Johnson KL.. Measuring depression in adults with burn injury: a systematic review. J Burn Care Res. 2016;37(5):e415-e426. [DOI] [PubMed] [Google Scholar]

[ocad228-B52] 52. Chen M-H, Wei H-T, Su T-P, et al. Risk of depressive disorder among patients with Herpes Zoster: a nationwide population-based prospective study. Psychosom Med. 2014;76(4):285-291. [DOI] [PubMed] [Google Scholar]

[ocad228-B53] 53. Wang JM, Tan J, Leenen FHH.. Central nervous system blockade by peripheral administration of AT1 receptor blockers. J Cardiovasc Pharmacol. 2003;41(4):593-599. [DOI] [PubMed] [Google Scholar]

[ocad228-B54] 54. Aswar U, Chepurwar S, Shintre S, Aswar M. Telmisartan attenuates diabetes induced depression in rats. Pharmacol Rep. 2017;69(2):358-364. [DOI] [PubMed] [Google Scholar]

PERMALINK

Applying contrastive pre-training for depression and anxiety risk prediction in type 2 diabetes patients based on heterogeneous electronic health records: a primary healthcare case study

Wei Feng, BBA

Honghan Wu, PhD

Hui Ma, MD

Zhenhuan Tao, MS

Mengdie Xu, MS

Xin Zhang, MS

Shan Lu, PhD

Cheng Wan, MS

Yun Liu, PhD

Abstract

Objective

Materials and Methods

Results

Conclusion

Introduction

Materials and methods

Figure 1.

Data collection and cohorts construction

Discharged EHR dataset of T2DM patients from FAHNMU

Case–control cohorts construction from NHIP

DAP model development

Pre-training process on discharged EHR dataset of T2DM patients

Figure 2.

Fine-tuning process for depression and anxiety prediction

Evaluations and interpretation of models

Implementation details

Results

Descriptive results of FAHNMU and NHIP

Table 1.

Performance of fine-tuning model for depression and anxiety

Internal validation for NHIP and PHS

Figure 3.

Table 2.

External validation on PHS for sub-cohorts

Table 3.

Figure 4.

Figure 5.

Interpretation of DAP model

Table 4.

Discussion

Conclusion

Supplementary Material

Contributor Information

Author contributions

Supplementary material

Funding

Conflict of interest

Data availability

Ethical approval

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases