A Comprehensive Review of Computer-Aided Diagnosis of Major Mental and Neurological Disorders and Suicide: A Biostatistical Perspective on Data Mining

Mahsa Mansourian; Sadaf Khademi; Hamid Reza Marateb

doi:10.3390/diagnostics11030393

2021 Feb 25;11(3):393. doi: 10.3390/diagnostics11030393

Editor: Panteleimon Giannakopoulos

PMCID: PMC7996506 PMID: 33669114

Abstract

The World Health Organization (WHO) suggests that mental disorders, neurological disorders, and suicide are growing causes of morbidity. Depressive disorders, schizophrenia, bipolar disorder, and Alzheimer’s disease and other dementias account for 1.84%, 0.60%, 0.33%, and 1.00% of total Disability Adjusted Life Years (DALYs), respectively. Furthermore, suicide, the 15th leading cause of death worldwide, could be linked to mental disorders. More than 68 computer-aided diagnosis (CAD) methods published in peer-reviewed journals from 2016 to 2021 were analyzed, of which 75% were published in 2018 or later. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol was adopted to select the relevant studies. In addition to the gold standard, the sample size, neuroimaging techniques or biomarkers, validation frameworks, classifiers, and performance indices were analyzed. We further discussed why various performance indices are essential from the biostatistical and data mining perspectives. Moreover, critical information related to the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) guidelines was analyzed. We discussed how balancing the dataset and omitting external validation could hinder the generalization of CAD methods. Finally, we provided a list of the critical issues to consider in such studies.

Keywords: Alzheimer’s disease, bipolar disorder, computer-aided diagnosis, data mining, dementias, depressive disorders, mental disorders, neurological disorders, schizophrenia, validation methods

1. Introduction

Mental health is a state of successful cognitive function resulting in adapting to change and coping with everyday stresses of life [1,2]. Mental disorders refer to a wide range of conditions affecting mood, thinking, and behavior. They could be occasional or chronic [3]. Some major mental disorders include depression, bipolar disorder (BD), and schizophrenia (SZ) [4]. Mental illnesses are globally among the leading causes of disability in Disability Adjusted Life Years (DALYs) [5]. Figure 1 shows the composition of mental disorder DALYs by type of disorder for both sexes combined worldwide from 1990 to 2019 [6]. Depressive disorders (29.74%), followed by anxiety disorders (22.86%), and schizophrenia (11.66%) are the top three contributors to mental disorder DALYs [6].

The contribution of mental disorders to Disability Adjusted Life Years (DALYs) worldwide, for both sexes combined, 2019 [6].

Among mental disorders, depressive disorders account for 1.84%, anxiety disorders for 1.13%, schizophrenia for 0.60%, and BD for 0.33% of total DALYs [6]. As shown in Figure 2 (Source: Institute for Health Metrics Evaluation. Used with permission. All rights reserved.), the countries with the highest age-standardized mental disorder DALY rates in 2019 were Portugal (2603.92), Greece (2510.55), Greenland (2486.44), Iran (2436.44), and Spain (2396.768) DALYs per 100,000 [6]. The World Health Organization (WHO) reported that over 450 million people worldwide suffer from mental disorders [7].

Mental disorders, age-standardized DALY rates (per 100 000) by location, both sexes combined, 2019 (reproduced with permission from [6]).

Every year, almost 25% of people experience a mental disorder [8]. However, due to the lack of access to adequate mental illness services and stigmatization, most patients do not receive help [9]. The increasing rate of mental disorders could be related to political and social violence, economic change, and cultural disruptions [10].

Neurological disorders, in turn, are illnesses that can cause psychological symptoms [11]. Such disorders have become important causes of death and disability worldwide [12]. The primary neurological disorders include Alzheimer’s disease (AD) and other dementias [12]. Figure 3 shows the composition of neurological disorder DALYs by type of disorder for both sexes combined worldwide from 1990 to 2019 [6]. About 20% of neurological disorders are AD and other dementias [13]. Today, almost 35.6 million people suffer from AD worldwide. This number is projected to nearly double to 65.7 million cases by 2030 and to more than triple to 115.4 million cases by 2050 [14]. The rapidly growing number of potential sufferers and the inevitable, enormous economic effects of AD on health and social services have led governments to take swift action to eradicate the disease [15]. Therefore, although AD is not at the top in Figure 3, it could be one of the most critical neurological disorders.

Contribution by neurological disorders to DALYs worldwide, both sexes combined, 2019 [6].

According to the Global Burden of Disease (GBD) 2019, AD and other dementias account for 1% of total DALYs. As shown in Figure 4 (Source: Institute for Health Metrics Evaluation. Used with permission. All rights reserved.), the countries with the highest age-standardized neurological disorder DALY rates in 2019 were Japan (1612.77), Italy (1109.73), Greece (923.58), France (880.49), and Estonia (854.71) DALYs per 100,000 [6].

Alzheimer’s disease and other dementias, age-standardized DALY rates (per 100 000) by location, both sexes combined, 2019 (reproduced with permission from [6]).

Suicide, a death caused by intentional termination of one’s own life, has been recognized by the WHO as a critical public health issue [16]. Each year, around one million people die by suicide [17]. It is also one of the leading causes of death among young people worldwide and, as such, is responsible for a massive amount of needless suffering and a substantial number of premature deaths [18]. Suicide has disruptive psychosocial effects [18] and is thus a global public health issue [19]. Its rates differ considerably between geographic regions, socio-political realities, age groups, and genders [19]. Suicide was among the ten leading causes of death in five GBD regions [20].

WHO data suggest that mental disorders, neurological disorders, and suicide are growing causes of morbidity [16,21]. The World Health Report 2001 and the Mental Health Action Plan 2013–2020 focused on mental disorders such as depression and schizophrenia, some neurological disorders such as AD [22], and suicide [16]. In 2017, mental disorders were the sixth leading cause of DALYs and the second leading cause of disease burden in terms of years lived with disability (YLDs) in the world [23]. Furthermore, neurological disorders ranked as the second-leading cause of death and a major cause of DALYs in 2015 [12]. Suicide is the 15th leading cause of death worldwide [24]. Meanwhile, the total number of deaths from suicide increased by 6.7% globally from 1990 to 2016 [20]. Suicide is also considered the second leading cause of unnatural death among those between 15 and 29 years old [25,26].

Significant proportions of mental and neurological disorders arise in low- and middle-income countries [27,28]. Mental disorders lead to significant social, personal, and economic loss, including functional impairment, psychosocial disability [29], low quality of life [30], and loss of productivity [31]. Patients with mental disorders have a shorter life expectancy than the general population; there is a strong dose–response effect between mortality and psychological distress [32]. Furthermore, milder disorders could impair functional capacity, which causes difficulties in social and marital relations [33].

Although 75.5% of deaths by suicide occur in low- and middle-income countries, the prevalence of suicide is higher in high-income countries [24]. Suicide could be linked to mental disorders [34]. Almost 90% of individuals who died by suicide had at least one mental disorder [35]. Mental disorders contribute between 47% and 74% of suicide risk [18]. Depression was observed in around 50–65% of suicide cases [18]. Schizophrenia, by contrast, accounts for very few youth suicides [36]. Furthermore, associations between suicide and anxiety disorders have been observed [18]. Accordingly, suicide prediction and diagnosis were also analyzed in our study.

When mental disorders go undetected, patients do not receive potentially effective treatment [32]. Long-lasting psychological distress profoundly affects patients’ prospects of a reasonable quality of life, as well as their work capacity and family [32]. It has been shown that early detection of mental disorders could shorten the duration of a disorder, reduce the number of further consultations, and result in less social impairment [32]. Furthermore, early detection of neurological disorders is critical to achieving optimal disease control [37].

There are various methods to detect and diagnose mental and neurological disorders at early stages, from interpreting participants’ answers to questions about their lives to using diagnostic techniques such as electroencephalography (EEG), magnetoencephalography (MEG), positron emission tomography (PET), and magnetic resonance imaging (MRI) [38,39]. However, manual assessment of such recordings is time-consuming and prone to error [39]. Because experts differ in experience, manual diagnosis depends on the examiner and is thus subject to errors and biases. Computer-aided diagnosis (CAD) has recently been used as a second opinion to assist the diagnostic procedure [39].

Machine learning methods have been used in the literature for suicide diagnosis and prediction, with inputs from different sources such as functional MRI (fMRI) [40], clinical and sociodemographic variables [41], information posted on social networks [26], or the Patient Health Questionnaire and other related questionnaires [42]. CAD systems have been used to help clinicians, medical doctors, or neurologists diagnose certain diseases or disorders [43]. The goal of CAD systems is to improve the accuracy of experts interpreting large volumes of medical data so that analysis time is reduced and diagnostic consistency is improved [44]. Numerous CAD frameworks and methods have been developed in the literature to analyze medical signals and images [43]. CAD systems are suitable for complementing the neuropsychological assessments conducted by expert clinicians and for improving prediction accuracy. Accordingly, many studies have used CAD systems to detect mental disorders, neurological disorders, and suicide. This review therefore aimed to analyze the current CAD methods for diagnosing depressive disorders, BD, schizophrenia, AD, dementia, and suicide.

2. Materials and Methods

2.1. Gold Standard

Due to the multiplicity of mental disorders and the importance of proper diagnosis and treatment, there has always been a need to classify these disorders, which led to the publication of the Diagnostic and Statistical Manual of Mental Disorders (DSM). Its latest version, DSM-5, was released in 2013. The Structured Clinical Interview for DSM-5 (SCID-5) is a structured diagnostic interview for diagnosing mental disorders according to the DSM-5 criteria, and it should be administered by a trained clinician. The structure specifies the order of the questions, how the questions are worded, and how the subject’s responses are classified. The primary diagnostic methods are summarized as follows [45].

2.1.1. Depression Disorder

SCID is considered the commonly used gold standard for diagnosing depression. Major depressive disorder (MDD) is a type of depression characterized by separate episodes of at least 14 days. Critical symptoms of MDD are depressed mood, loss of interest, weight loss or weight gain without any particular diet, insomnia or hypersomnia, frequent thoughts of death or suicide, decreased ability to concentrate and think, feelings of being worthless and guilty, psychomotor agitation or retardation, and feelings of energy loss and indecisiveness. A depression diagnosis requires five or more of the above symptoms, at least one of which must be one of the first two [46].
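The counting rule above can be written as a short check. This is a minimal sketch and not a diagnostic instrument: the symptom labels and the function name `meets_mdd_symptom_rule` are hypothetical names chosen for illustration, and the episode duration and the clinician's judgment that SCID relies on are not modeled.

```python
# Hypothetical shorthand labels for the symptoms listed in the paragraph above.
CORE_SYMPTOMS = ("depressed_mood", "loss_of_interest")  # the "first two" symptoms
OTHER_SYMPTOMS = (
    "weight_change",
    "sleep_disturbance",
    "thoughts_of_death_or_suicide",
    "poor_concentration",
    "worthlessness_or_guilt",
    "psychomotor_change",
    "energy_loss_or_indecisiveness",
)


def meets_mdd_symptom_rule(present):
    """Return True if >= 5 listed symptoms are present, including >= 1 core symptom."""
    present = set(present)
    unknown = present - set(CORE_SYMPTOMS) - set(OTHER_SYMPTOMS)
    if unknown:
        raise ValueError(f"unknown symptom labels: {sorted(unknown)}")
    has_core = any(symptom in present for symptom in CORE_SYMPTOMS)
    return len(present) >= 5 and has_core


example = ["depressed_mood", "weight_change", "sleep_disturbance",
           "poor_concentration", "psychomotor_change"]
print(meets_mdd_symptom_rule(example))
```

Five non-core symptoms alone do not satisfy the rule, which is why the core-symptom check is kept separate from the count.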

2.1.2. Bipolar Disorder

SCID is used as the gold standard among diagnostic interviews, but its validity will not be known until related biomarkers are discovered. At least one manic episode is necessary for a diagnosis of bipolar I disorder (BD-I), while at least one hypomanic episode and one major depressive episode, without any manic episode, are required for a diagnosis of bipolar II disorder (BD-II) [47,48].

2.1.3. Schizophrenia

Patients’ descriptions of symptoms, mental state tests, and behavioral observations help psychiatrists diagnose schizophrenia based on DSM-5 criteria, which are the gold standard of diagnosis to date. The most important symptoms are delusions, hallucinations, disorganized speech, grossly disorganized or catatonic behavior, and negative symptoms such as decreased emotional expression. A schizophrenia diagnosis requires two or more of these symptoms, at least one of which must be one of the first three, and each should be present for a considerable period within a month [49,50].

2.1.4. Alzheimer’s

AD is a specific type of dementia. The gold standard hallmarks for a definitive diagnosis of AD are cortical atrophy, amyloid-predominant neuritic plaques, and tau-predominant neurofibrillary tangles validated by postmortem histopathological examination. Amyloid precursor protein (APP), presenilin 1 (PSEN1), and presenilin 2 (PSEN2) are known causative genes of AD, and genetic tests can reveal their mutations in early-onset cases. Furthermore, amyloid-based diagnostic tests such as positron emission tomography (PET) scans and cerebrospinal fluid (CSF) analysis can be useful diagnostic tools [51].

2.1.5. Dementia

In DSM-5, major neurocognitive disorder (NCD) replaces the term dementia that was used in previous versions. A significant decline in the subject’s cognitive performance, for example in learning and memory, that interferes with independent daily activities is a sign of dementia. The Clinical Dementia Rating (CDR) is a cognitive diagnostic assessment widely used as the gold standard for diagnosing dementia. The CDR test is a semi-structured interview with the patient and a reliable informant. It consists of 46 questions, takes 30–90 min to complete, and must be administered by a trained clinician [52,53,54].

2.1.6. Suicide

Validated questionnaires have been used in the literature to identify individuals at high risk of suicidal behaviors [55]. The Suicide Behaviors Questionnaire-Revised (SBQ-R) is a widely used test for identifying individuals at increased risk of suicidal behaviors, including ideation and attempts [56]. The SBQ-R was designed based on the SBQ, a 34-item questionnaire measuring suicidal tendency. It is a self-report test that distinguishes between suicidal and non-suicidal subjects. The SBQ-R includes four Likert-type questions that measure the risk of suicide according to the subject’s lifetime suicide ideation/attempt, the frequency of suicidal ideation in the last year, the communication of suicidal thoughts to others, and the likelihood of suicidal behavior in the future. Each question has different points from 0 to 6 based on the subject’s choice. Two scoring criteria have been proposed so far to classify suicidal and non-suicidal individuals based on SBQ-R results: SBQ-R Item 1 and the SBQ-R total score, which varies between 3 and 18. Clinical and non-clinical samples have an identical cutoff score of 2 on SBQ-R Item 1. The cutoff scores for the SBQ-R total score were 7 and 8 for clinical and non-clinical samples, respectively [42].

2.2. The Literature Review

There are currently not enough biomarkers in psychiatry to distinguish the disease state from the normal state, so diagnosis mostly depends on patient–physician interactions and questionnaires. Clinical observations based on patient self-reports are subjective and can be inaccurate even when they follow DSM-5 criteria, since they cannot identify false positives or distinguish disorders from risk states. This is where artificial intelligence (AI) becomes useful. In psychiatry, AI is a general term that denotes the use of advanced computerized techniques and algorithms to diagnose, prevent, and treat mental disorders, such as automatic speech processing and machine learning algorithms applied to electronic medical databases and health records to assess a patient’s mental state. AI-based interventions can reduce false negative and false positive diagnoses and lessen the stigma associated with disclosing mental illness symptoms to a clinician. They are also affordable and offer significant benefits for patients whose symptoms restrict their movement. AI-based methods do not replace clinicians; they can complement human clinical decisions by providing more comprehensive information to empower the health care system [57,58]. Here, we provide a literature review of CAD systems for suicide, neurological disorders, and mental disorders, focusing on the sample size, input features, classifiers, types of validation, and performance indices.

2.2.1. PRISMA Guideline

We reviewed CAD methods proposed in the literature for the diagnosis and prediction of suicide, neurological disorders, and mental disorders. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [59,60] was proposed in the literature to enrich and standardize medical review papers [61]. We adopted the PRISMA guideline to select the relevant studies.

2.2.2. Search Strategy

A literature search of the PubMed online database for publications between 2016 and 2021 was performed using the terms (“bipolar” OR “bipolar disorder” OR “schizophrenia” OR “suicide” OR “Alzheimer” OR “dementia” OR “major depressive disorder” OR “depression”) AND (“machine learning” OR “deep learning”) AND “accuracy”. The reference lists of the identified publications were also reviewed. Peer-reviewed articles in English on human subjects were analyzed.
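The Boolean structure of the search string can be assembled programmatically, which makes it easier to audit or extend the term lists. The sketch below only rebuilds the string quoted above; the helper names `or_group` and `build_query` are hypothetical, and the 2016 to 2021 date range and the language and species restrictions are assumed to be applied as separate filters in the search interface rather than encoded in the string.

```python
# Term lists copied from the search strategy described above.
DISORDER_TERMS = [
    "bipolar", "bipolar disorder", "schizophrenia", "suicide",
    "Alzheimer", "dementia", "major depressive disorder", "depression",
]
METHOD_TERMS = ["machine learning", "deep learning"]


def or_group(terms):
    """Join quoted terms with OR and wrap the group in parentheses."""
    return "(" + " OR ".join(f'"{term}"' for term in terms) + ")"


def build_query():
    """Combine the disorder group, the method group, and the outcome term with AND."""
    return " AND ".join([or_group(DISORDER_TERMS), or_group(METHOD_TERMS), '"accuracy"'])


print(build_query())
```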

2.2.3. Eligibility Criteria

Published studies were included in the review if they met the following criteria: (1) at least one measure of diagnostic accuracy was provided, and (2) at least one of the classifier, the validation framework, or the validation type was provided. Figure 5 shows a flow diagram describing the study selection process. Among 563 records screened, 71 studies were excluded as irrelevant to the original research question. Among the remaining 492 studies, 424 studies did not meet the eligibility criteria. Thus, 68 studies were included in our analysis.
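The selection counts can be verified with simple arithmetic. The following sketch recomputes the intermediate and final numbers from the three figures reported above; the function name `prisma_flow` and the dictionary keys are hypothetical and chosen only for illustration.

```python
def prisma_flow(screened, excluded_irrelevant, excluded_ineligible):
    """Recompute the study-selection counts from the reported exclusions."""
    assessed = screened - excluded_irrelevant      # records assessed for eligibility
    included = assessed - excluded_ineligible      # studies included in the analysis
    if assessed < 0 or included < 0:
        raise ValueError("exclusions exceed the number of available records")
    return {"screened": screened, "assessed": assessed, "included": included}


# Figures reported in the eligibility paragraph above.
flow = prisma_flow(screened=563, excluded_irrelevant=71, excluded_ineligible=424)
print(flow)
```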

Flow diagram of the study selection process (reproduced with permission from [60]).

2.2.4. Data Abstraction

The following characteristics were recorded for each study included in our analysis: publication reference (the first author’s surname and the year of publication), the sample size, the case and control groups, input features, classifiers, internal or external validation, type of validation (holdout or resampling), and the diagnostic accuracy.
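To make the two recorded validation types concrete, the sketch below contrasts a single holdout split with k-fold resampling on sample indices. It is an illustrative, standard-library example and not the procedure of any reviewed study: the function names, the 30% test fraction, and the fixed seed are assumptions, and no stratification by class is performed.

```python
import random


def holdout_split(n_samples, test_fraction=0.3, seed=0):
    """Shuffle indices once and reserve a single test partition (holdout)."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = max(1, round(n_samples * test_fraction))
    return indices[n_test:], indices[:n_test]  # (train, test)


def k_fold_splits(n_samples, k=10, seed=0):
    """Yield (train, test) index lists so that each sample is tested exactly once."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and the number of samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


train, test = holdout_split(40, test_fraction=0.25)
print(len(train), len(test))
print(sum(1 for _ in k_fold_splits(40, k=10)))
```

Both schemes are internal validation when the indices come from one dataset; external validation would instead evaluate the fitted model on an independently collected sample.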

3. Results

The CAD methods for mental and neurological disorders are listed in Tables 1–7, while the CAD methods for suicide prediction are provided in Tables 8–11.

Table 1.

CAD methods for mental and neurological disorders.

References	Goal	Sample Size	Data	Classifier	Internal/External Validation	Type of Validation	Performance Indices
Lee et al. (2020) [62]	BD-II	BD-II: n = 20, C: n = 20	Blood sample, serum miRNA	Support vector machine (SVM)	Internal	Holdout	AUC: 0.91
Alici et al. (2019) [63]	BD	BD = 80, C = 80	Optical coherence tomography	Logistic regression	-	-	AUC: 0.69
Zhao et al. (2016) [64]	Major depressive disorder (MDD) and BD	C = 44, MDD = 37, BD = 24	Blood sample	Logistic regression	-	-	AUC: 0.86
Haenisch et al. (2016) [65]	BD	C = 44 l, BD = 66 (validation); Test: first-onset MDD = 90, undiagnosed BD = 12, C = 184, pre-diagnostic = 110	Blood sample	Lasso regression	Both	10-fold CV	AUC: 0.8 (BD vs. first-onset MDD), AUC: 0.79 (BD vs. C)
Fernandes et al. (2020) [66]	BD or SZ	Blood-based domain = 323 (BD = 121, SZ = 71, C = 131); cognitive domain = 372 (SZ = 84, C = 171); multi-domain composed of the immune blood-based domain plus the cognitive domain = 279 (BD = 98, SZ = 5, C = 123)	Peripheral blood sample, cognitive biomarkers	Linear discriminant analysis (LDA)	Internal	10-fold CV	(BD vs. C) Accuracy: 80, AUC: 0.86; (SZ vs. C) Accuracy: 86.18, AUC: 0.89; (BD vs. SZ) Accuracy: 76.43, AUC: 0.80
Tsujii et al. (2019) [67]	Distinguishing BD and MDD	C: 58, BD: 79, MDD: 44	Blood sample, NIRS	Logistic regression	-	-	AUC: 0.92
Faurholt-Jepsen et al. (2019) [68]	BD	BD (Euthymia, Depression, Mania): 29, C: 37	Objective smartphone data reflecting behavioral activities	Gradient boosting	Internal	10-fold CV (random oversampling, sampling the minority class with replacement)	AUC: 0.66

C: (healthy) control; BD: Bipolar Disorder; SZ: Schizophrenia; MDD: Major Depressive Disorder; CV: Cross-Validation; AUC: Area Under the ROC Curve.
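Two notions from Table 1 can be illustrated with a short standard-library sketch: the AUC as the probability that a randomly chosen case receives a higher score than a randomly chosen control, and random oversampling of the minority class with replacement. The function names, the 0/1 label coding, and the choice to oversample only the training indices (so that test folds keep their original class proportions) are assumptions made for illustration, not details taken from the cited studies.

```python
import random


def auc(labels, scores):
    """AUC as the fraction of (case, control) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes are required to compute the AUC")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def oversample_minority(train_indices, labels, seed=0):
    """Resample minority-class training indices with replacement until classes match."""
    rng = random.Random(seed)
    by_class = {0: [], 1: []}  # labels are assumed to be coded 0 (control) / 1 (case)
    for i in train_indices:
        by_class[labels[i]].append(i)
    minority, majority = sorted(by_class.values(), key=len)
    if not minority:
        raise ValueError("the training set contains only one class")
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    return list(train_indices) + extra


print(auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
labels = [1, 0, 0, 0, 1, 0]
print(oversample_minority([0, 1, 2, 3], labels))
```

The pairwise AUC computation is quadratic in the sample size, which is acceptable for the small cohorts in Table 1 but would be replaced by a rank-based formula for large datasets.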