Artificial intelligence-based classification of schizophrenia: A high density electroencephalographic and support vector machine study

Sai Krishna Tikka; Bikesh Kumar Singh; S Haque Nizamie; Shobit Garg; Sunandan Mandal; Kavita Thakur; Lokesh Kumar Singh

doi:10.4103/psychiatry.IndianJPsychiatry_91_20

2020 May 15;62(3):273–282. doi: 10.4103/psychiatry.IndianJPsychiatry_91_20

PMCID: PMC7368447 PMID: 32773870

Abstract

Background:

Interview-based schizophrenia (SCZ) diagnostic methods are not completely valid. Moreover, SCZ as a disease entity is very heterogeneous. The supervised machine learning (sML) application of artificial intelligence holds tremendous promise in solving these issues.

Aims:

To assess the sML-based discriminating validity of resting-state electroencephalographic (EEG) quantitative features in classifying SCZ from healthy controls, and positive symptom (PS) from negative symptom (NS) subgroups, using a high-density recording.

Settings and Design:

Data collected at a tertiary care mental-health institute using a cross-sectional study design and analyzed at a premier Engineering Institute.

Materials and Methods:

Data of 38 SCZ patients and 20 healthy controls were retrieved. The positive-negative subgroup classification was done using Positive and Negative Syndrome Scale operational criteria. EEG was recorded using 256-channel high-density equipment. Eight a priori regions of interest were selected. Six-level wavelet decomposition and the kernel Support Vector Machine (SVM) method were used for feature extraction and data classification, respectively.

Statistical Analysis:

Mann–Whitney test was used for comparison of machine learning-features. Accuracy, sensitivity, specificity, and area under receiver operating characteristics-curve were measured as discriminatory indices of classifications.

Results:

The accuracies of classifying SCZ from healthy controls and PS from NS SCZ were 78.95% and 89.29%, respectively. While beta and gamma frequency-related features most accurately classified SCZ from healthy controls, delta and theta frequency-related features most accurately classified positive from negative SCZ. Inferior frontal gyrus features contributed most accurately to both classifications.

Conclusions:

SVM-based classification and sub-classification of SCZ using EEG data is optimal and might help in improving the “validity” and reducing the “heterogeneity” in the diagnosis of SCZ. These results might only be generalized to acute and moderately ill male SCZ patients.

Keywords: Feature-extraction, machine-learning, negative symptoms, positive symptoms, validity

INTRODUCTION

Schizophrenia (SCZ) is clinically diagnosed based on a set of criteria that are elicited by one-to-one interviews. As diagnostic interviews are conducted by a broad set of mental health professionals who vary in terms of qualification, experience, and time at their disposal to conduct the interview, they are bound to lack “reliability.” Structured interview schedules and chart-based diagnoses with structured assessments have been put forth to overcome this problem.[1] Although regarded as possessing “utility,” even such methods are far from being termed “valid.” In the true sense, diagnostic entities defined by clinical criteria are deemed “valid” only when they can be established as truly discrete entities.[2] Moreover, SCZ is a very heterogeneous entity, and in fact, it has been referred to in the plural, i.e., “schizophrenias.”[3,4] The complexity of causal mechanisms and diversity of clinical presentations are considerably greater for SCZ vis-à-vis other psychiatric and medical disorders, at large. Until “personalized psychiatry” becomes a genuine reality, “heterogeneity” continues to pose a notable challenge to the validity of the diagnosis. Categorical/cluster subgrouping of SCZ into more homogeneous entities has been the answer so far. Of the various sub-classifications proposed thus far, Andreasen and Olsen's[5] division of SCZ into “positive” and “negative” clusters has been the most consistently used model in clinical research.[6] More importantly, this long-trusted division has been shown to be the most reliable in identifying the best treatment for a given individual.[7]

With a critical focus on the validity of clinical interview-based diagnosis and growing emphasis on enabling “personalized” or “precision” treatment, the need to identify biomarkers for diagnosing SCZ has risen tremendously.[8] Electroencephalographic (EEG) activity, particularly resting-state activity, has been proposed as a potential biomarker for diagnosing SCZ.[9,10,11,12] Besides the cost-effectiveness and unparalleled temporal resolution inherent to EEG, studying resting-state activity enhances the ease and reproducibility of the procedure. In fact, some studies have shown resting-state EEG to differentiate positive and negative symptom (NS) subgroups of SCZ.[13,14,15] However, by and large, the ability of resting-state EEG to distinguish subgroups is weak when other study results are considered.[10]

This lack of consistency in the association between conventional EEG markers, such as spectral power and coherence, and SCZ psychopathology has paved the way for more advanced methods such as machine learning (ML). ML, an application of artificial intelligence (AI), enables machines to learn automatically and improve with progressive experience without being programmed explicitly.[16]

Broadly, ML paradigms are classified into (1) supervised learning, (2) unsupervised learning, and (3) reinforcement learning. Unsupervised and reinforcement learning paradigms are used in big data visualization and robotics, respectively. Supervised ML (sML) is used for classification and regression, and hence becomes pertinent to the context of classifying SCZ. sML-based classification has shown “exuberant” promise in solving the problem of heterogeneity in SCZ.[17,18] sML-based approaches for discriminating EEG patterns of SCZ and healthy individuals have gained much attention in recent years, and several studies have investigated ML-based diagnosis of SCZ using EEG data.[19,20,21,22] However, classifying positive and negative subgroups of SCZ using sML-based tools has not been explored.

Against this background, we aimed to assess the sML-based discriminating validity of resting-state EEG quantitative features in classifying SCZ from healthy controls, and positive from negative subgroups, using high-density recording. We also intended to identify the specific frequencies and regions that classify the groups most accurately.

MATERIALS AND METHODS

Study design

We used a cross-sectional study design with a comparative group.

Participants

The data were retrieved from a project titled “transcranial magnetic stimulation in modulating neurodevelopmental factors in SCZ” (EEG was one of the secondary outcome variables) that was approved by the Institute Ethics Committee of a tertiary care mental-health institute in Eastern India; this project was registered in the Clinical Trials Registry India (2014-12-005280) prior to recruitment of subjects. Written informed consent was taken from all the participants (and their legally qualified representatives in the case of patients) before enrolling them in the study. Data of a total of 38 patients and 20 healthy controls were retrieved. Patients were recruited by purposive sampling from various inpatient wards of the institute. Included were right-handed male patients in the age group of 18–50 years, having a diagnosis of SCZ as per the ICD-10 DCR[23] and on a stable dose of antipsychotic medications, i.e., no change in the dosage for at least the last 5 days. These patients were recruited within a week of admission to the hospital for the first episode or for an acute exacerbation or relapse; all patients were deemed “symptomatic.” Patients having a history of neurological illness, significant head injury, comorbid substance dependence (excluding nicotine and caffeine), other psychiatric disorder, significant medical disorders (such as uncontrolled diabetes, hypertension, and tuberculosis), disruptive behavior (suicidal or homicidal) that warranted immediate intervention, or a history of electroconvulsive therapy within the previous 6 months were excluded. The healthy control (HC) group included right-handed, age-matched subjects recruited from among the hospital staff and the community living in the vicinity of the institute. The HC group did not include subjects with any psychiatric or neurological illnesses, substance dependence (excluding nicotine and caffeine), or significant medical disorders.

We defined the patient sample based on following criteria for categorizing positive and negative sub-groups according to Andreasen et al.:[24]

Criteria a: Scores ≥4 on P1, P3, and G9 of the Positive and Negative Syndrome Scale (PANSS)[25]
Criteria b: Scores ≥4 on N1, N4 and N6 of PANSS.

Those fulfilling “criteria a” and “criteria b” exclusively were designated as the positive symptom (PS) and negative symptom (NS) groups, respectively. Patients who did not exclusively meet either of these criteria (i.e., fulfilled both criteria [n = 8] or neither of the two [n = 2]) were not included in the sub-analysis. Consequently, 18 patients were assigned to the PS group and 10 patients to the NS group.
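The exclusive assignment rule above can be sketched in a few lines. This is a minimal illustration, not the study's code: the function names are hypothetical, and it assumes that a criterion is met only when all three listed PANSS items score ≥4, which the text does not state explicitly.

```python
from typing import Dict, Optional

POSITIVE_ITEMS = ("P1", "P3", "G9")  # criteria a
NEGATIVE_ITEMS = ("N1", "N4", "N6")  # criteria b
THRESHOLD = 4


def meets(scores: Dict[str, int], items) -> bool:
    # Assumption: every listed item must reach the threshold.
    return all(scores[item] >= THRESHOLD for item in items)


def assign_subgroup(scores: Dict[str, int]) -> Optional[str]:
    """Return "PS", "NS", or None when the patient is left out of the sub-analysis."""
    a = meets(scores, POSITIVE_ITEMS)
    b = meets(scores, NEGATIVE_ITEMS)
    if a and not b:
        return "PS"
    if b and not a:
        return "NS"
    return None  # both criteria met, or neither


patient = {"P1": 5, "P3": 4, "G9": 4, "N1": 2, "N4": 3, "N6": 1}
print(assign_subgroup(patient))
```

Patients meeting both criteria or neither return `None`, mirroring the 10 patients excluded from the sub-analysis.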

Tools

Clinical assessments

Relevant sociodemographic and clinical data were collected from all the participants. Handedness was assessed using the Sidedness Bias Schedule-Hindi version.[26] The baseline severity of psychopathology in patients was evaluated by administering PANSS.[25] Healthy controls were screened with the General Health Questionnaire-12;[27] only those with scores <3 were included.

Electroencephalographic recording

All participants underwent an EEG recording. The recording was carried out between 0900 and 1200 h at the Institute's Centre for Cognitive Neurosciences. Participants were advised to avoid tea, coffee, and nicotine for at least 1 h before recording. In a light- and sound-attenuated room, 10 min of resting-state EEG was recorded for each participant while sitting, with eyes closed, on a reclining chair. EEG was acquired on the Geodesic EEG System 400 (Electrical Geodesics, Inc., Eugene, Oregon, USA) with 256-channel Geodesic Sensor Nets; electrodes were placed according to the international 10–10 system of electrode placement [Figure 1]. Eye movement potentials were monitored using the right and left electrooculogram channels. Electrode impedance was kept <50 kΩ. EEG was filtered (time constant – 0.1 s, high-frequency filter – 120 Hz), digitized (sampling rate – 256 Hz), and artifact-rejected using Net Station 5.1 software (Electrical Geodesics, Inc., Eugene, Oregon, USA). Artifact rejection was performed on a moving average of 80 ms, with thresholds for bad channels (5 or more; >200 mV), eye movements (>55 mV for >20% of each 6-s epoch), and eye blinks (>140 mV for >20% of each epoch). The EEG signals were further preprocessed using EEGLAB software for baseline correction and common average referencing.[28]

Figure 1. Channel placement of the 256-sensor net according to the international 10–10 system

Analysis

Feature extraction

First, 60-s epochs of artifact-free EEG data were visually selected from each recording after carefully excluding segments with eye movement, blink and electromyogram, movement, electrode, and perspiration artifacts or drowsiness changes.

EEG signals corresponding to eight regions of interest (ROIs), namely the right and left inferior frontal gyrus (IFG), dorsolateral prefrontal cortex (DLPFC), inferior parietal lobule (IPL), and superior temporal gyrus (STG) [Figure 1], were averaged. Channels corresponding to these regions were selected as per estimated anatomical cortical projections for the international 10–10 system of electrode placement.[29] Six-level wavelet decomposition with the Daubechies-9 mother wavelet was carried out to extract five EEG bands, namely delta (0–4 Hz), theta (4–8 Hz), alpha (8–12 Hz), beta (16–32 Hz), and gamma (32–64 Hz), as shown in Supplementary Figure 1. Wavelet decomposition results in approximation (cA) and detail (cD) coefficients corresponding to the low-frequency and high-frequency components, respectively, of EEG signals. As shown in Supplementary Figure 1, the gamma and beta bands are obtained after the 3rd and 4th level decomposition, respectively, from the detail coefficients cD. Further, the alpha band is obtained after a first-level decomposition of the detail coefficients cD5. Finally, the delta and theta bands are obtained at the 6th level of wavelet decomposition.
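The mapping from decomposition level to frequency band follows from simple arithmetic: each level halves the remaining frequency range, so the nominal band of every coefficient set depends only on the sampling rate. The sketch below is illustrative only. The function name `dyadic_bands` is hypothetical, and the value of 512 Hz used in the example is an assumption chosen because it reproduces the level numbering described above (gamma at level 3, beta at level 4, theta and delta at level 6); it is not a value taken from the text.

```python
def dyadic_bands(fs: float, levels: int) -> dict:
    """Nominal frequency ranges (Hz) of the coefficients of a `levels`-level DWT."""
    bands = {}
    upper = fs / 2.0  # Nyquist frequency
    for level in range(1, levels + 1):
        lower = upper / 2.0
        bands[f"cD{level}"] = (lower, upper)  # detail: upper half of what remains
        upper = lower
    bands[f"cA{levels}"] = (0.0, upper)  # approximation: everything below
    return bands


# Assumed sampling rate, for illustration only.
for name, (low, high) in dyadic_bands(fs=512.0, levels=6).items():
    print(f"{name}: {low:g}-{high:g} Hz")
```

Under this assumption cD5 spans 8–16 Hz, so one further split of cD5 is needed to isolate 8–12 Hz, in line with the description of how the alpha band is obtained.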

After extracting the five EEG bands, the corresponding time-domain signals were obtained using wavelet reconstruction. Twelve statistical features, namely mean, kurtosis, skewness, entropy, variance, standard deviation, minimum value, maximum value, range, crest factor, form factor, power, were then extracted from each band resulting in 60 features from each EEG signal. This step was repeated for all the eight selected ROIs, resulting in 480 features.
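A minimal sketch of the twelve features for one band-limited signal is given below, using only the Python standard library. Several definitions are assumptions, since the text names the features without defining them: entropy is taken as the Shannon entropy of a 10-bin amplitude histogram, crest factor as peak absolute amplitude divided by RMS, form factor as RMS divided by the (signed) mean, power as the mean square, and kurtosis as the non-excess fourth standardized moment. The function name is hypothetical.

```python
import math
from collections import Counter


def statistical_features(signal, n_bins=10):
    """Return the twelve named features for one band-limited signal (len >= 2)."""
    n = len(signal)
    mean = sum(signal) / n
    deviations = [v - mean for v in signal]
    m2 = sum(d ** 2 for d in deviations) / n  # central moments
    m3 = sum(d ** 3 for d in deviations) / n
    m4 = sum(d ** 4 for d in deviations) / n
    variance = sum(d ** 2 for d in deviations) / (n - 1)  # sample variance
    power = sum(v ** 2 for v in signal) / n  # mean square
    rms = math.sqrt(power)
    lowest, highest = min(signal), max(signal)
    # Assumed entropy definition: Shannon entropy (bits) of an amplitude histogram.
    width = (highest - lowest) / n_bins or 1.0
    counts = Counter(min(int((v - lowest) / width), n_bins - 1) for v in signal)
    entropy = -sum((c / n) * math.log2(c / n) for c in counts.values())
    return {
        "mean": mean,
        "kurtosis": m4 / m2 ** 2 if m2 > 0 else math.nan,
        "skewness": m3 / m2 ** 1.5 if m2 > 0 else math.nan,
        "entropy": entropy,
        "variance": variance,
        "standard_deviation": math.sqrt(variance),
        "minimum": lowest,
        "maximum": highest,
        "range": highest - lowest,
        "crest_factor": max(abs(lowest), abs(highest)) / rms if rms > 0 else math.nan,
        "form_factor": rms / mean if mean != 0 else math.nan,
        "power": power,
    }


features = statistical_features([1.0, 2.0, 3.0, 4.0])
print(len(features), "features per band;", len(features) * 5 * 8, "in total")
```

With a signed mean close to zero, as is typical of band-limited EEG, a form factor defined this way can become very large in either direction.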

Statistical significance analysis

The statistical significance of the 480 extracted features was assessed using the non-parametric Mann–Whitney U-test. This test was conducted using the Statistical Package for Social Sciences (SPSS) Version 21.0 (IBM Corp., Armonk, NY, USA) at a 95% confidence level. All features with P < 0.05 were considered statistically significant. Only the significant features were used for analysis in the next step.
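For readers unfamiliar with the test, the following is a minimal sketch of a two-sided Mann–Whitney U-test for a single feature. It uses the normal approximation without tie or continuity correction, which is an assumption made for brevity; SPSS may report exact or corrected P values, so results can differ slightly. The function names are hypothetical.

```python
import math


def midranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def mann_whitney_u(x, y):
    """Return (U, two-sided P) using the normal approximation."""
    n1, n2 = len(x), len(y)
    ranks = midranks(list(x) + list(y))
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2.0
    u = min(u1, n1 * n2 - u1)
    mu = n1 * n2 / 2.0
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)  # no tie correction
    z = (u - mu) / sigma
    return u, math.erfc(abs(z) / math.sqrt(2.0))


u, p = mann_whitney_u([1, 2, 3], [4, 5, 6])
print(u, p < 0.05)
```

In the study's setting this test would be applied once per feature, i.e. 480 times per experiment.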

Classification using machine learning

Figure 2 shows the scheme for using the sML method to classify the EEG data sets.

Figure 2. General machine learning pipeline for classification of positive and negative samples. In experiment 1, schizophrenia was considered the positive group and healthy the negative group. In experiment 2, positive symptom was considered the positive group and negative symptom the negative group

The dataset was divided into two groups: training and testing. The training group was used to develop the ML model, while the testing group was used to evaluate its performance. During the training phase, the classifier was supplied with both the EEG features and the corresponding class/group (ground truth). In the testing phase, the model was supplied only with the EEG features and predicted the category of each sample.

The data division strategies used were the “hold-out” and “k-fold” cross-validation methods. In the hold-out method, the dataset was divided randomly into two groups, wherein 67% of the samples were used for training and 33% of the samples were used for testing the Support Vector Machine (SVM) classifier model. In the k-fold cross-validation protocol, the whole dataset was split into “k” groups, each consisting of an approximately identical number of samples. Out of the “k” groups, “k-1” groups were used for training, while the remaining group was used for testing the SVM classifier model. The process was repeated “k” times, and the average performance over the “k” rounds was calculated. The value of k = 10 was used in this study for both experiments. Two classification experiments were performed in this study: (1) SCZ versus healthy control (HC) group and (2) positive symptom (PS) versus NS group. The whole dataset, i.e., n = 58 (38 SCZ + 20 healthy) in experiment-1 and n = 28 (18 PS + 10 NS) in experiment-2, was divided into training and testing groups.
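The two data division protocols can be sketched as follows. This is an illustrative sketch with hypothetical function names; it splits sample indices purely at random, and whether the study stratified the splits by group is not stated, so no stratification is assumed here.

```python
import random


def hold_out_split(n_samples, test_fraction=0.33, seed=0):
    """Return (train_indices, test_indices) for a single random split."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]


def k_fold_splits(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) for each of the k rounds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # fold sizes differ by at most one
    for i in range(k):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, folds[i]


train, test = hold_out_split(58)
print(len(train), len(test))
print([len(test_fold) for _, test_fold in k_fold_splits(58, k=10)])
```

With n = 58 the hold-out test set has 19 samples, and with n = 28 it has 9, so a single misclassification shifts the hold-out accuracy by roughly 5 and 11 percentage points, respectively.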

For training, SVM, one of the most popular sML methods, was used. In SVM, the machine constructs candidate hyperplanes from the training data set that separate the two groups. The hyperplane that maintains the maximum margin between the samples of one group and those of the other is chosen as the optimal hyperplane for classification.[30] In the SVM classifier, the kernel function maps the input to a suitable feature space. Non-linear kernels are used when the decision boundary between the two groups to be classified is nonlinear. Several SVMs with different kernel functions were used for classification in the present study, namely linear SVM, quadratic SVM, cubic SVM, fine Gaussian SVM, medium Gaussian SVM, and coarse Gaussian SVM. Supplementary Table 1 shows the parameters used for the different SVM classifiers.

Supplementary Table 1.

Different types of support vector machine classifier and the parameters used in this study

Classification method	Kernel type	Description
Linear SVM	Linear kernel
Quadratic SVM	Polynomial kernel
Cubic SVM	Polynomial kernel
Fine Gaussian SVM	Gaussian radial basis function
Medium Gaussian SVM	Gaussian radial basis function
Coarse Gaussian SVM	Gaussian radial basis function


f is the kernel function, x_i and x_j are the input feature vectors and σ is kernel parameter. SVM – Support vector machine
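For orientation, the standard textbook forms of these three kernel families, written in the notation of the footnote, are given below. They are a reference sketch under stated assumptions: the offset of 1 in the polynomial kernel and the 2σ² scaling in the Gaussian kernel are common conventions, and the exact parametrization and parameter values used in the study may differ.

```latex
\begin{align}
f_{\text{linear}}(x_i, x_j) &= x_i^{\top} x_j \\
f_{\text{polynomial}}(x_i, x_j) &= \left(1 + x_i^{\top} x_j\right)^{d},
  \qquad d = 2 \ (\text{quadratic}), \quad d = 3 \ (\text{cubic}) \\
f_{\text{Gaussian}}(x_i, x_j) &= \exp\!\left(-\frac{\lVert x_i - x_j \rVert^{2}}{2\sigma^{2}}\right)
\end{align}
```

In this form, the fine, medium, and coarse Gaussian variants differ only in the kernel parameter σ: a small σ yields a highly flexible decision boundary, and a large σ a smoother one.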

Performance evaluation

The testing performance of the SVM classifier model was evaluated using four evaluation measures: accuracy (ACC), sensitivity (SEN), specificity (SPE), and area under the receiver operating characteristic curve (AUC) [Supplementary Table 2].

Supplementary Table 2.

Different performance measures used for evaluation of support vector machine classifier

Measure	Description	Mathematical expression
Accuracy	Percentage of correctly classified samples	(tp + tn)/(tp + tn + fp + fn) × 100
Sensitivity	Percentage of correctly classified samples belonging to schizophrenia (experiment 1)/positive symptoms (experiment 2)	tp/(tp + fn) × 100
Specificity	Percentage of correctly classified samples belonging to the healthy group (experiment 1)/negative symptoms (experiment 2)	tn/(tn + fp) × 100
Area under receiver operating characteristic curve	A common measure of sensitivity and specificity	(sensitivity + specificity)/2


tp – True positives (number of correctly classified positive samples), tn – True negatives (number of correctly classified negative samples), fp – False positives (number of negative samples wrongly classified as positive), fn – False negatives (number of positive samples wrongly classified as negative). In experiment 1 – Schizophrenia was considered as positive group and healthy was considered as negative group. In experiment 2 – Positive symptom was considered as positive group and negative symptom was considered as negative group. SVM – Support vector machine; AUC – Area under receiver operating characteristic curve
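The four measures can be computed from a confusion matrix as sketched below. Two points are assumptions rather than statements from the text: the AUC is taken as the mean of sensitivity and specificity (a single operating point), which is consistent with the values reported in Table 3, and the example counts are back-calculated from the reported hold-out percentages for a 19-sample test set rather than reported by the authors. The function name is hypothetical.

```python
def classification_metrics(tp, tn, fp, fn):
    """Return ACC, SEN, SPE and AUC as percentages rounded to two decimals."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    auc = (sensitivity + specificity) / 2.0  # assumed single-point AUC
    values = {"ACC": accuracy, "SEN": sensitivity, "SPE": specificity, "AUC": auc}
    return {name: round(100.0 * value, 2) for name, value in values.items()}


# Hypothetical counts: 13 SCZ and 6 HC test samples, inferred, not reported.
print(classification_metrics(tp=12, tn=3, fp=3, fn=1))
```

Because specificity here rests on only six control samples, each misclassified control changes it by about 17 percentage points.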

This article was written, and the findings of the study reported, in accordance with the available guidelines for reporting ML studies.[31]

RESULTS

Sample characteristics

Table 1 shows the comparison of sociodemographic and clinical variables between the groups: SCZ versus HC and PS versus NS. On SCZ versus HC comparison, both groups were comparable on all variables except employment and habitat. A significantly higher number of SCZ patients than healthy controls were unemployed and belonged to a rural habitat. As the scores on PANSS were the basis for the PS-NS subdivision, on PS versus NS comparison, significantly higher positive syndrome scores in the PS group and significantly higher negative syndrome scores in the NS group were found. These sub-groups were comparable on other clinical variables.

Table 1.

Comparison of sociodemographic and clinical variables across the groups

Variables	SCZ (n=38) Mean±SD/n (%)	HC (n=20) Mean±SD/n (%)	t/χ², df=1, 57	PS (n=18) Mean±SD/n (%)	NS (n=10) Mean±SD/n (%)	t/χ², df=1, 27
Age (years)	31.56±7.05	29.95±3.78	0.93	32.50±8.71	29.30±4.71	1.07
Marital status
Unmarried	15 (39.5)	8 (40)	0.01	8 (44.4)	3 (30)	0.56
Married	23 (60.5)	12 (60)		10 (55.6)	7 (70)
Religion
Hindu	28 (73.7)	18 (90)	2.17	12 (66.7)	8 (80)	0.58
Non-Hindu	10 (26.3)	2 (10)		6 (33.3)	2 (20)
Education^f
Illiterate/primary	13 (34.2)	7 (35)	4.45	3 (16.7)	4 (40)	5.72
Secondary	7 (18.4)	0 (0)		7 (38.9)	0 (0)
Graduate	18 (47.4)	13 (65)		8 (44.4)	6 (60)
Employment
Unemployed	30 (78.9)	6 (30)	13.33***	5 (27.8)	2 (20)	0.20
Employed	8 (21.1)	14 (70)		13 (72.2)	8 (80)
Socioeconomic status^f
Lower	16 (42.1)	3 (15)	4.40	5 (27.8)	5 (50)	1.74
Middle	21 (55.3)	16 (80)		12 (66.7)	5 (50)
Higher	1 (2.6)	1 (5)		1 (5.6)	0 (0)
Habitat^f
Rural	23 (60.5)	5 (25)	8.20*	9 (50)	7 (70)	2.14
Semi urban	3 (7.9)	6 (30)		3 (16.7)	0 (0)
Urban	12 (31.6)	9 (45)		6 (33.3)	3 (30)
Duration of illness (months)	47.18±26.04			43.22±27.40	55.20±31.44	1.05
Chlorpromazine equivalents	425.66±163.65			438.33±127.67	418.92±198.90	0.19
PANSS
Positive	21.84±5.23			25.83±3.36	16.90±2.51	7.31***
Negative	20.94±8.11			15.56±3.28	30.70±5.54	9.14***
General psychopathology	29.24±8.36			27.61±5.91	33.50±9.74	2.00
Total	72.03±13.31			69.00±8.32	81.10±12.06	3.14**


*P<0.05; **P<0.01; ***P<0.001; ^fFisher exact test used. SCZ – Schizophrenia patients; HC – Healthy controls; PS – Schizophrenia patients with predominantly positive symptoms; NS – Schizophrenia patients with predominantly negative symptoms; PANSS – Positive and negative syndrome scale; SD – Standard deviation

Significant machine-learning-based features

Schizophrenia versus healthy controls (experiment-1)

Comparison of all the extracted features (12 features × 5 frequencies × 8 regions = 480) between the patient and healthy control groups found 36 features to be significantly different [Table 2A]; these mostly belonged to high-frequency activity (beta and gamma) in all ROIs.

Table 2.

Features found to differ significantly between the groups

A. Experiment-1: SCZ (n=38) versus HC (n=20)

Regions	Frequency	Feature	Mean±SEM		P

			SCZ	HC
Left IFG	Theta	Entropy	2.7352±0.07582	2.4468±0.06875	0.003
		Variance	92.1948±57.55483	121.8455±81.11564	0.033
		SD	5.7851±1.75544	6.0636±1.43535	0.038
		Power	92.1886±57.55100	121.8374±81.11023	0.033
	Gamma	Skewness	0.1112±0.06864	−0.0382±0.02244	0.003
Right IFG	Delta	Skewness	−0.3342±0.22691	0.2836±0.08275	0.016
	Beta	Skewness	0.1463±0.10157	−0.0264±0.02857	0.039
	Gamma	Skewness	0.1203±0.07139	−0.0278±0.03549	0.014
Left DLPFC	Delta	Skewness	−0.6782±0.20258	−0.0545±0.09706	0.008
	Delta	Crest factor	3.0215±0.13749	3.6768±0.17393	0.023
	Gamma	Skewness	0.1165±0.06885	−0.0054±0.01480	0.015
Right DLPFC	Delta	Skewness	−0.3277±0.26277	0.3257±0.10590	0.024
	Beta	Skewness	0.1520±0.10229	0.0255±0.02659	0.010
	Gamma	Skewness	0.1208±0.07167	−0.0170±0.00951	0.002
Left IPL	Theta	Form factor	1955.7576±1398.23182	−552.9008±365.59427	0.012
Left IPL	Gamma	Mean	−0.0001±0.00009	0.0000±0.00004	0.036
Right IPL	Alpha	Variance	32.3363±14.53552	32.4062±25.70312	0.031
		SD	4.2728±0.85405	3.1815±0.77514	0.041
		Power	32.3342±14.53455	32.4041±25.70141	0.031
	Beta	Variance	44.6849±16.87896	33.8945±20.78437	0.036
		SD	5.3190±0.91786	3.8982±0.70642	0.039
		Minimum	−58.7084±21.69012	−25.9060±9.70250	0.010
		Maximum	61.7134±23.12278	26.2544±10.16121	0.005
		Range	120.4218±44.76786	52.1604±19.86322	0.010
		Power	44.6819±16.87784	33.8922±20.78298	0.036
	Gamma	Variance	43.6071±29.07713	15.9474±12.94389	0.008
		SD	3.9643±1.20929	2.1990±0.54751	0.011
		Maximum	61.7252±27.53687	19.6251±9.35317	0.036
		Range	124.1787±55.65012	38.7074±18.20058	0.048
		Power	43.6041±29.07519	15.9463±12.94303	0.008
Left STG	Theta	Form factor	734.5629±1152.24726	−339.0403±789.73024	0.048
	Gamma	Variance	374244.9815±374230.11109	15.8634±12.85715	0.033
		SD	64.7792±61.99634	2.1937±0.54428	0.039
		Power	374220.0319±374205.16242	15.8624±12.85629	0.033
Right STG	Beta	Maximum	61.5651±23.25161	27.9247±10.31370	0.041
		Range	119.5335±45.01290	55.2043±20.07156	0.048

B. Experiment-2: PS (n=18) versus NS (n=10)

Regions	Frequency	Feature	Mean±SEM		P

			PS	NS

Left IFG	Delta	Mean	−0.0038±0.00208	0.0011±0.00064	0.021
	Delta	Crest factor	4.1242±0.29585	2.9817±0.18331	0.014
	Theta	Mean	0.0031±0.00157	−0.0012±0.00055	0.027
	Theta	Form factor	953.9041±1738.82570	−1961.1192±816.33388	0.002
Right IFG	Delta	Mean	−0.0016±0.00131	0.0020±0.00090	0.024
		Variance	36089.0665±9247.95745	2906.2327±1010.58574	0.035
		SD	134.2456±23.89547	43.8882±8.09026	0.049
		Minimum	−463.3971±82.96620	−139.9365±23.01746	0.027
		Range	1090.9231±209.12290	287.8238±56.07730	0.039
		Power	36086.6608±9247.34097	2906.0389±1010.51837	0.035
	Theta	Mean	0.0003±0.00115	−0.0021±0.00080	0.031
Left DLPFC	Delta	Form factor	−11239.8840±12182.92907	8501.4417±3611.42483	0.007
	Theta	Form factor	581.4022±241.65339	−1426.4185±361.01432	<0.001
	Beta	Kurtosis	16.1049±10.66669	40.2326±35.88180	0.017
Right DLPFC	Delta	Mean	−0.0051±0.00199	0.0022±0.00140	0.007
	Delta	Entropy	1.1264±0.07445	1.2432±0.04452	0.049
	Theta	Mean	0.0029±0.00132	−0.0021±0.00118	0.010
	Alpha	Mean	0.0031±0.00108	−0.0002±0.00028	0.035
	Beta	Mean	−0.0009±0.00028	0.0000±0.00019	0.027
Left IPL	Delta	Mean	0.0143±0.00337	−0.0002±0.00137	0.007
	Theta	Mean	−0.0085±0.00195	0.0009±0.00122	0.004
		Kurtosis	16.1318±4.67997	4.4192±0.34235	0.027
		Crest factor	5.8362±0.50224	4.2454±0.14854	0.017
	Alpha	Skewness	−0.4935±0.18339	−0.0008±0.00808	0.006
Right IPL	Alpha	Form factor	957.9047±361.44474	−1626.4591±1132.25393	0.019
	Beta	Kurtosis	5.8536±1.82523	4.8403±0.32138	0.049
	Beta	Crest factor	4.9846±0.38732	5.7063±0.31853	0.027
Left STG	Alpha	Form factor	−4635.7358±4093.85566	1340.2171±666.57497	0.017


SEM – Standard error of mean; IFG – Inferior frontal gyrus; DLPFC – Dorsolateral prefrontal cortex; IPL – Inferior parietal lobule; STG – Superior temporal gyrus; SCZ – Schizophrenia patients; HC – Healthy controls; PS – Schizophrenia patients with predominant positive symptoms; NS – Schizophrenia patients with predominant negative symptoms

Positive symptoms versus negative symptoms (experiment-2)

On comparing SCZ patients with predominantly PS and those with predominantly NS, 28 of the extracted features were found to be significantly different [Table 2B]; these mostly belonged to low-frequency activity (delta, theta, and alpha) in the bilateral IFG, DLPFC, and IPL.

Machine-learning-Based classification

Schizophrenia versus healthy controls (experiment-1)

Across the data division protocols [Table 3A], the “hold-out” method using quadratic SVM classified the originally grouped SCZ patients and healthy controls most accurately (accuracy, 78.95%; sensitivity, 92.31%; specificity, 50%; AUC, 71.15%). Further, when significant features specific to individual ROIs were input separately to the model, we found gamma activity-related features in the right IFG to be most accurate (accuracy, 78.95%; sensitivity, 84.62%; specificity, 66.67%; AUC, 75.64%), followed by the left DLPFC (accuracy, 73.68%; sensitivity, 91.67%; specificity, 42.86%; AUC, 67.26%) and the right STG (accuracy, 73.68%; sensitivity, 100%; specificity, 16.67%; AUC, 58.33%).

Table 3.

Classification results using significant features

A. Experiment-1: SCZ (n=38) versus HC (n=20)

10-fold method				SVM Model	Hold-out method

ACC (%)	SEN (%)	SPE (%)	AUC (%)		ACC (%)	SEN (%)	SPE (%)	AUC (%)
72.41	86.84	45.00	65.92	Linear SVM	73.68	92.31	33.33	62.82
70.69	78.95	55.00	66.97	Quadratic SVM	78.95^††	92.31	50.00	71.15
63.79	68.42	55.00	61.71	Cubic SVM	68.42	76.92	50.00	63.46
70.69	97.37	20.00	58.68	Fine Gaussian SVM	68.42	100.00	0.00	50.00
63.79	92.11	10.00	51.05	Medium Gaussian SVM	73.68	100.00	16.67	58.33
65.52	97.37	5.00	51.18	Coarse Gaussian SVM	68.42	100.00	0.00	50.00

B. Experiment-2: PS (n=18) versus NS (n=10)

10-fold method				SVM Model	Hold-out method

ACC (%)	SEN (%)	SPE (%)	AUC (%)		ACC (%)	SEN (%)	SPE (%)	AUC (%)

85.71	88.89	80.00	84.44	Linear SVM	88.89	100.00	75.00	87.50
82.14	83.33	80.00	81.67	Quadratic SVM	88.89	100.00	75.00	87.50
82.14	83.33	80.00	81.67	Cubic SVM	88.89	100.00	75.00	87.50
64.29	100.00	0.00	50.00	Fine Gaussian SVM	55.56	100.00	0.00	50.00
89.29^††	100.00	70.00	85.00	Medium Gaussian SVM	77.78	100.00	50.00	75.00
64.29	100.00	0.00	50.00	Coarse Gaussian SVM	55.56	100.00	0.00	50.00


^†† Model showing highest accuracy. SVM – Support vector machine; ACC – Accuracy; SEN – Sensitivity; SPE – Specificity; AUC – Area under receiver operating curve; SCZ –Schizophrenia patients; HC – Healthy controls; PS – Schizophrenia patients with predominant positive symptoms; NS – Schizophrenia patients with predominant negative symptoms

Positive symptoms versus negative symptoms (experiment-2)

For the classification of PS and NS groups [Table 3B], the 10-fold method using medium Gaussian SVM classified the conventionally grouped patients most accurately (accuracy, 89.29%; sensitivity, 100%; specificity, 70%; AUC, 85%). Subsequent region-specific analysis revealed that delta activity-related features in the left IFG were most accurate (accuracy, 89.29%; sensitivity, 88.89%; specificity, 90%; AUC, 89.44%), followed by those in the right IFG (accuracy, 82.14%; sensitivity, 83.33%; specificity, 80%; AUC, 81.67%).

DISCUSSION

AI-based interpretation of brain activity allows for a wide range of interesting applications for man-machine interactions. The recently developed neuroprosthetic exoskeleton, which uses brain signals to enable a tetraplegic patient to move,[32] is an extreme advancement that highlights its importance to neuroscience. As introduced, more essential for psychiatry is a valid and objective classification of “disease” entities. The emphasis has been on gathering brain signals of SCZ patients and translating them into tools that can identify “disorder” from “normal,” or classify sub-groups accurately, without referring to any subjective diagnostic criteria. Although much more work needs to be done from genetic, physiological, and clinico-social standpoints in “validating” the diagnosis of SCZ, the application of AI- or ML-based models to various clinical parameters represents a small yet significant futuristic step towards that goal.

This study, a first from India applying sML-based methods to high-density EEG data, demonstrates that this technique can distinguish SCZ probands from healthy controls and also classify SCZ patients with positive symptoms from those with negative symptoms, with optimum accuracies. The sML tool used in our study, i.e., the kernel SVM method, which avoids over-fitting and hence has better generalization capabilities, has been used in earlier SCZ classification studies as well.[19,21,33,34,35] The accuracy of 78.95%, although comparable, is lower than those reported by these studies, where the rates range from 81% to 91%. Some of these studies have additionally used top-up models, such as adaptive boosting,[33,34] add-on features such as complexity,[33] or source-level features in addition to sensor-level ones.[19] However, some studies have used very few features.[35] Apart from the use of SVM for SCZ classification, other ML methods used in previous studies are the Stockwell transform[20] and deep learning models.[36]

Ours is the first study that attempted to sub-classify SCZ probands using EEG and sML methods. Moreover, we found that the features selected could classify PS and NS subgroups fairly accurately (89.29%). Notably, they classified the subgroups with 100% sensitivity. Such perfect measures have been linked to the Stockwell transform ML tool.[20] This particular finding suggests that SVM models might be better used for sub-classification of SCZ, which consequently helps in reducing the heterogeneity within SCZ.

More importantly, this is the first-ever study to use data recorded using 256 channels. This allowed us to select ROIs, which has so far been possible only in neuroimaging studies using MRI data.[37,38] Our results showed that right IFG > left DLPFC > right STG features classified SCZ from healthy controls the best. Using sequential ROI selection on structural MRI data, Chin et al.[38] identified 7 ROIs as the optimal discriminatory subset, of which only the superior-, middle- and frontal gyrus and the STG are cortical regions that can be anatomically projected from EEG data. The ROIs found to be the best classifiers in our study precisely correspond to these regions. Further, we show that left IFG > right IFG were the regions with the most accurate features for classifying PS and NS sub-groups.

Another interesting finding is the way low-frequency and high-frequency EEG activities characterize the two experiments. Predominantly high-frequency-related features predict the classification of SCZ versus healthy controls, whereas predominantly low-frequency-related features predict the classification of the PS and NS subgroups. This finding is important for the application of ML-based tools to functional connectivity measures.[39] While low-frequency activity implies long-range connectivity, high-frequency activity underpins local or short-range connectivity.[40] Hence, our results suggest specifically investigating local connectivity measures for classifying SCZ from healthy controls and long-range connectivity measures for classifying SCZ subgroups.

Strengths and limitations

The obvious strength of the study is the use of high-density EEG data, which allowed features to be selected for predefined ROIs. The use of resting-state data makes the study easier to reproduce. The use of an array of methods for feature selection and classification is another highlight. Although fairly adequate for the classification of SCZ from healthy controls, the sample size used for sub-classification is limited. Furthermore, the unequal distribution of samples in the second experiment might have contributed to the low and inconsistent specificity scores. The current study used an existing dataset and hence had to apply the operational criteria for sub-classifying patients retrospectively. Fresh recruitment of SCZ subgroups is suggested for future studies. Another important limitation of the study is the confound of multiple comparisons. We did not apply a correction for multiple comparisons in the “statistical significance analysis” because, even assuming a high (i.e., 50%) false discovery rate in the Benjamini–Hochberg correction method, which is recommended for analyses with a very high number of comparisons (such as 480 in our case), none of the variables in experiment-1 and only one variable in experiment-2 that were statistically significant at P < 0.05 survived the correction.[41] Hence, the possibility that false-positive features were included in the ML classification cannot be ruled out. Finally, the results of the study might only be generalized to acutely symptomatic and moderately ill male SCZ patients.
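As an illustration of the Benjamini–Hochberg procedure referred to above, the following is a minimal Python sketch, not the code used in the study. The function names (bh_threshold, benjamini_hochberg) and the example P values are hypothetical and chosen only for illustration. The procedure ranks the P values from smallest to largest, compares the P value at each rank with (rank / number of tests) × false discovery rate, and declares significant every P value up to the largest rank that passes.

```python
def bh_threshold(rank, n_tests, fdr):
    """Critical value (rank / n_tests) * fdr for the p-value at a given rank."""
    return rank / n_tests * fdr


def benjamini_hochberg(p_values, fdr):
    """Return one boolean per p-value: True if it survives the BH procedure."""
    n_tests = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p.
    order = sorted(range(n_tests), key=lambda i: p_values[i])
    # Step-up rule: find the largest rank whose p-value is at or below its threshold.
    largest_passing_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= bh_threshold(rank, n_tests, fdr):
            largest_passing_rank = rank
    # Every p-value ranked at or below that rank is declared significant.
    survives = [False] * n_tests
    for rank, idx in enumerate(order, start=1):
        survives[idx] = rank <= largest_passing_rank
    return survives


# Hypothetical p-values, for illustration only (not taken from the study).
example_p = [0.041, 0.001, 0.212, 0.008, 0.039, 0.205, 0.042, 0.060, 0.074, 0.216]
print(benjamini_hochberg(example_p, fdr=0.05))
print(benjamini_hochberg(example_p, fdr=0.25))
# Threshold for the single smallest of 480 p-values at a 50% false discovery rate.
print(bh_threshold(1, 480, 0.5))
```

With 480 comparisons and a 50% false discovery rate, the threshold at rank 1 is 0.5/480, or roughly 0.001, and a P value of 0.04 can survive only if at least 39 P values fall at or below their rank thresholds up to that point. This shows why features that are nominally significant at P < 0.05 may still fail the correction when the number of comparisons is very large.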

CONCLUSIONS

We conclude that SVM-based classification and sub-classification of SCZ using EEG data is optimal and might help in improving the “validity” and reducing the “heterogeneity” in the diagnosis of SCZ. High- and low-frequency-related features accurately classify SCZ from healthy controls and the PS from the NS subgroup of SCZ, respectively. Moreover, the ROI contributing the most accurate features is the IFG in both classification tasks. Caution needs to be exercised while generalizing these results, as they may be limited to acutely symptomatic and moderately ill male SCZ patients.

Financial support and sponsorship

Data used in this study were retrieved from the DBT-INCRE fellowship project funded by the Department of Biotechnology (DBT), Ministry of Science and Technology, Government of India. The Fellowship was awarded to SKT. All India Institute of Medical Sciences, Raipur, and National Institute of Technology, Raipur, have signed a Memorandum of Understanding for Research Collaboration.

Conflicts of interest

There are no conflicts of interest.

Supplementary Figure 1

Scheme for extraction of electroencephalographic bands using wavelet decomposition and feature extraction


REFERENCES

1. Harvey PD, Heaton RK, Carpenter WT Jr, Green MF, Gold JM, Schoenbaum M. Diagnosis of schizophrenia: Consistency across information sources and stability of the condition. Schizophr Res. 2012;140:9–14. doi: 10.1016/j.schres.2012.03.026.
2. Jablensky A. Psychiatric classifications: Validity and utility. World Psychiatry. 2016;15:26–31. doi: 10.1002/wps.20284.
3. Bleuler E. Dementia Praecox or the Group of Schizophrenias. Oxford, UK: International Universities Press; 1950.
4. Wang EW. The Collected Schizophrenias. Minneapolis, USA: Graywolf Press; 2019.
5. Andreasen NC, Olsen S. Negative v positive schizophrenia. Definition and validation. Arch Gen Psychiatry. 1982;39:789–94. doi: 10.1001/archpsyc.1982.04290070025006.
6. Keller WR, Fischer BA, Carpenter WT Jr. Revisiting the diagnosis of schizophrenia: Where have we been and where are we going? CNS Neurosci Ther. 2011;17:83–8. doi: 10.1111/j.1755-5949.2010.00229.x.
7. Lefort-Besnard J, Varoquaux G, Derntl B, Gruber O, Aleman A, Jardri R, et al. Patterns of schizophrenia symptoms: Hidden structure in the PANSS questionnaire. Transl Psychiatry. 2018;8:237. doi: 10.1038/s41398-018-0294-4.
8. Buckley PF, Miller BJ. Personalized medicine for schizophrenia. NPJ Schizophr. 2017;3:2. doi: 10.1038/s41537-016-0001-5.
9. Başar E, Başar-Eroǧlu C, Güntekin B, Yener GG. Brain's alpha, beta, gamma, delta, and theta oscillations in neuropsychiatric diseases: Proposal for biomarker strategies. Suppl Clin Neurophysiol. 2013;62:19–54. doi: 10.1016/b978-0-7020-5307-8.00002-8.
10. Newson JJ, Thiagarajan TC. EEG frequency bands in psychiatric disorders: A review of resting state studies. Front Hum Neurosci. 2018;12:521. doi: 10.3389/fnhum.2018.00521.
11. Tikka SK, Garg S, Sinha VK, Nizamie SH, Goyal N. Resting state dense array gamma oscillatory activity as a response marker for cerebellar-repetitive transcranial magnetic stimulation (rTMS) in schizophrenia. J ECT. 2015;31:258–62. doi: 10.1097/YCT.0000000000000242.
12. Tikka SK, Nizamie SH, Das AK, Agarwal N, Goyal N. Schneiderian first rank symptoms in schizophrenia: A developmental neuroscience evaluation. Int J Dev Neurosci. 2016;50:39–46. doi: 10.1016/j.ijdevneu.2016.02.001.
13. John JP, Rangaswamy M, Thennarasu K, Khanna S, Nagaraj RB, Mukundan CR, et al. EEG power spectra differentiate positive and negative subgroups in neuroleptic-naive schizophrenia patients. J Neuropsychiatry Clin Neurosci. 2009;21:160–72. doi: 10.1176/jnp.2009.21.2.160.
14. Mitra S, Nizamie SH, Goyal N, Tikka SK. Electroencephalogram alpha-to-theta ratio over left fronto-temporal region correlates with negative symptoms in schizophrenia. Asian J Psychiatr. 2017;26:70–6. doi: 10.1016/j.ajp.2017.01.013.
15. Baradits M, Kakuszi B, Bálint S, Fullajtár M, Mód L, Bitter I, et al. Alterations in resting-state gamma activity in patients with schizophrenia: A high-density EEG study. Eur Arch Psychiatry Clin Neurosci. 2019;269:429–37. doi: 10.1007/s00406-018-0889-z.
16. Carbonell JG, Michalski RS, Mitchell TM, editors. Machine Learning: An Artificial Intelligence Approach. Vol. 1. Burlington, USA: Morgan Kaufmann Publishers Inc; 1983. An overview of machine learning; pp. 3–24.
17. Tandon N, Tandon R. Will machine learning enable us to finally cut the Gordian knot of schizophrenia. Schizophr Bull. 2018;44:939–41. doi: 10.1093/schbul/sby101.
18. Tandon N, Tandon R. Using machine learning to explain the heterogeneity of schizophrenia. Realizing the promise and avoiding the hype. Schizophr Res. 2019;214:70–5. doi: 10.1016/j.schres.2019.08.032.
19. Shim M, Hwang HJ, Kim DW, Lee SH, Im CH. Machine-learning-based diagnosis of schizophrenia using combined sensor-level and source-level EEG features. Schizophr Res. 2016;176:314–9. doi: 10.1016/j.schres.2016.05.007.
20. Dvey-Aharon Z, Fogelson N, Peled A, Intrator N. Schizophrenia detection and classification by advanced analysis of EEG recordings using a single electrode approach. PLoS One. 2015;10:e0123033. doi: 10.1371/journal.pone.0123033.
21. Johannesen JK, Bi J, Jiang R, Kenney JG, Chen CA. Machine learning identification of EEG features predicting working memory performance in schizophrenia and healthy adults. Neuropsychiatr Electrophysiol. 2016;2:3. doi: 10.1186/s40810-016-0017-0.
22. Khodayari-Rostamabad A, Hasey GM, Maccrimmon DJ, Reilly JP, de Bruin H. A pilot study to determine whether machine learning methodologies using pre-treatment electroencephalography can predict the symptomatic response to clozapine therapy. Clin Neurophysiol. 2010;121:1998–2006. doi: 10.1016/j.clinph.2010.05.009.
23. World Health Organization. The ICD-10 Classification of Mental and Behavioral Disorders: Diagnostic Criteria for Research. Geneva: World Health Organization; 1992.
24. Andreasen NC, Carpenter WT Jr, Kane JM, Lasser RA, Marder SR, Weinberger DR. Remission in schizophrenia: Proposed criteria and rationale for consensus. Am J Psychiatry. 2005;162:441–9. doi: 10.1176/appi.ajp.162.3.441.
25. Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr Bull. 1987;13:261–76. doi: 10.1093/schbul/13.2.261.
26. Mandal MK, Pandey G, Singh KS, Asthana SH. Hand preference in India. Int J Psychol. 1992;27:433–42.
27. Goldberg DP, William P. A user Guide to General Health Questionnaire. Windsor: NFER-Nelson; 1998.
28. Delorme A, Makeig S. EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods. 2004;134:9–21. doi: 10.1016/j.jneumeth.2003.10.009.
29. Koessler L, Maillard L, Benhadid A, Vignal JP, Felblinger J, Vespignani H, et al. Automated cortical projection of EEG sensors: Anatomical correlation via the international 10-10 system. Neuroimage. 2009;46:64–72. doi: 10.1016/j.neuroimage.2009.02.006.
30. Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20:273–97.
31. Tandon N, Tandon R. Machine learning in psychiatry- standards and guidelines. Asian J Psychiatr. 2019;44:A1–4. doi: 10.1016/j.ajp.2019.09.009.
32. Benabid AL, Costecalde T, Eliseyev A, Charvet G, Verney A, Karakas S, et al. An exoskeleton controlled by an epidural wireless brain-machine interface in a tetraplegic patient: A proof-of-concept demonstration. Lancet Neurol. 2019;18:1112–22. doi: 10.1016/S1474-4422(19)30321-7.
33. Sabeti M, Katebi S, Boostani R. Entropy and complexity measures for EEG signal classification of schizophrenic and control participants. Artif Intell Med. 2009;47:263–74. doi: 10.1016/j.artmed.2009.03.003.
34. Sabeti M, Katebi S, Boostani R, Price GW. A new approach for EEG signal classification of schizophrenic and control participants. Expert Syst Appl. 2011;38:2063–71.
35. Hiesh MH, Lam Andy YY, Shen CP, Chen W, Lin FS, Sung HY, et al. Classification of schizophrenia using genetic algorithm-support vector machine (GA-SVM). Conf Proc IEEE Eng Med Biol Soc. 2013;2013:6047–50. doi: 10.1109/EMBC.2013.6610931.
36. Durstewitz D, Koppe G, Meyer-Lindenberg A. Deep neural networks in psychiatry. Mol Psychiatry. 2019;24:1583–98. doi: 10.1038/s41380-019-0365-9.
37. Lu X, Yang Y, Wu F, Gao M, Xu Y, Zhang Y, et al. Discriminative analysis of schizophrenia using support vector machine and recursive feature elimination on structural MRI images. Medicine (Baltimore). 2016;95:e3973. doi: 10.1097/MD.0000000000003973.
38. Chin R, You AX, Meng F, Zhou J, Sim K. Recognition of schizophrenia with regularized support vector machine and sequential region of interest selection using structural magnetic resonance imaging. Sci Rep. 2018;8:13858. doi: 10.1038/s41598-018-32290-9.
39. Du Y, Fu Z, Calhoun VD. Classification and prediction of brain disorders using functional connectivity: promising but challenging. Front Neurosci. 2018;12:525. doi: 10.3389/fnins.2018.00525.
40. Moran LV, Hong LE. High vs. low frequency neural oscillations in schizophrenia. Schizophr Bull. 2011;37:659–63. doi: 10.1093/schbul/sbr056.
41. McDonald JH. Handbook of Biological Statistics. 3rd ed. Baltimore, Maryland, USA: Sparky House Publishing; 2014. Multiple comparisons; pp. 254–60.

