Skip to main content
Frontiers in Psychiatry logoLink to Frontiers in Psychiatry
. 2021 Apr 15;12:655292. doi: 10.3389/fpsyt.2021.655292

Deep Neural Network to Differentiate Brain Activity Between Patients With First-Episode Schizophrenia and Healthy Individuals: A Multi-Channel Near Infrared Spectroscopy Study

Po-Han Chou 1,2,3,*, Yun-Han Yao 4,5, Rui-Xuan Zheng 4,5, Yi-Long Liou 4,5, Tsung-Te Liu 5, Hsien-Yuan Lane 6,7,8, Albert C Yang 9,10,11, Shao-Cheng Wang 12,13,14,*
PMCID: PMC8081971  PMID: 33935840

Abstract

Backgrounds: Reduced brain cortical activity over the frontotemporal regions measured by near infrared spectroscopy (NIRS) has been reported in patients with first-episode schizophrenia (FES). This study aimed to differentiate between patients with FES and healthy controls (HCs) on basis of the frontotemporal activity measured by NIRS with a support vector machine (SVM) and deep neural network (DNN) classifier. In addition, we compared the accuracy of performance of SVM and DNN.

Methods: In total, 33 FES patients and 34 HCs were recruited. Their brain cortical activities were measured using NIRS while performing letter and category versions of verbal fluency tests (VFTs). The integral and centroid values of brain cortical activity in the bilateral frontotemporal regions during the VFTs were selected as features in SVM and DNN classifier.

Results: Compared to HCs, FES patients displayed reduced brain cortical activity over the bilateral frontotemporal regions during both types of VFTs. Regarding the classifier performance, SVM reached an accuracy of 68.6%, sensitivity of 70.1%, and specificity of 64.6%, while DNN reached an accuracy of 79.7%, sensitivity of 88.8%, and specificity of 74.9% in the classification of FES patients and HCs.

Conclusions: Compared to findings of previous structural neuroimaging studies, we found that using DNN to measure the NIRS signals during the VFTs to differentiate between FES patients and HCs could achieve a higher accuracy, indicating that NIRS can be used as a potential marker to classify FES patients from HCs. Future additional independent datasets are needed to confirm the validity of our model.

Keywords: deep neural network, near infrared spectroscopy, schizophrenia, machine learning, fNIRS, deep learning

Introduction

Schizophrenia (SZ) is a chronic psychiatric disorder characterized by psychotic symptoms, negative symptoms, and cognitive deficits and poses considerable burdens to society (1). Therefore, accurate diagnosis and early intervention are critical (2, 3). In clinical practice, schizophrenia is diagnosed by clinicians using diagnostic criteria from the Diagnostic and Statistical Manual of Mental Disorders (DSM-5), based on patient reports of symptoms, observation of behavior and functional changes; however, traditional clinical practice might be confounded because patients with SZ may deny their symptoms, and even experienced psychiatrists may have difficulty differentiating SZ from other mental illnesses (i.e., psychotic bipolar disorder) owing to similar symptomologies at acute stage (4).

To overcome these limitations of clinical interviews-based diagnosis of psychiatric disorders, many studies have attempted to develop objective biomarkers that can improve the accuracy of diagnosis and the ability to predict a patient's response to treatment and prognosis. Among a variety of neuroimaging modalities, functional near infrared spectroscopy (fNIRS) is a functional neuroimaging tool that measures the spatio-temporal neural activity of the brain non-invasively. Compared to existing imaging techniques, such as positron emission tomography (PET), single-photon emission computed tomography (SPECT), and magnetic resonance image (MRI), fNIRS is easier to administer, low-cost, and provides fair temporal and spatial resolutions (5). Many previous fNIRS studies reported reduced brain activity over the bilateral frontotemporal regions during various cognitive tasks in patients with SZ compared to controls [reviewed by Koike et al. (6) and Chou et al. (7)].

Recently, many studies have attempted to accurately classify patients with heterogeneous mental disorders. For instance, several studies used machine learning (ML) methods to accurately differentiate patients with SZ and healthy individuals with structural or functional neuroimaging tools and showed promising results (8). ML methods are capable of representing latent features of structural or functional changes in the brain, and this allows for better representation of SZ-related processes. Among ML methods, support vector machines (SVMs) are mostly adopted. SVM is an ML method which estimates a hyperplane with an optimal margin that could provide the best separation between two classes, which is determined by the maximum distance from any data point. Once defined, this hyperplane is used to classify the data (8, 9).

Recently, deep learning (DL) methodology such as deep neural network (DNN) has significantly improved the representation learning and classification in various areas such as speech recognition, natural image classification, and text mining (9). Two main features have made DNN unique compared to SVM. First, DNN is capable of data-driven automatic feature learning, which enables to remove the subjectivity in selecting the relevant features when there are too many features or no prior knowledge in selecting features. Second, by applying a hierarchy of non-linear layers, DNN can analyze complicated data patterns (8). Recently, DL methods have been applied in medical image analyses with promising results, including characterizing patterns of brain imaging data in patients with neurocognitive disorders (1013) and schizophrenia (9, 14). However, most previous studies analyzed MRI data.

In the present work, we aimed to discriminate between patients with first-episode schizophrenia (FES) and healthy controls (HCs) on the basis of brain cortical activity during a verbal fluency test (VFT) measured using NIRS. We focused on the bilateral frontotemporal regions. We compared classification accuracies for two different machine learning methods: SVM and DNN. To the best of our knowledge, this is the first study using deep learning to automatically differentiate FES from HC based on brain cortical activity features.

Materials and Methods

Study Subjects

A total of 33 patients with FES (18 men and 15 women; mean age [SD] =29.1 [6.4] years) were recruited at the Department of Psychiatry in Taichung Veterans General Hospital. Patients who fulfilled the criteria for SZ listed in the DSM-5 were recruited and the diagnoses were validated using the Mini International Neuropsychiatric Interview (MINI) (15) by board-certified psychiatrists (P.H.C). All patients were experiencing their first episode of psychosis and had received no more than 12 weeks of previous antipsychotic medication (16, 17). Thirty-four HCs (17 men and 17 women; mean age [SD] = 28.2 [9.9] years) were recruited and screened using the MINI. All study participants were right-handed, which was assessed by the Edinburgh Inventory (18). Other characteristics such as education level, VFT performance of study subjects, as well as the age of onset and duration of illness of FES patients were also recorded. Subjects were excluded if they had a history of substance abuse or dependence, intellectual disability, neurological disorders, or a medical condition that may affect brain function. This study complied with the Declaration of Helsinki, and all participants received a complete explanation of the study and provided written informed consent. This study was approved by the Institutional Review Board of Taichung Veterans General Hospital (approval No. CF13044).

Clinical Assessments

We used the Positive and Negative Syndrome Scale (PANSS) (19) to evaluate psychiatric symptoms of the FES patients on the same day as the NIRS measurements. Patient antipsychotic doses are presented as chlorpromazine-equivalent doses (20, 21).

Verbal Fluency Test

Patients received 160-s block-design VFTs (both letter and category version) which has been adopted in many previous fNIRS studies (17, 2226). There were three different time periods for the VFT: a 30-s pre-task period, a 60-s task period, and a 70-s post-task period. In the pre- and post-task periods, patients were asked to repeatedly count from one to five to control for and remove task-related motion artifacts. For the 60-s task period, study participants were instructed to say words that started with a phonological syllable presented by NIRS machine. In the letter fluency test (LFT), there three continuous 20-s sub-periods, which were initiated by a single Chinese syllable selected from nine options (first, /ㄅ(b)/, /ㄆ(p)/, or /ㄉ(d)/; second, /ㄊ(t)/, /ㄌ(l)/, or /ㄋ(n)/; third, /ㄇ(m)/, /ㄈ(f)/,or /ㄘ(dz)/). We chose these syllables based on their frequencies at the beginning of Chinese words. For the category fluency test (CFT), subjects were asked to produce as many words based on a given semantic cue for 20 s each (first: “fish,” “birds,” or “insects”; second: “sweets,” “vegetables,” or “fruits”; third: “vehicles,” “home appliances,” or “stationery items,”). Before beginning each task session, subjects were instructed on how to generate correct answers for VFTs. Each subject practiced three times to ensure that they understood the tests.

NIRS Instrument

A 52-channel NIRS instrument (ETG-4000; Hitachi Medical Co., Tokyo, Japan) was used to measure changes in concentrations of oxygenated hemoglobin [oxy-Hb] of the brain in the present study. The NIRS probe attachments are thermoplastic 3 × 11 shells set, comprising 52 channels (Figure 1). The lowest probe line was set along the Fp1–Fp2 line, as defined by the international 10–20 system used in electroencephalography. The NIRS instrument measures changes in both [oxy-Hb] and [deoxy-Hb] (in mM) using two wavelengths (695 and 830 nm) of near-infrared light. The calculations were based on the Beer–Lambert law (27). We recorded the changes of [oxy-Hb] from baseline to the activation period and relative changes in [oxy-Hb] assessed with units of mM·mm. The data sampling rate for NIRS instrument was 0.1 s. A moving average methodology using a 5-s window width was applied and any motion artifacts were automatically detected and rejected by the machine (28).

Figure 1.

Figure 1

Probe setting and measurement points for 52-channel NIRS. (A) The localizations of channels are set based on the international 10–20 electroencephalography system. Red and blue circles, respectively indicate the emitter and detector of the near-infrared light. (B) Probes with thermoplastic 3 × 11 shells are placed over the bilateral frontotemporal area. (C) Two regions of interest (Regions 1 and 2) in the present study. Region 1 consists of 11 channels (ch 25–28, ch 36–38 and ch 46–49); Region 2 consists of 20 channels; Right: (ch 22–24, ch 32–35 and ch 43–45); Left: (ch 29–31, ch 39–42 and ch 50–52).

The spatial information for each channel was estimated by using data from the Functional Brain Science Laboratory at Chuo University in Japan (29) based on the LONI Probabilistic Brain Atlas (LPBA40) (30). Because previous study indicated that [oxy-Hb] had stronger correlations with fMRI blood-oxygenation level-dependent signals (31), we used it as an indicator of brain cortical activity.

NIRS Signals and Feature Selection

Similar to Takizawa et al.'s study (32), two regions of interest (ROIs) were selected (Figure 1): the frontal region (R1, 11 channels) and the bilateral temporal region (R2, 20 channels). The changes in [oxy-Hb] and [deoxy-Hb] in the channels of these two respective regions of interest were averaged and transformed into representative “Region 1 (R1)” and “Region 2 (R2)” NIRS signals for each individual. According to the LBPA40 (30), the “Region 1 (R1)” NIRS signal consisted of signals from channels located approximately in the fronto-polar and dorsolateral prefrontal cortical regions, while the “Region 2 (R2)” NIRS signal consisted of signals from channels located approximately in the bilateral ventro-lateral prefrontal cortex and the superior and middle temporal cortical regions.

Two visual indices, integral and centroid value, of the bilateral frontotemporal regions during LFT and CFT (Figure 2) were generated automatically from the NIRS machine by evaluating the hemodynamic changes in [oxy-Hb] of the 10-s pre-task, 60-s task, and 55-s post-task period from the original 160-s VFTs. Details regarding the definition of integral and centroid value can be found elsewhere (32). In brief, integral value was calculated using the hemodynamic response of [oxy-Hb] during the 60-s activation task period by averaging the signal from channels within each region; the centroid value is an index of time-course changes throughout the VFT, with periods representing the timing of the hemodynamic response. The centroid value is indicated by the time shown with a perpendicular line from the centroid of the [oxy-Hb] signal change area during the entire task periods [from 0 (s) to 125 (s) [= 10 (s) + 60 (s) + 55 (s)]]; the integral value describes the size of the hemodynamic response during the 60-s test period (32). Therefore, a total of eight datasets were collected (integral and centroid values of R1 and R2, during an LFT and CFT, respectively).

Figure 2.

Figure 2

Typical time-course pattern of NIRS signals during the VFT. The “centroid value” is defined by a perpendicular line from the centroid of the brain activation area during the task period (A). The “integral value (Area B)” indicated the brain activity during the test (B); Oxy-Hb, oxygenated hemoglobin; deoxy-Hb, deoxygenated hemoglobin.

Deep Neural Network

A DNN was utilized as a classifier to discriminate the patients with schizophrenia from healthy control (HC). The network had eight features as inputs (which included the integral and centroid values of the frontal and temporal regions during the two types of VFTs. in the NIRS signal) (Figure 3). The topology of this classifier is a fully connected, four-layer, feedforward network, which comprises two hidden layers with 512 neurons for each layer. The activation function of all neurons in the network is the rectified linear unit (ReLU) function. The network outputs 2 indices in the last layer that the index with larger value indicates positive (FES) or negative (HC).

Figure 3.

Figure 3

Time courses of the hemodynamic responses of [oxy-Hb] in Region 1 (R1) and Region 2 (R2) in FES and HC groups. (A) and (B) show the time courses of the hemodynamic responses in R1 (frontal region) and R2 (temporal region), respectively.

Training of DNN

This DNN was trained with supervised learning since labeled data (FES or HC) were given. During training, stochastic gradient descent (SGD) was employed for optimizing the parameters; error gradients were propagated backwards through layers, which was backpropagation. Each parameter in the network was randomly initialized and adjusted according to its corresponding gradient to loss to minimize the error between the predicted results and the labeled data. In addition, the dropout technique was incorporated to avoid overfitting, and the dropout rate was 0.18. Here a criterion was set that the learning stage was stopped when the value of cost function changes little through epochs, and according to experiences, the learning duration was expected to be 100 to 300 epochs.

In the training procedure, cross-validation was required since the performance of the DNN was evaluated here by validation accuracy, practically, which was shuffling the 67-example dataset (33 FES patients and 34 HCs) at first and then dividing it into seven groups. In each turn of cross-validation, one of the groups was used as the validation set and the other six groups were training sets, and the validation set contained 10 samples, and the training set the other 57 samples.

Support Vector Machines

In the present study, Support Vector Machine (SVM) was used to compare the performances with which by deep learning. In machine learning, SVM is a supervised-learning method that learns model from labeled training data, and has been used for classification of patients with different psychiatric diseases (33). The SVM methodology has been detailed elsewhere (34). Given a training dataset for classification, the SVM algorithm optimizes for the support vectors that is a subset of training data and represents a hyperplane dividing the training data into their labeled categories with gaps as wide as possible. The prediction is then made by evaluating the decision function with test data as input and support vectors as parameters. A model of SVM with hyper parameter c = 1.0 and with radial basis function (RBF) kernel (γ = 1/Nt, Nt: the number of training examples) is built to run the SVM algorithm. The formation of the input data and processing of cross-validation are exactly the same to ensure consistency and fair comparisons.

Statistical Analysis

Firstly, Kolmogorov–Smirnov or Shapiro-Wilk tests were used to examine the distribution of the data. Basic characteristics in each group were compared using Student's t-tests for continuous variables and X2 test for categorical variables. When the data was not normally distributed, a non-parametrical analysis, that is, Spearman's rho was used to examine correlations and the Mann–Whitney U-test was used to compare means. Otherwise, the t-test was used to compare means and Pearson's correlation coefficient was used to examine correlations. P-value < 0.05 was defined statistically significant. All statistical analyses were performed using STATA version 15.1. DNN and SVM were performed using Python with open source library packages including Keras, scikit-learn, and TensorFlow.

Results

Demographic Characteristics

The study participants' demographic characteristics are presented in Table 1. There were no significant differences between the HC and FES groups in terms of age, sex, or education. However, the HC group had significantly better performance on the LFT (HC group, mean= 14.0, SD = 0.8; FES group, mean = 9.7, SD=0.8, P < 0.001) and CFT (HC group, mean = 17.5, SD = 0.8; FES group, mean = 12.1, SD = 0.7, P < 0.001) compared to the FES group.

Table 1.

Characteristics of study participants.

FES (N = 33) HC (N = 34) Statistics/analyses P-value
Age 29.1 (6.4) 28.2 (9.9) T = 0.42 0.68
Education (graduate/undergraduate/high school degrees) 7/18/8 3/27/4 X2 test 0.09
Right handed 33 33 X2 test 1
Gender(M/F) (18/15) (17/17) X2 test 0.71
LFT performance 9.7 (0.8) 14.0 (0.8) t = −3.80 P < 0.001
CFT performance 12.1 (0.7) 17.5 (0.8) t = −5.03 P < 0.001
Onset age 27.2 (6.1)
DOI (week) 102.8 (126.5)
PANSS
Positive 16.8 (5.3)
Negative 17.5 (5.5)
General psychopathology 33.9 (7.6)
Total 68.1 (14.7)
antipsychotics 426.9 (236.1)

FES, first-episode schizophrenia; HC, healthy control; LFT, letter version of verbal fluency test; CFT, category version of verbal fluency test; DOI, duration of illness; PANSS, positive and negative symptom scale.

Comparison of Hemodynamic Response of ROIs Across Clinical Groups

As shown in Table 2, during the LFT, significantly smaller integral values of [oxy-Hb] in the SZ than the HC group (R1: P < 0.001, t = 3.859; R2: P = 0.003, t = 3.047) were noted. On the other hand, there were no significant differences between two groups with regard to centroid values in both regions (R1: P = 0.667, t = −0.433; R2: P = 0.138, t = −1.515). During the CFT, smaller integral values of [oxy-Hb] in the SZ than the HC group were noted in both regions (R1: P = 0.015, t = 2.507; R2: P = 0.006, t = 2.845), and no significantly different centroid values between the two groups (R1: P = 0.528, t = −0.635; R2: P = 0.796, t = −0.259).

Table 2.

Comparison of frontal or temporal integral and centroid value of NIRS signals between FES and HC groupsa.

Frontal region (R1) Temporal region (R2)
Integral P-value Centroid P-value Integral P-value Centroid P-value
LFT
HC group 131.9 (12.0) 0.0003 56.5 (1.4) 0.6633 204.7 (17.4) 0.0033 56.8 (0.7) 0.1299
FES group 47.6 (18.4) 57.8 (2.8) 114.9 (23.9) 60.7 (2.4)
CFT
HC group 82.5 (12.8) 0.0147 57.2 (1.8) 0.5279 178.7 (17.8) 0.0059 60.7 (1.0) 0.7950
FES group 36.3 (13.3) 59.3 (2.8) 100.2 (21.2) 61.2 (1.6)
a

The unit for NIRS signal is (mM.mm). Statistical significance was marked with bold character.

FES, first-episode schizophrenia; HC, healthy control; LFT, Letter version of Verbal Fluency Test; CFT, Category version of Verbal fluency Test.

Classification Performance of DNN and SVM

In DNN, the topology of the network is determined by the experiments on network with different number of hidden layers and different number of neurons per layer, as shown in Table 3. A larger or deeper network generally performs better but harder to train. According to the results from the experiments, a 4-hidden-layer and 512-neurons-per-layer neural network was selected in the present study.

Table 3.

Nnetwork topology demonstrating comparison accuracy (%) of DNN.

graphic file with name fpsyt-12-655292-i0001.jpg 64 128 256 512 1,024
1 55.7 58.2 63.6 62.0 69.4
2 61.4 65.1 74.2 79.1 78.8
3 60.2 65.1 76.0 78.0 79.1
4 61.7 66.5 77.1 79.7 79.4
5 51.7 63.4 72.9 79.1 78.6

DNN, deep neural network. Bold value indicated the best accuracy of performance.

To reduce the effect of randomness, the cross-validation accuracy in each training set group is the average of the accuracies obtained by training and testing the network with different initializations five times. Therefore, the classification accuracy of DNN was 79.7%, sensitivity of 88.8%, and specificity of 74.9%. On the other hand, the result of classification accuracy using the eight features analyzed by SVM was 68.6%, sensitivity of 70.1.8%, and specificity of 64.6%.

Correlational Analyses

During the LFT, there was a significant negative correlation between R1 integral values and PANSS general psychopathology score (rho = −0.371, P = 0.034). In addition, there were significant negative associations between R2 integral values and PANSS negative (rho = −0.551, P = 0.001) and general psychopathology scores (rho = −0.433, P = 0.012). With regard to CFT, there was a significant positive correlation between R1 integral values and antipsychotic dosage (rho = 0.403, P = 0.020). In addition, there was significantly negative associations between R2 integral values and PANSS general psychopathology scores (rho = −0.501, P = 0.003).

Discussion

To our knowledge, this study is the first to evaluate the classification performance of artificial intelligence to distinguish patients with FES and HCs using NIRS signals. In the present study, we employed SVM and DNN methods to automatically differentiate FES patients from HCs. The main findings can be summarized as follows. (1) We reached a fair discrimination accuracy using SVM on integral and centroid values of R1 and R2 during both types of VFTs (68.6%). (2) DNN achieved modestly higher predictive performance than the SVM approach (79.7%). (3) Compared to HCs, there was decreased cortical activity in FES patients during the LFT but not the CFT, indicating that deficits in cortical activity during phonemic processing may occur early in the course of SZ.

Comparison of Classification Performance Between DNN and SVM

In the present study, we found classification accuracy of DNN is better than SVM, which is consistent with many previous MRI studies demonstrating superiority of DNN over SVM (9, 14, 35). SVM, a shallow-structured architecture, are effective in solving many simple or well-constrained problems. However, several recent studies have demonstrated the benefits of using deep structures. DNN may be more robust in the wide variety of functions that can be parameterized by composing weakly non-linear transformations. DNN allows a system input to be compositing from raw data, thus allowing automatic discovery of the representations required for machine learning tasks (36). Finally, the appeal of hierarchical representations and the potential for combining unsupervised and supervised methods also contribute to the use of deep neural networks (9). However, in this study, we did not explore all possible deep learning advantages, such as the use of input data without feature extraction. Instead, we selected the features generated by NIRS machine. Nevertheless, our results showed that when using NIRS signals, the DNN-based model can achieve better classification performance than SVM model.

Comparison With Previous fNIRS/MRI Studies Using Deep Learning or Machine Learning

Until now, there have been few NIRS studies using ML or DL method to classify patients with SZ and healthy individuals (37, 38). Li et al. (37) recruited a large sample of 120 SZ patients and 120 HCs and measured the hemoglobin response in the prefrontal cortex during the VFT using a multichannel NIRS instrument. They used PCA-based feature selection for data extracted from three types of NIRS data in each channel, and they achieved a maximum accuracy of 85.83% and an overall mean accuracy of 83.37% using SVM classifier. Yang et al. (38) measured the functional connectivity strength (FCS) as features derived from an individual channel during the VFT in 100 patients with schizophrenia and 100 healthy controls, and applied principal component analysis. They found that FCS from three channels on the medial prefrontal and left ventrolateral prefrontal cortices rendered accuracy as high as 84.67%, sensitivity at 92.00%, and specificity at 70%. However, due to the differences in study population recruited, usage of fNIRS features, and machine learning algorithms, it was difficult for us to directly compare these two studies.

On the other hand, there have been many structural or functional MRI studies using machine learning (e.g., SVM) technique reporting heterogeneous classification performances (with accuracies ranging from 60 to over 95%) in the classification of patients with chronic or first episode SZ against healthy individuals [reviewed by (8)]. However, there have been few MRI studies using deep learning to discriminate patients with schizophrenia and healthy controls. In the structural MRI study conducted by Pinaya et al. (9), the authors compared classification performance of deep belief network (DBN) and SVM between patients with schizophrenia and healthy individuals. They found DBN was slightly more accurate as a classifier (accuracy = 73.6%) than the SVM (accuracy = 68.1%) between patients with SZ and healthy individuals. However, the error rate of the DBN in classifying patients with first-episode psychosis (FEP) was 56.3%. In another study conducted by Vieira et al. (39), they used DNN to analyze a total of 956 participants (514 FEP and 444 HCs) and found that the best accuracies (70%) were achieved when DNN was applied compared to that when SVM was used (61.3%). In the present study, we demonstrated the classification accuracy of DNN (79.7%) was superior to that of SVM (68.6%), a finding similar to that reported by Vieira et al. However, Vieira et al. found it was difficult for the DNN models generalized to other sites, indicating that detection of individuals in the early stages of psychosis is more challenging. In the present study, we did not test our DNN model in another independent dataset, and future study using fNIRS dataset from other sites to test our DNN model is warranted.

Comparison of the Results of Correlational Analyses With Previous NIRS Studies

Similar to previous NIRS studies, we found that cortical activities over bilateral frontotemporal regions were negatively correlated with PANSS negative (17, 40) or general psychopathology scores (17, 28) during the both versions of VFTs. However, it is interesting to note that there was a significant positive correlation between frontal activity (R1 integral value) and antipsychotic dosage during the CFT, which has never been reported before. Antipsychotic treatment has been shown to improved cognitive function in first-episode and recent-onset schizophrenia (41). This finding probably indicated an improved cortical function after antipsychotic treatment and future studies are warranted to confirm our findings.

Limitations

There are several limitations in the present study. First, our study used small samples, which have been shown to yield unstable results (42, 43). Second, selection bias must be considered; this study used data from a tertiary hospital, and therefore the results may not be generalized. Third, the effects of medication on brain function should be considered. Although FES patients in the present study were minimally treated with antipsychotic medication, previous study demonstrated that even short-term treatment with antipsychotics was associated with structural brain changes (44). Fourth, there were only training and validation groups in our analysis, failure to test performance on additional independent samples (i.e., testing group) may limit the interpretation of our results. Future studies recruiting larger numbers of subjects from multi-sites are warranted. Fifth, NIRS data used in training the DL algorithm applied binary labels (FES or HCs). This dichotomous classification is widely used in researches of ML or DL, but it can be a barrier to applying this methodology in clinical practice. Most psychiatric diseases have a continuous spectrum and psychiatric comorbidities are common in a patient. The effects of psychiatric comorbidities on the brain function were not considered in the present study. In conclusion, in the present study, we distinguished FES from HCs by applying DNN to analyze frontotemporal activities during VFT measured by fNIRS and demonstrated fair sensitivity and specificity. However, additional independent datasets are needed to confirm the validity of our model.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by Institutional Review Board, Taichung Veterans General Hospital. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

P-HC designed the study, managed the literature searches, performed the NIRS measurements, assessments of study subjects, statistical analyses and wrote the first draft of the manuscript. Y-HY, R-XZ, and Y-LL revised the section Deep neural network, Support vector machines, and Statistical analysis of the manuscript, and performed SVM and DNN analyses supervised by T-TL. All authors interpreted the results, revised the manuscript and approved the final version submitted for publication.

Conflict of Interest

Y-HY, R-XZ, and Y-LL are currently employed by company MediaTek Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to than Wei-Hao Lin from Taichung Veterans General Hospital for his help in patient assessments.

Footnotes

Funding. This research was funded by MOST 109-2314-B-695-001.

References

  • 1.Chou PH, Chu CS, Lin CH, Cheng C, Chen YH, Lan TH, et al. Use of atypical antipsychotics and risks of cataract development in patients with schizophrenia: a population-based, nested case-control study. Schizophr Res. (2016) 174:137–43. 10.1016/j.schres.2016.03.027 [DOI] [PubMed] [Google Scholar]
  • 2.McGorry PD, Killackey E, Yung A. Early intervention in psychosis: concepts, evidence and future directions. World Psychiatry. (2008) 7:148–56. 10.1002/j.2051-5545.2008.tb00182.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.McGorry P, Keshavan M, Goldstone S, Amminger P, Allott K, Berk M, et al. Biomarkers and clinical staging in psychiatry. World Psychiatry. (2014) 13:211–23. 10.1002/wps.20144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.McGorry PD, Mihalopoulos C, Henry L, Dakis J, Jackson HJ, Flaum M, et al. Spurious precision: procedural validity of diagnostic assessment in psychotic disorders. Am J Psychiatry. (1995) 152:220–3. 10.1176/ajp.152.2.220 [DOI] [PubMed] [Google Scholar]
  • 5.Ferrari M, Quaresima V. A brief review on the history of human functional near-infrared spectroscopy (fNIRS) development and fields of application. Neuroimage. (2012) 63:921–35. 10.1016/j.neuroimage.2012.03.049 [DOI] [PubMed] [Google Scholar]
  • 6.Koike S, Nishimura Y, Takizawa R, Yahata N, Kasai K. Near-infrared spectroscopy in schizophrenia: a possible biomarker for predicting clinical outcome and treatment response. Front Psychiatry. (2013) 4:145. 10.3389/fpsyt.2013.00145 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Chou PH, Huang CJ, Sun CW. The potential role of functional near-infrared spectroscopy as clinical biomarkers in schizophrenia. Curr Pharm Design. (2020) 26:201–17. 10.2174/1381612825666191014164511 [DOI] [PubMed] [Google Scholar]
  • 8.Arbabshirani MR, Plis S, Sui J, Calhoun VD. Single subject prediction of brain disorders in neuroimaging: promises and pitfalls. Neuroimage. (2017) 145(Pt B):137–65. 10.1016/j.neuroimage.2016.02.079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Pinaya WH, Gadelha A, Doyle OM, Noto C, Zugman A, Cordeiro Q, et al. Using deep belief network modelling to characterize differences in brain morphometry in schizophrenia. Sci Rep. (2016) 6:38897. 10.1038/srep38897 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Meyer S, Mueller K, Stuke K, Bisenius S, Diehl-Schmid J, Jessen F, et al. Predicting behavioral variant frontotemporal dementia with pattern classification in multi-center structural MRI data. NeuroImage Clin. (2017) 14:656–62. 10.1016/j.nicl.2017.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Suk HI, Lee SW, Shen D, Alzheimer's disease neuroimaging i . Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis. Neuroimage. (2014) 101:569–82. 10.1016/j.neuroimage.2014.06.077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Suk HI, Lee SW, Shen D, Alzheimer's disease neuroimaging I . Latent feature representation with stacked auto-encoder for AD/MCI diagnosis. Brain Struct Funct. (2015) 220:841–59. 10.1007/s00429-013-0687-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Suk HI, Shen D. Deep learning-based feature representation for AD/MCI classification. Med Image Comput Comput Assist Interv. (2013) 16(Pt 2):583–90. 10.1007/978-3-642-40763-5_72 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kim J, Calhoun VD, Shim E, Lee JH. Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: Evidence from whole-brain resting-state functional connectivity patterns of schizophrenia. Neuroimage. (2016) 124(Pt A):127–46. 10.1016/j.neuroimage.2015.05.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, et al. The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry. (1998) 59(Suppl 20):22–33 quiz 4–57. [PubMed] [Google Scholar]
  • 16.Fannon D, Chitnis X, Doku V, Tennakoon L, O'Ceallaigh S, Soni W, et al. Features of structural brain abnormality detected in first-episode psychosis. Am J Psychiatry. (2000) 157:1829–34. 10.1176/appi.ajp.157.11.1829 [DOI] [PubMed] [Google Scholar]
  • 17.Chou PH, Lin WH, Lin CC, Hou PH, Li WR, Hung CC, et al. Duration of untreated psychosis and brain function during verbal fluency testing in first-episode schizophrenia: a near-infrared spectroscopy study. Sci Rep. (2015) 5:18069. 10.1038/srep18069 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Oldfield RC. The assessment and analysis of handedness: the Edinburgh inventory. Neuropsychologia. (1971) 9:97–113. 10.1016/0028-3932(71)90067-4 [DOI] [PubMed] [Google Scholar]
  • 19.Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr Bull. (1987) 13:261–76. 10.1093/schbul/13.2.261 [DOI] [PubMed] [Google Scholar]
  • 20.Inada T, Inagaki A. Psychotropic dose equivalence in Japan. Psychiatry Clin Neurosci. (2015) 69:440–7. 10.1111/pcn.12275 [DOI] [PubMed] [Google Scholar]
  • 21.Gardner DM, Murphy AL, O'Donnell H, Centorrino F, Baldessarini RJ. International consensus study of antipsychotic dosing. Am J Psychiatry. (2010) 167:686–93. 10.1176/appi.ajp.2009.09060802 [DOI] [PubMed] [Google Scholar]
  • 22.Chou PH, Koike S, Nishimura Y, Satomura Y, Kinoshita A, Takizawa R, et al. Similar age-related decline in cortical activity over frontotemporal regions in schizophrenia: a multichannel near-infrared spectroscopy study. Schizophr Bull. (2015) 41:268–79. 10.1093/schbul/sbu086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ehlis AC, Herrmann MJ, Plichta MM, Fallgatter AJ. Cortical activation during two verbal fluency tasks in schizophrenic patients and healthy controls as assessed by multi-channel near-infrared spectroscopy. Psychiatry Res. (2007) 156:1–13. 10.1016/j.pscychresns.2006.11.007 [DOI] [PubMed] [Google Scholar]
  • 24.Ikezawa K, Iwase M, Ishii R, Azechi M, Canuet L, Ohi K, et al. Impaired regional hemodynamic response in schizophrenia during multiple prefrontal activation tasks: a two-channel near-infrared spectroscopy study. Schizophr Res. (2009) 108:93–103. 10.1016/j.schres.2008.12.010 [DOI] [PubMed] [Google Scholar]
  • 25.Pu S, Nakagome K, Yamada T, Yokoyama K, Itakura M, Satake T, et al. Association between subjective well-being and prefrontal function during a cognitive task in schizophrenia: a multi-channel near-infrared spectroscopy study. Schizophr Res. (2013) 149:180–5. 10.1016/j.schres.2013.06.036 [DOI] [PubMed] [Google Scholar]
  • 26.Suto T, Fukuda M, Ito M, Uehara T, Mikuni M. Multichannel near-infrared spectroscopy in depression and schizophrenia: cognitive brain activation study. Biol Psychiatry. (2004) 55:501–11. 10.1016/j.biopsych.2003.09.008 [DOI] [PubMed] [Google Scholar]
  • 27.Jobsis FF. Noninvasive, infrared monitoring of cerebral and myocardial oxygen sufficiency and circulatory parameters. Science. (1977) 198:1264–7. 10.1126/science.929199 [DOI] [PubMed] [Google Scholar]
  • 28.Takizawa R, Kasai K, Kawakubo Y, Marumo K, Kawasaki S, Yamasue H, et al. Reduced frontopolar activation during verbal fluency task in schizophrenia: a multi-channel near-infrared spectroscopy study. Schizophr Res. (2008) 99:250–62. 10.1016/j.schres.2007.10.025 [DOI] [PubMed] [Google Scholar]
  • 29.Tsuzuki D, Jurcak V, Singh AK, Okamoto M, Watanabe E, Dan I. Virtual spatial registration of stand-alone fNIRS data to MNI space. Neuroimage. (2007) 34:1506–18. 10.1016/j.neuroimage.2006.10.043 [DOI] [PubMed] [Google Scholar]
  • 30.Shattuck DW, Mirza M, Adisetiyo V, Hojatkashani C, Salamon G, Narr KL, et al. Construction of a 3D probabilistic atlas of human cortical structures. Neuroimage. (2008) 39:1064–80. 10.1016/j.neuroimage.2007.09.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Sato H, Yahata N, Funane T, Takizawa R, Katura T, Atsumori H, et al. A NIRS-fMRI investigation of prefrontal cortex activity during a working memory task. Neuroimage. (2013) 83:158–73. 10.1016/j.neuroimage.2013.06.043 [DOI] [PubMed] [Google Scholar]
  • 32.Takizawa R, Fukuda M, Kawasaki S, Kasai K, Mimura M, Pu S, et al. Neuroimaging-aided differential diagnosis of the depressive state. Neuroimage. (2014) 85(Pt 1):498–507. 10.1016/j.neuroimage.2013.05.126 [DOI] [PubMed] [Google Scholar]
  • 33.Bleich-Cohen M, Jamshy S, Sharon H, Weizman R, Intrator N, Poyurovsky M, et al. Machine learning fMRI classifier delineates subgroups of schizophrenia patients. Schizophr Res. (2014) 160:196–200. 10.1016/j.schres.2014.10.033 [DOI] [PubMed] [Google Scholar]
  • 34.Kloppel S, Stonnington CM, Chu C, Draganski B, Scahill RI, Rohrer JD, et al. Automatic classification of MR scans in Alzheimer's disease. Brain. (2008) 131(Pt 3):681–9. 10.1093/brain/awm319 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Plis SM, Hjelm DR, Salakhutdinov R, Allen EA, Bockholt HJ, Long JD, et al. Deep learning for neuroimaging: a validation study. Front Neurosci. (2014) 8:229. 10.3389/fnins.2014.00229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. (2015) 521:436–44. 10.1038/nature14539 [DOI] [PubMed] [Google Scholar]
  • 37.Li Z, Wang Y, Quan W, Wu T, Lv B. Evaluation of different classification methods for the diagnosis of schizophrenia based on functional near-infrared spectroscopy. J Neurosci Methods. (2015) 241:101–10. 10.1016/j.jneumeth.2014.12.020 [DOI] [PubMed] [Google Scholar]
  • 38.Yang J, Ji X, Quan W, Liu Y, Wei B, Wu T. Classification of schizophrenia by functional connectivity strength using functional near infrared spectroscopy. Front Neuroinformatics. (2020) 14:40. 10.3389/fninf.2020.00040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Vieira S, Gong QY, Pinaya WHL, Scarpazza C, Tognin S, Crespo-Facorro B, et al. Using machine learning and structural neuroimaging to detect first episode psychosis: reconsidering the evidence. Schizophr Bull. (2020) 46:17–26. 10.1093/schbul/sby189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Chou PH, Lin WH, Li WR, Huang CM, Sun CW. Reduced language lateralization in first episode schizophrenia: A near infrared spectroscopy study. Prog Neuropsychopharmacol Biol Psychiatry. (2017) 78:96–104. 10.1016/j.pnpbp.2017.05.001 [DOI] [PubMed] [Google Scholar]
  • 41.Gold S, Arndt S, Nopoulos P, O'Leary DS, Andreasen NC. Longitudinal study of cognitive function in first-episode and recent-onset schizophrenia. Am J Psychiatry. (1999) 156:1342–8. [DOI] [PubMed] [Google Scholar]
  • 42.Nieuwenhuis M, van Haren NE, Hulshoff Pol HE, Cahn W, Kahn RS, Schnack HG. Classification of schizophrenia patients and healthy controls from structural MRI scans in two large independent samples. Neuroimage. (2012) 61:606–12. 10.1016/j.neuroimage.2012.03.079 [DOI] [PubMed] [Google Scholar]
  • 43.Schnack HG, Kahn RS. Detecting neuroimaging biomarkers for psychiatric disorders: sample size matters. Front Psychiatry. (2016) 7:50. 10.3389/fpsyt.2016.00050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lesh TA, Tanase C, Geib BR, Niendam TA, Yoon JH, Minzenberg MJ, et al. A multimodal analysis of antipsychotic effects on brain structure and function in first-episode schizophrenia. JAMA Psychiatry. (2015) 72:226–34. 10.1001/jamapsychiatry.2014.2178 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.


Articles from Frontiers in Psychiatry are provided here courtesy of Frontiers Media SA

RESOURCES