Abstract
Resting-state electroencephalography (EEG) microstates reflect sub-second, quasi-stable states of brain activity. Several studies have reported alterations of microstate features in patients with schizophrenia (SZ). Based on these findings, it has been suggested that microstates may represent neurophysiological biomarkers for the classification of SZ. To explore this possibility, machine learning approaches can be employed. Bayesian optimization is a machine learning approach that selects the best-fitted machine learning model with tuned hyperparameters from existing models to improve the classification. In this proof-of-concept preliminary study based on secondary analysis, 20 microstate features were extracted from 14 SZ patients and 14 healthy controls’ EEG signals. These parameters were then ranked as predictors based on their importance, and an optimized machine learning approach was applied to evaluate the performance of the classification. SZ patients had altered microstate features compared to healthy controls. Furthermore, Bayesian optimization outperformed conventional multivariate analyses and showed the highest accuracy (90.93%), AUC (0.90), sensitivity (91.37%), and specificity (90.48%), with reliable results using just six microstate predictors. Altogether, in this proof-of-concept study, we showed that machine learning with Bayesian optimization can be utilized to characterize EEG microstate alterations and contribute to the classification of SZ patients.
Keywords: microstate analysis, schizophrenia, optimized machine learning, microstate map correlation, resting-state EEG
1. Introduction
Schizophrenia (SZ) is a psychiatric disorder whose neurobiological underpinnings are still largely unknown. One of the most widely used techniques in SZ research is electroencephalography (EEG), which measures electrical activity in the brain. EEG signals reflect the oscillation and large-scale synchronization of underlying neural populations [1] and therefore can be used to investigate the synchrony and dynamics of neural circuits in SZ patients [2,3]. In patients with SZ, EEG abnormalities have been reported from studies measuring event-related potential paradigms, including face processing and mismatch negativity [4,5], sleep EEG characteristics [6,7], graph analysis measures of resting state EEG [8], and EEG power spectra, which are the most commonly computed features [9,10,11]. Statistical features, such as mean, skewness, and kurtosis were also investigated in SZ [12,13]. However, most of these EEG-based features have failed to take advantage of the EEG millisecond resolution [14,15,16,17,18].
Microstates are topographic maps of resting state brain activity that exploit the EEG temporal resolution. Specifically, Lehmann et al. first demonstrated the existence of microstates by segmenting EEG at the sub-second level, generating quasi-stable and evenly patterned data occurring at 80–120 ms intervals [19]. These images do not gradually merge or overlap in time; instead, a single map abruptly switches to another (i.e., microstates are quasi-stable periods of single maps) [20]. Microstate changes have been described as “atoms of cognition” [21] or “building blocks of mentation” [22]. For instance, distinct changes in EEG microstates have been associated with specific cognitive functions, sensory inputs [23], and tasks requiring reasoning [20]. Four main microstates (respectively Type A, Type B, Type C, and Type D) have been described, and the auditory, visual, default mode and dorsal attention pathways have been shown to relate to these microstates, respectively [24,25,26]. Several studies have reported microstate alterations in patients with SZ relative to healthy control subjects [14,15,21,27,28,29], and based on this increasing body of evidence, it was recently suggested that EEG microstates features could be used for SZ classification [27,30].
The heterogeneity and complexity of schizophrenia symptoms challenge an objective diagnosis; thus, a single predictor would likely be affected by such heterogeneity and may not reach the threshold to be detected by traditional statistical approaches. In contrast, more complex statistical models, such as those used in machine learning approaches, can be utilized to evaluate the combination of several predictors for the classification of SZ concurrently [31]. Although previous studies investigated SZ classification with machine-learning-based clustering algorithms [32,33,34], these studies have mostly used univariate or conventional machine learning methods, rather than optimized multivariate analyses [18]. Choosing optimized SZ classification is challenging and comprises multiple hyperparameters, which are set before the training process and define how the model can best fit the data [31,35]. There are two main types of hyperparameter selection methods: manual and automatic search. Manual search requires experienced users, whereas automatic methods, including grid and random searches, are more user independent [36]. Random search has solved the expensive cost of exhaustive searching in grid search and has proved to be more efficient in high-dimensional space, even though it may be unreliable for some complex models [36]. Choosing the best-fitted model with tuned hyperparameters is an optimization problem with black-box objective function, and several recent studies indicated that Bayesian optimization outperforms other optimization methods [37,38,39].
A few studies applied machine learning algorithms on SZ microstate predictors [27,30,40,41], but there is no study that evaluated the optimization multivariate analysis on these measures. In this study, we hypothesized that using optimized multivariate analysis along with numerous dependent and independent variables (i.e., microstate features), including newly computed microstate measures, would contribute to efficiently discriminating SZ patients from healthy subjects. Thus, we aimed to discriminate between EEG recordings of individuals diagnosed with SZ and healthy control participants using the Bayesian optimized machine learning approach and the best-fitted model based on the most important microstate features.
2. Materials and Methods
2.1. Dataset
We used a publicly available EEG dataset, which included 14 schizophrenia patients (7 males: 27.9 ± 3.3 years, 7 females: 28.3 ± 4.1 years) and 14 healthy controls (7 males: 26.8 ± 2.9, 7 females: 28.7 ± 3.4 years) from the Institute of Psychiatry and Neurology in Warsaw, Poland [42,43]. The patients met the ICD10 criteria for paranoid schizophrenia, according to the International Classification of Diseases (category F20.0). All participants were given a written description of the protocol and signed a consent form to participate in the study. A minimum age of 18 was required, as well as an ICD-10 diagnosis of F20.0 and at least a seven-day medication washout period was required before EEG was performed. The Warsaw Institute of Psychiatry and Neurology’s Ethics Committee approved the study protocol. Pregnancy, organic brain pathology, severe neurological diseases (e.g., epilepsy, Alzheimer’s disease, and Parkinson’s disease), the presence of a general medical condition, and the first episode of SZ, were all considered exclusion criteria. The 14 patients who completed the study were matched in gender and age to the control group. Fifteen minutes of resting-state EEG eye-closed were recorded from 19 channels with a sampling frequency of 250 Hz using the international 10/20 EEG system montage.
2.2. EEG Data Pre-Processing
We used EEGLAB [44] to pre-process the EEG dataset before performing microstate analyses. This step was completed based on well-established pre-processing methods [14,15,22,27,45]. Briefly, data were filtered between 2 and 20 Hz and re-referenced to the common average electrode. Then, EEG signals were segmented into 5-second nonoverlapping epochs. Outlier epochs were removed with a variance threshold (i.e., >3 standard deviation). Overall, 2511 artifact-free EEG epochs for schizophrenia patients and 2511 artifact-free epochs for healthy control subjects were utilized for the microstate analysis (Figure 1).
2.3. Microstate Analysis
The entire EEG signal can be represented by a small set of topographic maps that alternate at discrete intervals when EEG is viewed as a topography of electric potentials evolving over time [46]. These topographic maps are named microstates. These images/topographies do not gradually merge or overlap in time; instead, a single map dominates for roughly 80–120 milliseconds before abruptly switching to another map (i.e., microstates are quasi-stable periods of single maps) [19]. To extract the relevant maps, the root-mean-squared potential differences at all electrodes (i.e., ) from the mean of instantaneous potentials across electrodes (i.e., ) were computed to characterize the EEG global field power (GFP) [47].
(1) |
Topographies of the local maxima of the GFP curve were identified based on the above equation. We then used a modified K-means clustering approach and global explained variance (GEV) criteria [27,48,49] by setting the re-run parameter to 20 times, the convergence threshold at 10−6, and the maximum number of iterations to 1000 [33,44].
We identified four microstates (i.e., A, B, C, and D, Figure 2), which reflected the activity of the EEG channels with almost 70% of the total topographic variance, and labeled the topography at each GFP peak as one of these microstates. Five categories of features were computed for these microstates (Figure 3 and Figure S1): (1) the average number of times per second that each microstate occurred during the EEG recording (occurrence); (2) the average amount of time each microstate lasted after it occurred (duration); (3) the percentage of total recording time in which each microstate was dominant (coverage); [19] (4) the average GFP during microstate dominance (mean GFP) [28]; and (5) the average correlation between each labeled GFP peak map (i.e., A) and the corresponding microstate template (microstate map correlations) [34]. All microstate features are summarized in Table 1.
Table 1.
Features | Definition | Number |
---|---|---|
Occurrence | Occurrence of a microstate per second (Hz) | 4 |
Duration | The average duration of a microstate (ms) | 4 |
Coverage | Percent of time occupied by a microstate (%) | 4 |
Mean GFP | Mean of global field power (uV) | 4 |
Microstate map correlation (MsMC) | Spatial correlation between topographies and microstates | 4 |
2.4. Predictors’ Ranking and Classification
We ranked the 20 features/predictors described above using chi-square tests (“fscchi2”) embedded in the Statistics and Machine Learning Toolbox of MATLAB software (MathWorks, Inc., Natick, MA, USA, 2020) (Figure 3). Then, an optimized machine-learning approach was applied to classify the combination of ranked microstate features for SZ classification by adding each feature incrementally. Of note, machine learning classification can be subject dependent or subject independent. In the subject-independent classification, the model is tested using unseen subject’s data, (i.e., the data that are used to test the model are not included in the training phase), while here in the subject-dependent classification, training and testing sets are randomly split, and EEG-segment data from the same subject are included in both sets [31]. Because our cohort of SCZ and HC subjects was rather small, subject-dependent classification was employed. In each run, 80% of the data was utilized in the training process with a 5-fold cross-validation method to avoid overfitting, and the remaining data (20%) were used to test for the best-fitted trained model. Histograms of subject-based features used for this study to show the within-group subject variabilities of microstate features were also computed (see Figures S5 and S6). Accuracy, area under the curve (AUC), sensitivity, and specificity were used to assess classifier performance. Furthermore, we used the leave-one-out validation method by excluding the data of one individual per group each time to examine the subject-independent design in addition to the current approach. The trained models on the remaining dataset were tested on these two individuals, who each time were left out. Our results are comparable in both designs (output measures in the range of about 90% for subject independent vs. subject dependent) in Table S3. The classifier algorithms were selected from a pool of existing classifiers based on prior EEG studies that highlighted the important application role of ML as a real-time health monitoring system for stroke prognostics [50], access ischemic stroke-derived cortical impairment [51] and other biomedical engineering works [50,51,52,53,54,55]. The Shapley additive explanations (SHAP) or Shapley values of features were computed using the “shapley” function embedded in the Statistics and Machine Learning Toolbox of MATLAB2022b, which explains the deviation of the prediction for the query points from the average prediction, due to the feature (Figure 4). For each query point, the sum of the Shapley values for all features corresponds to the total deviation of the prediction from the average [56,57]. The details of the machine learning optimization analysis are described below.
2.5. Bayesian Optimization of Classification
To perform Bayesian optimization of machine learning classification, we employed an algorithm comprising 2 principal steps (Equations (S2) and (S3) in Table S1), where defined the training dataset with the t − 1 observation of an unknown function (Table S1 and Figure S2).
To automatically choose the machine learning algorithm with tailored hyperparameters, the Statistics and Machine Learning ToolboxTM (MATLAB and Release 2020b, The MathWorks, Inc., Natick, MA, USA) was utilized with the “fitcauto” function [37,38,56]. A multi-TreeBagger model of the objective function was included in the Bayesian optimization approach of “fitcauto”. The objective function of this model differed from the Gaussian process model implemented by other machine learning toolbox functions using Bayesian optimization, and the next point to be examined was determined by an acquisition function (i.e., expected improvement). The output of the “fitcauto” algorithm was the point with the lowest objective function value among the points assessed during the optimization. This method automatically chose the best machine learning method for training data from among the most applicable machine learning methods (e.g., discriminant analysis [‘discr’], ensemble learning [“ensemble”], kernel classifier [‘kernel’], k-nearest neighbor [‘knn’], support vector machine classifier (SVM), linear classifier [‘linear’], naive Bayes classifier [‘nb’], neural network classifier [‘net’], and decision tree classifier [‘tree’]). When the optimization process was completed, “fitcauto” returned the trained model for the entire train dataset to perform classification [57].
3. Results
Four microstates were identified: A, B, C, and D that exhibited right-frontal left-posterior, left-frontal right-posterior, midline frontal-occipital, and midline frontal topographies respectively. Microstate scalp topographies of patients diagnosed with SZ and healthy control subjects are shown in Figure 2. We also computed and compared twenty microstate features, including occurrence, duration, coverage, and microstate correlation maps between SZ patients and HC subjects. We found that most of the predictors were significantly different between groups after Bonferroni correction for multiple comparisons (α < 0.008, Table 2, Figure S1).
Table 2.
Features (Occurrence) | Type A | Type B | Type C | Type D |
---|---|---|---|---|
Mean ± SD [HC] | 3.65 ± 1.17 | 4.26 ± 1.39 | 4.13 ± 1.28 | 4.01 ± 1.26 |
Mean ± SD [SZ] | 4.05 ± 1.23 | 4.41 ± 1.06 | 2.34 ± 1.42 | 4.49 ± 1.09 |
p-value | <0.001 | <0.001 | <0.001 | <0.001 |
t-value | −11.68 | −4.28 | 46.89 | −14.41 |
Observed power | >0.999 | 0.990 | >0.999 | >0.999 |
Features (Duration) | Type A | Type B | Type C | Type D |
Mean ± SD [HC] | 67.18 ± 62.08 | 58.46 ± 15.80 | 55.91 ± 15.89 | 65.07 ± 26.78 |
Mean ± SD [SZ] | 56.22 ± 11.90 | 59.49 ± 13.94 | 57.29 ± 36.88 | 79.79 ± 36.43 |
p-value | <0.001 | 0.015 | 0.086 | <0.001 |
t-value | 8.68 | −2.43 | −1.72 | −16.30 |
Observed power | >0.999 | 0.684 | 0.405 | >0.999 |
Features (Coverage) | Type A | Type B | Type C | Type D |
Mean ± SD [HC] | 23.53 ± 17.62 | 25.75 ± 10.65 | 23.65 ± 10.49 | 27.04 ± 13.36 |
Mean ± SD [SZ] | 22.82 ± 8.31 | 26.22 ± 8.18 | 15.21 ± 14.37 | 35.73 ± 15.507 |
p-value | 0.068 | 0.077 | <0.001 | <0.001 |
t-value | 1.82 | −1.76 | 23.76 | −21.28 |
Observed power | 0.446 | 0.423 | >0.999 | >0.999 |
Features (Mean GFP) | Type A | Type B | Type C | Type D |
Mean ± SD [HC] | 4.61 ± 1.67 | 4.92 ± 1.75 | 5.20 ± 2.00 | 5.40 ± 1.99 |
Mean ± SD [SZ] | 4.90 ± 3.93 | 5.05 ± 1.67 | 5.42 ± 2.75 | 5.61 ± 1.77 |
p-value | 0.001 | 0.007 | <0.001 | <0.001 |
t-value | −3.29 | −2.70 | −3.32 | −3.85 |
Observed power | 0.908 | 0.770 | 0.913 | 0.971 |
Features (Mean MsMC) | Type A | Type B | Type C | Type D |
Mean ± SD [HC] | 0.60 ± 0.05 | 0.62 ± 0.11 | 0.64 ± 0.09 | 0.66 ± 0.11 |
Mean ± SD [SZ] | 0.58 ± 0.07 | 0.60 ± 0.08 | 0.59 ± 0.10 | 0.66 ± 0.08 |
p-value | <0.001 | <0.001 | <0.001 | 0.434 |
t-value | 6.86 | 6.12 | 15.99 | 0.78 |
Observed power | >0.999 | >0.999 | >0.999 | 0.1222 |
Bold values indicate significant results.
We then ranked microstate parameters, which indicated that the features from microstates C and D had the highest predictor importance scores (Figure 3 and Figure 4). Specifically, Ocurrence_C, Coverage_C, MsMC_C, Duration_C, MsMC_B, and Coverage_D were the highest ranked parameters. Furthermore, to compare the performance obtained using ranked microstate features in classifying patients diagnosed with SZ and HC subjects, we implemented models that incrementally considered ranked features (e.g., model 1 included only the Occurrence_C feature, and model 2 considered Occurance_C and Coverage_C as input) The number of features that were fed to the machine learning approach based on the ranked order is presented in Figure 3. Furthermore, the contributions of the value of the feature to the difference between the actual prediction and the mean prediction is estimated as Shapley values in Figure 4. The x-axis indicates the variable name, and the value next to them is the mean SHAP value. On the y-axis is the SHAP value that indicates how much the change in features can positively or negatively affect the probability of prediction.
Classification performance using the optimized machine learning approach relative to the quadratic SVM [27,58] showed that the highest output measure results were obtained when using 19 ranked features as the input of the optimized ML algorithm [ACC = 90.93%, AUC = 0.90%, sensitivity = 91.37%, specificity = 90.48%] (Table 3 and Table S2, Figure 5, Figures S3 and S4). Furthermore, comparable results in terms of accuracy, sensitivity, and ACU were obtained with the optimized ML algorithm by using the first six ranked features. The best-fitted model selected by the optimized algorithm using 19 features was SVM with Gaussian kernel and ‘Ensemble’ when using the 6 most important features, while quadratic SVM was used in the recent study on 19 microstate features.
Table 3.
Classifier | Accuracy (%) | AUC | Sensitivity (%) | Specificity (%) | Best-Fitted Model |
---|---|---|---|---|---|
Previous study [19 features] [27] | 75.64 | 0.80 | 71.93 | 75.50 | Quadratic SVM |
Previous study [15 features] [27] | 76.62 | - | - | - | Quadratic SVM |
Optimized ML [6 features] | 89.04 | 0.89 | 88.62 | 89.47 | Ensemble |
Optimized ML [19 features] | 90.93 | 0.90 | 91.37 | 90.48 | Gaussian SVM |
4. Discussion
We employed an optimized machine learning approach to microstate measures and examined their potential for SZ classification. By applying the Bayesian optimized machine learning approach to ranked microstate measures, we were able to discern resting EEG recordings of SZ from HC subjects with high sensitivity, specificity, and accuracy (Table 3, Figure 5 and Figure S2). We also established that with only six features, we could efficiently classify SZ using microstate analyses (Figure 3, Figure 4 and Figure 5 and Figure S3). Overall, findings from this proof-of-concept study show that optimized ML applied to microstate features could contribute to the identification of patients with SZ relative to HC subjects.
In line with previous studies [18,27,28,30,59], we found four microstates (i.e., A, B, C, and D) that had a similar topography in SZ and HC groups. These four microstates explained the global topographic variance and have been suggested to represent distinct functions of the brain [25,27,48]. Specifically, microstates A and B have been associated with the processing of different sensory modalities and with the mental visualization of the situation [60], whereas types D and C have been implicated in attention regulation and default mode functionality, respectively [27]. Although the types and topographies of these microstates were similar between SZ and HC subjects, individual microstate features differed across groups. For instance, the occurrence of microstate B was significantly increased in SZ vs. HC groups, in line with another study that also reported an association of this altered microstate parameter with the positive symptoms of patients with SZ [30]. We also found the mean GFP was significant in SZ vs. HC individuals across all four types of microstates, likely indicating that SZ patients have a higher level of synchronization (i.e., increased cortical power) during resting EEG recordings [61]. The MsCM, a new microstate feature that was calculated for this study showed higher values in HC vs. SZ across types A, B, and C, thus suggesting that GFP topographies are more consistently repeated (i.e., less variability in topographic patterns) in HC relative to SZ patients. Besides MsCM, Type C had reduced occurrence and coverage in SZ vs. HC individuals. These findings are in line with results from a recent meta-analyses of microstate research [18,30]. Of note, microstate C has been linked to the functionality of the saliency network, including the anterior cingulate, inferior frontal gyrus, and insula [20], and aberrant activity in the salience network has been consistently reported in SZ [62,63,64]. Thus, alterations in microstate C parameters further point to dysfunction in this network in SZ. Here, microstate Type D showed an increase in occurrence, duration, and coverage, while previous studies reported a decrease in these parameters [18,27]. Although these discrepancies could be related to methodological differences as well as medication status [15,65], other microstate studies reported an increase in these Type D characteristics, consistent with our findings [30,66]. Microstate D features have been linked to flexible aspects of attention because of their association with the frontoparietal attention network [20]. Studies indicated that impairments of microstates of class D in SZ are associated with deficits in context update, attentional processes, and executive control, which are often observed in these patients [18,67].
Since we observed alterations in most of the microstate parameters computed, we wanted to assess whether some alterations were more relevant than others in differentiating individuals with SZ from HC. We therefore computed the predictor importance score for each of these 20 parameters and found that features from microstate Type C and D were ranked the highest. A reduction in the occurrence and the coverage of microstate Type C were the two top-ranked features, while coverage and duration of microstate D, both of which were increased in SZ vs. HC, were also highly ranked. Given the implication of types C and D in default mode and dorsal attention respectively, these findings suggest that alterations in these domains may be more relevant for SZ classification [20,22]. At the same time, some microstate B features were ranked high as well, including the MsMC that was computed in this study for the first time, and therefore these parameters should be considered and assessed in future studies of SZ. In contrast, Type A microstate features were ranked lower in our study, a finding in agreement with previous work, showing that these features were relatively intact in patients with SZ vs. HC [27].
Multivariate analysis based on machine learning algorithms provides an opportunity to understand the SZ classification by analyzing many features simultaneously. A handful of studies have utilized microstate features and have tested their accuracy and precision to classify SZ with multivariate patterns and have suggested the efficacy of this approach [14,27]. Building on this body of evidence, in this study, we used this machine learning approach to evaluate multiple, ranked microstate features at the same time. As such, we created a more generalized cross-validated model with increased AUC, sensitivity, and specificity. In particular, the optimized machine learning approach employed here was able to achieve greater than 90% efficacy based on the Gaussian SVM using 19 features and greater than 88% accuracy with the ensemble as the best fitted using just 6 features. Importantly, a recent study that used the same dataset for SZ classification reported highest output measures as [Acc = 75.64%,, Sensitivity = 71.93%, and Specificity = 75.50%] with quadratic SVM [27]. Several factors, including the number of EEG trials, the type and number of microstate features (i.e., MsMC), and the feature selection method (i.e., feature importance score calculation) may have contributed to this difference. The accuracy, specificity, and sensitivity scores reported here are comparable with some deep learning approaches that use EEG microstates for SZ classification [68,69,70,71,72,73]. The fact that findings obtained from our optimized ML approach were comparable to deep learning methods in terms of performance but outperformed in computation and time of processing (i.e., our method [order of minutes] vs. [order of hours]) potentially provide a more rapid, efficient way to achieve an optimized SZ classification.
This study has several limitations that should be addressed in future studies. For example, although the number of epochs was large enough for applying the optimized classification approach, the sample size of the SZ and HC groups was rather small; thus, we decided to apply subject-dependent machine learning approach classification on 5-second segments of data. We also chose this approach to make our results more comparable to a previous study using the same dataset [27]. Compared to the subject-dependent method, the subject-independent method may offer greater generalizability regarding learning the HC vs. SZ labels rather than from the individual’s signature. Therefore, future work on larger groups of SZ patients is needed to confirm the findings from this proof-of-concept study on a larger dataset, and also by applying a subject-independent classification. Nonetheless, in this study, we run a leave-one analysis and found that the main findings did not change (Table S3). Additionally, even though age and gender were matched, obtaining enough data to reflect the broader range of ages in both genders is necessary for the generalization of the trained models. Thus, to increase classification accuracy and develop an accurate model reflecting the general population, the classification performance of microstate characteristics should be examined using data from larger cohorts encompassing the lifespan. This will contribute to establish how specific features of individuals with SZ vs. HC are captured by the machine learning classification method presented here, in line with a personalized medicine approach. Furthermore, in the present study, four microstates were identified which encompassed at least 70% of the global explained variance (GEV) (Figure S7) [14,15,45,49]. While we found that GEV did not significantly change when the number of microstates increased, this could still affect classification performance. Future work should, therefore, also assess whether more than four microstates are identified and whether a different number of archetypes may affect the classification of SZ. Relatedly, in the present study, the characteristics of Type C and D microstates were among the highest ranked features, thus indicating that these parameters may be more reliable diagnostic features than Type A and B features in schizophrenia classification, which eventually could have relevant implication in the day-to-day clinical psychiatry practice [20,22]. Of note, each patient enrolled in this study underwent a medication washout period of at least seven days before the EEG recordings were performed. Nonetheless, future work should confirm these findings in medication-naïve patients and/or more thoroughly assess the possible impact of antipsychotic medications on EEG microstate parameters.
5. Conclusions
By employing for the first time an optimized machine learning approach on microstate measures of resting EEG recordings we achieved higher accuracy, sensitivity, and specificity of SZ patients compared to conventional classification methods, even with just six microstate predictors. Furthermore, our results showed that ranking microstate features was critical to optimize this process. Further studies should confirm and extend these findings on datasets involving larger cohorts of SZ patients. Eventually, the novel machine learning approach employed here may help establish EEG microstates as neurophysiological biomarkers that contribute to the classification of SZ.
Acknowledgments
We thank Ahmad Mayeli for the helpful feedback on the manuscript.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/brainsci12111497/s1, Table S1: Bayesian optimization workflow for the training dataset; Table S2: Hyperparameters of the machine learning models; Table S3: Hyperparameters of the machine learning models for subject independent design; Figure S1: The distribution of twenty microstate features that were extracted in this study; Figure S2: Flowchart of input data and Bayesian optimization of ML approach; Figure S3: Results of AUC computed for each model; Figure S4: ROC curves for the best-fitted models; Figure S5: Subject-based histogram of HC microstate features; Figure S6: Subject-based histogram of SZ microstate features; Figure S7: Global explained variance (GEV) rate for choosing different numbers of archetypes map.
Author Contributions
A.K.: Conceptualization; methodology; software; validation; formal analysis, investigation; resources; data curation; visualization; writing—original draft preparation; writing—review and editing. S.S.S.: Methodology; software; validation; formal analysis, investigation; resources; data curation; writing—original draft preparation; visualization. M.H.: Methodology; software; formal analysis, investigation; resources; data curation. F.F.: Conceptualization; methodology; validation; investigation; resources; data curation; visualization; writing—original draft preparation; writing—review and editing; supervision; project administration; funding acquisition. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
The public repository data were used under license no. (Approval Code): CC0 Creative Commons Zero 1.0.
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
Publicly archived datasets analyzed during the study can be accessed via the link: https://repod.icm.edu.pl/dataset.xhtml?persistentId=doi:10.18150/repod.0107441 (accessed on 1 September 2017).
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research was funded by the National Institute of Mental Health (NIMH), grant number R01 MH113827, awarded to Fabio Ferrarelli.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Bleuler E. Dementia Praecox or the Group of Schizophrenias. International Universities Press; New York, NY, USA: 1950. [Google Scholar]
- 2.Kraepelin E. In: Dementia Praecox and Paraphrenia. Barclay R.M., translator. Krieger; Huntingdon, NY, USA: 1919. [Google Scholar]
- 3.Vignapiano A., Koenig T., Mucci A., Giordano G.M., Amodio A., Altamura M., Bellomo A., Brugnoli R., Corrivetti G., Di Lorenzo G. Disorganization and cognitive impairment in schizophrenia: New insights from electrophysiological findings. Int. J. Psychophysiol. 2019;145:99–108. doi: 10.1016/j.ijpsycho.2019.03.008. [DOI] [PubMed] [Google Scholar]
- 4.McCleery A., Lee J., Joshi A., Wynn J.K., Hellemann G.S., Green M.F. Meta-analysis of face processing event-related potentials in schizophrenia. Biol. Psychiatry. 2015;77:116–126. doi: 10.1016/j.biopsych.2014.04.015. [DOI] [PubMed] [Google Scholar]
- 5.Haigh S.M., Coffman B.A., Salisbury D.F. Mismatch negativity in first-episode schizophrenia: A meta-analysis. Clin. EEG Neurosci. 2017;48:3–10. doi: 10.1177/1550059416645980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ferrarelli F. Sleep abnormalities in schizophrenia: State of the art and next steps. Am. J. Psychiatry. 2021;178:903–913. doi: 10.1176/appi.ajp.2020.20070968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chan M.-S., Chung K.-F., Yung K.-P., Yeung W.-F. Sleep in schizophrenia: A systematic review and meta-analysis of polysomnographic findings in case-control studies. Sleep Med. Rev. 2017;32:69–84. doi: 10.1016/j.smrv.2016.03.001. [DOI] [PubMed] [Google Scholar]
- 8.Kambeitz J., Kambeitz-Ilankovic L., Cabral C., Dwyer D.B., Calhoun V.D., Van Den Heuvel M.P., Falkai P., Koutsouleris N., Malchow B. Aberrant functional whole-brain network architecture in patients with schizophrenia: A meta-analysis. Schizophr. Bull. 2016;42:S13–S21. doi: 10.1093/schbul/sbv174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Craig N., Coppola R., Morihisa J.M., Weinberger D.R. Computed electroencephalographic activity mapping in schizophrenia: The resting state reconsidered. Arch. Gen. Psychiatry. 1987;44:514–517. doi: 10.1001/archpsyc.1987.01800180024003. [DOI] [PubMed] [Google Scholar]
- 10.Kam J.W., Bolbecker A.R., O’Donnell B.F., Hetrick W.P., Brenner C.A. Resting state EEG power and coherence abnormalities in bipolar disorder and schizophrenia. J. Psychiatr. Res. 2013;47:1893–1901. doi: 10.1016/j.jpsychires.2013.09.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sueyoshi K., Sumiyoshi T. Electrophysiological evidence in schizophrenia in relation to treatment response. Front. Psychiatry. 2018;9:259. doi: 10.3389/fpsyt.2018.00259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Alimardani F., Cho J.-H., Boostani R., Hwang H.-J. Classification of bipolar disorder and schizophrenia using steady-state visual evoked potential based features. IEEE Access. 2018;6:40379–40388. doi: 10.1109/ACCESS.2018.2854555. [DOI] [Google Scholar]
- 13.Tikka S.K., Singh B.K., Nizamie S.H., Garg S., Mandal S., Thakur K., Singh L.K. Artificial intelligence-based classification of schizophrenia: A high density electroencephalographic and support vector machine study. Indian J. Psychiatry. 2020;62:273. doi: 10.4103/psychiatry.IndianJPsychiatry_91_20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Koenig T., Lehmann D., Merlo M.C., Kochi K., Hell D., Koukkou M. A deviant EEG brain microstate in acute, neuroleptic-naive schizophrenics at rest. Eur. Arch. Psychiatry Clin. Neurosci. 1999;249:205–211. doi: 10.1007/s004060050088. [DOI] [PubMed] [Google Scholar]
- 15.Lehmann D., Faber P.L., Galderisi S., Herrmann W.M., Kinoshita T., Koukkou M., Mucci A., Pascual-Marqui R.D., Saito N., Wackermann J. EEG microstate duration and syntax in acute, medication-naive, first-episode schizophrenia: A multi-center study. Psychiatry Res. Neuroimaging. 2005;138:141–156. doi: 10.1016/j.pscychresns.2004.05.007. [DOI] [PubMed] [Google Scholar]
- 16.Strelets V., Faber P., Golikova J., Novototsky-Vlasov V., König T., Gianotti L., Gruzelier J., Lehmann D. Chronic schizophrenics with positive symptomatology have shortened EEG microstate durations. Clin. Neurophysiol. 2003;114:2043–2051. doi: 10.1016/S1388-2457(03)00211-6. [DOI] [PubMed] [Google Scholar]
- 17.Tomescu M.I., Rihs T.A., Roinishvili M., Karahanoglu F.I., Schneider M., Menghetti S., Van De Ville D., Brand A., Chkonia E., Eliez S. Schizophrenia patients and 22q11. 2 deletion syndrome adolescents at risk express the same deviant patterns of resting state EEG microstates: A candidate endophenotype of schizophrenia. Schizophr. Res. Cogn. 2015;2:159–165. doi: 10.1016/j.scog.2015.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Rieger K., Diaz Hernandez L., Baenninger A., Koenig T. 15 years of microstate research in schizophrenia–where are we? A meta-analysis. Front. Psychiatry. 2016;7:22. doi: 10.3389/fpsyt.2016.00022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lehmann D., Ozaki H., Pál I. EEG alpha map series: Brain micro-states by space-oriented adaptive segmentation. Electroencephalogr. Clin. Neurophysiol. 1987;67:271–288. doi: 10.1016/0013-4694(87)90025-3. [DOI] [PubMed] [Google Scholar]
- 20.Seitzman B.A., Abell M., Bartley S.C., Erickson M.A., Bolbecker A.R., Hetrick W.P. Cognitive manipulation of brain electric microstates. Neuroimage. 2017;146:533–543. doi: 10.1016/j.neuroimage.2016.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lehmann D., Strik W.K., Henggeler B., König T., Koukkou M. Brain electric microstates and momentary conscious mind states as building blocks of spontaneous thinking: I. Visual imagery and abstract thoughts. Int. J. Psychophysiol. 1998;29:1–11. doi: 10.1016/S0167-8760(97)00098-6. [DOI] [PubMed] [Google Scholar]
- 22.Britz J., Van De Ville D., Michel C.M. BOLD correlates of EEG topography reveal rapid resting-state network dynamics. Neuroimage. 2010;52:1162–1170. doi: 10.1016/j.neuroimage.2010.02.052. [DOI] [PubMed] [Google Scholar]
- 23.Zappasodi F., Perrucci M.G., Saggino A., Croce P., Mercuri P., Romanelli R., Colom R., Ebisch S.J. EEG microstates distinguish between cognitive components of fluid reasoning. Neuroimage. 2019;189:560–573. doi: 10.1016/j.neuroimage.2019.01.067. [DOI] [PubMed] [Google Scholar]
- 24.Bowers K.S. Imagination and dissociation in hypnotic responding. Int. J. Clin. Exp. Hypn. 1992;40:253–275. doi: 10.1080/00207149208409661. [DOI] [PubMed] [Google Scholar]
- 25.Milz P., Faber P.L., Lehmann D., Koenig T., Kochi K., Pascual-Marqui R.D. The functional significance of EEG microstates—Associations with modalities of thinking. Neuroimage. 2016;125:643–656. doi: 10.1016/j.neuroimage.2015.08.023. [DOI] [PubMed] [Google Scholar]
- 26.Yuan H., Zotev V., Phillips R., Drevets W.C., Bodurka J. Spatiotemporal dynamics of the brain at rest—Exploring EEG microstates as electrophysiological signatures of BOLD resting state networks. Neuroimage. 2012;60:2062–2072. doi: 10.1016/j.neuroimage.2012.02.031. [DOI] [PubMed] [Google Scholar]
- 27.Kim K., Duc N.T., Choi M., Lee B. EEG microstate features for schizophrenia classification. PLoS ONE. 2021;16:e0251842. doi: 10.1371/journal.pone.0251842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lehmann D., Pascual-Marqui R., Michel C. EEG microstates. Scholarpedia. 2009;4:7632. doi: 10.4249/scholarpedia.7632. [DOI] [Google Scholar]
- 29.Mackintosh A.J., Borgwardt S., Studerus E., Riecher-Rössler A., De Bock R., Andreou C. EEG microstate differences in medicated vs. Medication-Naïve first-episode psychosis patients. Front. Psychiatry. 2020;11:600606. doi: 10.3389/fpsyt.2020.600606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Baradits M., Bitter I., Czobor P. Multivariate patterns of EEG microstate parameters and their role in the discrimination of patients with schizophrenia from healthy controls. Psychiatry Res. 2020;288:112938. doi: 10.1016/j.psychres.2020.112938. [DOI] [PubMed] [Google Scholar]
- 31.Barros C., Silva C.A., Pinheiro A.P. Advanced EEG-based learning approaches to predict schizophrenia: Promises and pitfalls. Artif. Intell. Med. 2021;114:102039. doi: 10.1016/j.artmed.2021.102039. [DOI] [PubMed] [Google Scholar]
- 32.Khanna A., Pascual-Leone A., Michel C.M., Farzan F. Microstates in resting-state EEG: Current status and future directions. Neurosci. Biobehav. Rev. 2015;49:105–113. doi: 10.1016/j.neubiorev.2014.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Poulsen A.T., Pedroni A., Langer N., Hansen L.K. Microstate EEGlab toolbox: An introductory guide. BioRxiv. 2018:289850. doi: 10.1101/289850. [DOI] [Google Scholar]
- 34.Hu W., Zhang Z., Zhang L., Huang G., Li L., Liang Z. Microstate Detection in Naturalistic Electroencephalography Data: A Systematic Comparison of Topographical Clustering Strategies on an Emotional Database. Front. Neurosci. 2022;16:812624. doi: 10.3389/fnins.2022.812624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wu J., Chen X.-Y., Zhang H., Xiong L.-D., Lei H., Deng S.-H. Hyperparameter optimization for machine learning models based on Bayesian optimization. J. Electron. Sci. Technol. 2019;17:26–40. [Google Scholar]
- 36.Bergstra J., Bengio Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 2012;13:281–305. [Google Scholar]
- 37.Gelbart M.A., Snoek J., Adams R.P. Bayesian optimization with unknown constraints. arXiv. 20141403.5607 [Google Scholar]
- 38.Snoek J., Larochelle H., Adams R.P. Practical bayesian optimization of machine learning algorithms. Adv. Neural Inf. Process. Syst. 2012;25 doi: 10.48550/arXiv.1206.2944. [DOI] [Google Scholar]
- 39.Jones D.R. A taxonomy of global optimization methods based on response surfaces. J. Glob. Optim. 2001;21:345–383. doi: 10.1023/A:1012771025575. [DOI] [Google Scholar]
- 40.Lillo E., Mora M., Lucero B. Automated diagnosis of schizophrenia using EEG microstates and Deep Convolutional Neural Network. Expert Syst. Appl. 2022;209:118236. doi: 10.1016/j.eswa.2022.118236. [DOI] [Google Scholar]
- 41.Alves L.M., Côco K.F., de Souza M.L., Ciarelli P.M. Microstate Graphs: A Node-Link Approach to Identify Patients with Schizophrenia; Proceedings of the Brazilian Congress on Biomedical Engineering; Vitória, Brazil. 26–30 October 2020; pp. 1679–1685. [Google Scholar]
- 42.Olejarczyk E., Jernajczyk W. Graph-based analysis of brain connectivity in schizophrenia. PLoS ONE. 2017;12:e0188629. doi: 10.1371/journal.pone.0188629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Olejarczyk E., Jernajczyk W. EEG in schizophrenia. RepOD. 2017 doi: 10.18150/repod.0107441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Delorme A., Makeig S. EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods. 2004;134:9–21. doi: 10.1016/j.jneumeth.2003.10.009. [DOI] [PubMed] [Google Scholar]
- 45.Koenig T., Prichep L., Lehmann D., Sosa P.V., Braeker E., Kleinlogel H., Isenhart R., John E.R. Millisecond by millisecond, year by year: Normative EEG microstates and developmental stages. Neuroimage. 2002;16:41–48. doi: 10.1006/nimg.2002.1070. [DOI] [PubMed] [Google Scholar]
- 46.Wackermann J., Lehmann D., Michel C., Strik W. Adaptive segmentation of spontaneous EEG map series into spatially defined microstates. Int. J. Psychophysiol. 1993;14:269–283. doi: 10.1016/0167-8760(93)90041-M. [DOI] [PubMed] [Google Scholar]
- 47.Lehmann D., Skrandies W. Reference-free identification of components of checkerboard-evoked multichannel potential fields. Electroencephalogr. Clin. Neurophysiol. 1980;48:609–621. doi: 10.1016/0013-4694(80)90419-8. [DOI] [PubMed] [Google Scholar]
- 48.Michel C.M., Koenig T. EEG microstates as a tool for studying the temporal dynamics of whole-brain neuronal networks: A review. Neuroimage. 2018;180:577–593. doi: 10.1016/j.neuroimage.2017.11.062. [DOI] [PubMed] [Google Scholar]
- 49.Murray M.M., Brunet D., Michel C.M. Topographic ERP analyses: A step-by-step tutorial review. Brain Topogr. 2008;20:249–264. doi: 10.1007/s10548-008-0054-5. [DOI] [PubMed] [Google Scholar]
- 50.Hussain I., Park S.J. HealthSOS: Real-time health monitoring system for stroke prognostics. IEEE Access. 2020;8:213574–213586. doi: 10.1109/ACCESS.2020.3040437. [DOI] [Google Scholar]
- 51.Hussain I., Park S.-J. Quantitative evaluation of task-induced neurological outcome after stroke. Brain Sci. 2021;11:900. doi: 10.3390/brainsci11070900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Choi W., Park E.-Y., Jeon S., Kim C. Clinical photoacoustic imaging platforms. Biomed. Eng. Lett. 2018;8:139–155. doi: 10.1007/s13534-018-0062-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Tahernezhad-Javazm F., Azimirad V., Shoaran M. A review and experimental study on the application of classifiers and evolutionary algorithms in EEG-based brain–machine interface systems. J. Neural Eng. 2018;15:021007. doi: 10.1088/1741-2552/aa8063. [DOI] [PubMed] [Google Scholar]
- 54.Keihani A., Mohammadi A.M., Marzbani H., Nafissi S., Haidari M.R., Jafari A.H. Sparse representation of brain signals offers effective computation of cortico-muscular coupling value to predict the task-related and non-task sEMG channels: A joint hdEEG-sEMG study. PLoS ONE. 2022;17:e0270757. doi: 10.1371/journal.pone.0270757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Hussain I., Park S.J. Big-ECG: Cardiographic predictive cyber-physical system for stroke management. IEEE Access. 2021;9:123146–123164. doi: 10.1109/ACCESS.2021.3109806. [DOI] [Google Scholar]
- 56.Bull A.D. Convergence rates of efficient global optimization algorithms. J. Mach. Learn. Res. 2011;12:2879–2904. [Google Scholar]
- 57.Li L., Jamieson K., Rostamizadeh A., Gonina E., Ben-Tzur J., Hardt M., Recht B., Talwalkar A. A system for massively parallel hyperparameter tuning. Proc. Mach. Learn. Syst. 2020;2:230–246. [Google Scholar]
- 58.Noble W.S. What is a support vector machine? Nat. Biotechnol. 2006;24:1565–1567. doi: 10.1038/nbt1206-1565. [DOI] [PubMed] [Google Scholar]
- 59.Wang F., Hujjaree K., Wang X. Electroencephalographic microstates in schizophrenia and bipolar disorder. Front. Psychiatry. 2021;12:638722. doi: 10.3389/fpsyt.2021.638722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Custo A., Van De Ville D., Wells W.M., Tomescu M.I., Brunet D., Michel C.M. Electroencephalographic resting-state networks: Source localization of microstates. Brain Connect. 2017;7:671–682. doi: 10.1089/brain.2016.0476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Yang G.J., Murray J.D., Repovs G., Cole M.W., Savic A., Glasser M.F., Pittenger C., Krystal J.H., Wang X.-J., Pearlson G.D. Altered global brain signal in schizophrenia. Proc. Natl. Acad. Sci. USA. 2014;111:7438–7443. doi: 10.1073/pnas.1405289111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Kowalski J., Aleksandrowicz A., Dąbkowska M., Gawęda Ł. Neural Correlates of Aberrant Salience and Source Monitoring in Schizophrenia and At-Risk Mental States—A Systematic Review of fMRI Studies. J. Clin. Med. 2021;10:4126. doi: 10.3390/jcm10184126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.White T.P., Joseph V., Francis S.T., Liddle P.F. Aberrant salience network (bilateral insula and anterior cingulate cortex) connectivity during information processing in schizophrenia. Schizophr. Res. 2010;123:105–115. doi: 10.1016/j.schres.2010.07.020. [DOI] [PubMed] [Google Scholar]
- 64.Palaniyappan L., White T.P., Liddle P.F. The concept of salience network dysfunction in schizophrenia: From neuroimaging observations to therapeutic opportunities. Curr. Top. Med. Chem. 2012;12:2324–2338. doi: 10.2174/156802612805289881. [DOI] [PubMed] [Google Scholar]
- 65.Nishida K., Morishima Y., Yoshimura M., Isotani T., Irisawa S., Jann K., Dierks T., Strik W., Kinoshita T., Koenig T. EEG microstates associated with salience and frontoparietal networks in frontotemporal dementia, schizophrenia and Alzheimer’s disease. Clin. Neurophysiol. 2013;124:1106–1114. doi: 10.1016/j.clinph.2013.01.005. [DOI] [PubMed] [Google Scholar]
- 66.Yoshimura M., Koenig T., Irisawa S., Isotani T., Yamada K., Kikuchi M., Okugawa G., Yagyu T., Kinoshita T., Strik W. A pharmaco-EEG study on antipsychotic drugs in healthy volunteers. Psychopharmacology. 2007;191:995–1004. doi: 10.1007/s00213-007-0737-8. [DOI] [PubMed] [Google Scholar]
- 67.Fioravanti M., Bianchi V., Cinti M.E. Cognitive deficits in schizophrenia: An updated metanalysis of the scientific evidence. BMC Psychiatry. 2012;12:64. doi: 10.1186/1471-244X-12-64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Bagherzadeh S., Shahabi M.S., Shalbaf A. Detection of schizophrenia using hybrid of deep learning and brain effective connectivity image from electroencephalogram signal. Comput. Biol. Med. 2022;146:105570. doi: 10.1016/j.compbiomed.2022.105570. [DOI] [PubMed] [Google Scholar]
- 69.Khare S.K., Bajaj V., Acharya U.R. Spwvd-cnn for automated detection of schizophrenia patients using eeg signals. IEEE Trans. Instrum. Meas. 2021;70:2507409. doi: 10.1109/TIM.2021.3070608. [DOI] [Google Scholar]
- 70.Oh S.L., Vicnesh J., Ciaccio E.J., Yuvaraj R., Acharya U.R. Deep convolutional neural network model for automated diagnosis of schizophrenia using EEG signals. Appl. Sci. 2019;9:2870. doi: 10.3390/app9142870. [DOI] [Google Scholar]
- 71.Phang C.-R., Noman F., Hussain H., Ting C.-M., Ombao H. A multi-domain connectome convolutional neural network for identifying schizophrenia from EEG connectivity patterns. IEEE J. Biomed. Health Inform. 2019;24:1333–1343. doi: 10.1109/JBHI.2019.2941222. [DOI] [PubMed] [Google Scholar]
- 72.Shalbaf A., Bagherzadeh S., Maghsoudi A. Transfer learning with deep convolutional neural network for automated detection of schizophrenia from EEG signals. Phys. Eng. Sci. Med. 2020;43:1229–1239. doi: 10.1007/s13246-020-00925-9. [DOI] [PubMed] [Google Scholar]
- 73.Singh K., Singh S., Malhotra J. Spectral features based convolutional neural network for accurate and prompt identification of schizophrenic patients. Proc. Inst. Mech. Eng. Part H J. Eng. Med. 2021;235:167–184. doi: 10.1177/0954411920966937. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Publicly archived datasets analyzed during the study can be accessed via the link: https://repod.icm.edu.pl/dataset.xhtml?persistentId=doi:10.18150/repod.0107441 (accessed on 1 September 2017).