1. Scope
Executive cognitive functions like working memory determine the success or failure of a wide variety of different cognitive tasks, such as problem solving, navigation, or planning. Estimation of constructs like working memory load or memory capacity from neurophysiological or psychophysiological signals would enable adaptive systems to respond to cognitive states experienced by an operator and trigger responses designed to support task performance (e.g., by simplifying the exercises of a tutor system when the subject is overloaded Gerjets et al., 2014, or by shutting down distractions from the mobile phone). The determination of cognitive states like working memory load is also useful for automated testing/assessment or for usability evaluation. While there exists a large body of research work on neural and physiological correlates of cognitive functions like working memory activity, fewer publications deal with the application of this research with respect to single-trial detection and real-time estimation of cognitive functions in complex, realistic scenarios. Single-trial classifiers based on brain activity measurements such as electroencephalography (EEG, Kothe and Makeig, 2011; Lotte et al., 2018), functional near-infrared spectroscopy (fNIRS, Putze et al., 2014; Herff et al., 2015), physiological signals (Fairclough et al., 2005; Fairclough, 2008), or eye tracking (Putze et al., 2013) have the potential to classify affective (Koelstra et al., 2010; Heger et al., 2014; Mühl et al., 2014) or cognitive states based upon short segments of data. For this purpose, signal processing and machine learning techniques need to be developed and transferred to real-world user interfaces.
The goal of this Frontiers Research Topic was to advance the State-of-the-Art in signal-based modeling of cognitive processes. We were especially interested in research toward more complex and realistic study designs, for example collecting data in the wild or investigating the interaction between different cognitive processes or signal modalities. Bringing together many contributions in one format allowed us to look at the state of convergence or diversity regarding concepts, methods, and paradigms.
The accepted manuscripts in this research topic cover a large range of aspects of cognition, reflecting the broadness of the field and its many application domains. A dominant challenge in the research topic is the analysis of cognitive workload (or memory load) from neurological signals. This does not come as a surprise because workload is a thoroughly studied construct and workload models can be immediately exploited, e.g., for adaptive human-machine interaction. While all these manuscripts share a joint research interest, they tackle the challenge of workload modeling in different application domains, with different signals, different classification approaches, and different features. Working memory and attentional control represent two recurring themes through the collection of papers in this research theme. The most prominent modality in this research topic is EEG, drawing from both spectral as well as time-domain features. In multiple articles, EEG is complemented by other modalities: Two of them use fNIRS as a different mode to capture neural activity (two others use fNIRS as single modality); two others use eye tracking as a way to capture visual attention and one also uses physiological signals (such as heart rate and breath rate). This shows that researchers today routinely select which (combinations of) signals are most promising for a given task.
2. Highlights
In the following paragraphs, we give an overview of the contributed manuscripts and their main contributions to the field. We structure them according to the cognitive processes which are in the respective focus of interest, according to the core themes identified in the previous section.
2.1. Cognitive workload
Gateau et al. perform a study in which they investigated cognitive workload (induced by a serial memorization task in two difficulty levels) in pilots from prefrontal fNIRS. They show that binary classification of workload level was feasible both in flight simulator as well as in a real aircraft. Additionally, the authors investigated the differences between these two conditions and revealed that workload tends to be higher in the real aircraft. Workload classification performances were similar in both conditions though. Grissmann et al. investigated in a study how the simultaneous stimulation of multiple cognitive and affective states, namely workload and valence, influenced the respective neural markers in the EEG signal. They found that while correlates of workload (in frontal theta and parietal alpha power) could still be detected, these correlates were affected by negative valence. Moreover they found no evidence of typical neural correlates of affect. This has implications for many real-world applications for which different concurrent mental states can be expected. In a study with 17 healthy volunteers, Aghajani et al. investigated mental workload classification using EEG and fNIRS. Workload was induced using a letter n-back task and could be classified using Support Vector machines in EEG, fNIRS, and the combination of both, which outperformed the individual modalities.
The impact of a multimodal approach for the classification of working memory load was also investigated by Liu et al.. These authors presented a study in which they discriminated three levels of mental workload, induced by an n-back task, using EEG, fNIRS, and physiological measures, with data combined over modalities performing best. They could show that all three modalities yielded better than chance level classification. To reduce calibration time for each subject, they demonstrated how results could be improved by learning from other subjects.
Extending offline analysis to a closed-loop system, Walter et al. demonstrate in their study that cognitive workload can be estimated from EEG to automatically adapt the difficulty level in a learning environment. The prediction model was trained on 10 subjects beforehand and no user-specific calibration data was needed, highlighting the feasibility in realistic scenarios. Labonte-Lemoyne et al. investigate a closed-loop system which adapted a game of Tetris to the cognitive load of the player, as measured from EEG and facial expression, with dynamic thresholding. In their user study, they show that adaptation to measured cognitive user states is not a trivial task and neither perceived experience, nor objective performance could be improved compared to the control condition.
2.2. Memory processes
Noh et al. used ERP features from the EEG signal to discriminate between successful and unsuccessful retrieval of memorized items. In their study, they show that the presence or absence of contextual information interacts with the type of information to be encoded. Tian et al. conducted an EEG study to investigate neural markers for the discrimination of low and high performers in a repeated memory task. They found that fronto-central spectral entropy during the retention period of working memory was predictive for intra-session and inter-session differences in performance. Toppi et al. use graph-theoretic connectivity features to discriminate different phases of working memory processing in 60-channel EEG data from a study in which 17 participants performed a Sternberg task. The authors could identify topological differences between the encoding, storage, and retrieval phases and also found a relationship between the neural markers and the participants' behavior.
2.3. Attention
Riccio et al. worked with a P3-BCI and study how attentional processing differs between patients suffering from amyotrophic lateral sclerosis (ALS) and healthy users and whether and how this corresponds to differences in P3-BCI performance. The corresponding study investigates attentional processing via a rapid serial visual presentation task. Results revealed that ALS patients have degraded attentional performances which translates in smaller P3 amplitude and lower P3-BCI classification performances. Using EEG, Baldwin et al. investigated mind wandering during simulated driving. In their study, they found increased alpha activity and reduced magnitude of P3a components to auditory stimuli in periods of mind wandering. As mind wandering during driving is a potential safety threat, this study can have large impact in future adaptive interfaces. Brouwer et al. presented a study combining EEG and eye tracking to investigate target encoding during visual search. They could show that eye tracking features distinguished better between hits and misses (i.e. targets which are never reported by the participant), while EEG features differentiated better between targets and non-targets. These results highlight how the two modalities complement each other.
2.4. Methods & other cognitive processes
Verdière et al. measure engagement of pilots via fNIRS in a flight simulator. In their study, different levels of engagement are induced via manual vs. automatic landing. The authors propose connectivity-based features and show that these can be used for effective classification. Takayoshi et al. used rewards in a simple number-discrimination task to study motivation and apathy tendency. In their study, they found P2 and P3 ERPs in EEG to be modulated by reward dependence and apathy tendency, which is an important step toward an adaptive interface based on motivation. Touryan et al. used a simultaneous visual search tasks and auditory n-back to investigate the neural sources of target detection in the presence of eye movements and during simultaneous task-demand. In their study, the authors used a modified measure-projection approach to separate the neural sources into six regions of interest (ROIs). This enabled efficient target detection from EEG signals, as well as revealing the time course of EEG activity in these ROIs—thus leading to a better understanding of the underlying processes. Nicolae et al. investigate a relatively novel concept by studying depth of processing (on three levels: no processing, shallow processing, or deep processing) from event-related time-domain and frequency domain features in EEG on a single-trial basis. In their study, they demonstrate the feasibility of this approach in three domains (namely: memory, language, and visual imagination).
3. Summary
The presented articles tackle a large variety of cognitive processes, their neurophysiological correlates and their detection. This research shows great progress in basic understanding of cognition as well as for applied research, especially for the development of adaptive systems. They also highlighted the need for more research into realistic and ecologically valid application environments of cognitive state estimation. If such states could be reliably estimated in real application scenarios–outside the lab—which is not really the case so far—then this could prove very useful for multiple applications such as aeronautics, driving, education or video games, among many others. Future directions of research should also include thorough attempts to define and differentiate the modeled concepts (e.g., how can we discriminate “mental workload” from “memory load”) and find a consensus on publicly available data sets for joint evaluations. A role model here could be the field of affective computing. Available common data sets such as DEAP (Koelstra et al., 2012), fNIRS-nback (Herff et al., 2014), EEG (So et al., 2017), or EEG combined with fNIRS (Shin et al., 2017) could be a start for such a program.
Author contributions
FP coordinated the writing process and proposed the structure of the article. All authors contributed to the writing process.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank all reviewers involved in the compilation of this exciting research topic. FL received research support from the European Research Council with project BrainConquest (grant ERC-2016-STG-714567).
References
- Fairclough S. H. (2008). Fundamentals of physiological computing. Interact. Comput. 21, 133–145. 10.1016/j.intcom.2008.10.011 [DOI] [Google Scholar]
- Fairclough S. H., Venables L., Tattersall A. (2005). The influence of task demand and learning on the psychophysiological response. Int. J. Psychophysiol. 56, 171–184. 10.1016/j.ijpsycho.2004.11.003 [DOI] [PubMed] [Google Scholar]
- Gerjets P., Walter C., Rosenstiel W., Bogdan M., Zander T. O. (2014). Cognitive state monitoring and the design of adaptive instruction in digital environments: lessons learned from cognitive workload assessment using a passive brain-computer interface approach. Front. Neurosci. 8:385. 10.3389/fnins.2014.00385 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heger D., Herff C., Putze F., Mutter R., Schultz T. (2014). Continuous affective states recognition using functional near infrared spectroscopy. Brain Computer Interfaces 1, 113–125. 10.1080/2326263X.2014.912884 [DOI] [Google Scholar]
- Herff C., Fortmann O., Tse C.-Y., Cheng X., Putze F., Heger D., et al. (2015). Hybrid fNIRS-EEG based discrimination of 5 levels of memory load, in 2015 7th International IEEE/EMBS Conference on Neural Engineering (NER) (Montpellier: IEEE; ), 5–8. [Google Scholar]
- Herff C., Heger D., Fortmann O., Hennrich J., Putze F., Schultz T. (2014). Mental workload during n-back task quantified in the prefrontal cortex using fnirs. Front. Hum. Neurosci. 7:935. 10.3389/fnhum.2013.00935 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koelstra S., Mühl C., Soleymani M., Lee J.-S., Yazdani A., Ebrahimi T., et al. (2012). Deap: A database for emotion analysis; using physiological signals. IEEE Trans. Affect. Comput. 3, 18–31. 10.1109/T-AFFC.2011.15 [DOI] [Google Scholar]
- Koelstra S., Yazdani A., Soleymani M., Mühl C., Lee J.-S., Nijholt A., et al. (2010). Single trial classification of EEG and peripheral physiological signals for recognition of emotions induced by music videos, in International Conference on Brain Informatics (Toronto, ON: Springer; ), 89–100. [Google Scholar]
- Kothe C. A., Makeig S. (2011). Estimation of task workload from EEG data: new and current tools and perspectives, in Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE (Boston, MA: IEEE; ), 6547–6551. [DOI] [PubMed] [Google Scholar]
- Lotte F., Bougrain L., Cichocki A., Clerc M., Congedo M., Rakotomamonjy A., et al. (2018). A review of classification algorithms for EEG-based brain–computer interfaces: a 10 year update. J. Neural Eng. 15:031005. 10.1088/1741-2552/aab2f2 [DOI] [PubMed] [Google Scholar]
- Mühl C., Jeunet C., Lotte F. (2014). EEG-based workload estimation across affective contexts. Front. Neurosci. 8:114. 10.3389/fnins.2014.00114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Putze F., Hesslinger S., Tse C.-Y., Huang Y., Herff C., Guan C., et al. (2014). Hybrid fNIRS-EEG based classification of auditory and visual perception processes. Front. Neurosci. 8:373. 10.3389/fnins.2014.00373 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Putze F., Hild J., Kärgel R., Herff C., Redmann A., Beyerer J., et al. (2013). Locating user attention using eye tracking and EEG for spatio-temporal event selection, in Proceedings of the 2013 International Conference on Intelligent User Interfaces (Santa Monica, CA: ACM; ), 129–136. [Google Scholar]
- Shin J., von Lühmann A., Blankertz B., Kim D.-W., Jeong J., Hwang H. -J., et al. (2017). Open access dataset for EEG+NIRS single-trial classification. IEEE Trans. Neural Syst. Rehabil. Eng. 25, 1735–1745. 10.1109/TNSRE.2016.2628057 [DOI] [PubMed] [Google Scholar]
- So W. K., Wong S. W., Mak J. N., Chan R. H. (2017). An evaluation of mental workload with frontal EEG. PLoS ONE 12:e0174949. 10.1371/journal.pone.0174949 [DOI] [PMC free article] [PubMed] [Google Scholar]