Skip to main content
. 2024 Oct 18;10(20):e39592. doi: 10.1016/j.heliyon.2024.e39592

Table 1.

Image based technique.

Ref Year Experimental
Setup
Number of Subjects Driving Duration Type of Sensor Physical
Measurand
Detection Algorithm Problem Solved Limitations
[24] 2024 Public data set Web Camera Fatigue CNN The Cyber-Physical Systems are used to facilitate the real-time monitoring and analysis of the driving situation It has a limited data set that will affect the model generality. The model must be validated with real-world challenges, such as lighting conditions.
[25] 2024 Driving simulator Multiple drivers Infrared cameras Fatigue Transfer learning with YOLOv8 The study proposed a fatigue-driving detection model that blends transfer learning methods with the YOLOv8n architecture. The accuracy of fatigue identification can be impacted by individual variance because the model used in this work depends on factors like yawning and eye closure, which may not be relevant to all drivers.
[26] 2024 37 Subjects+
Public data set
Web Camera Drowsiness Deep Residual
Networks (ResNet)
This study developed a deep learning architecture integrating residual and feature pyramid networks (FPN) to identify driver drowsiness. Image quality, where the system's effectiveness relies on the quality of the input images. Additionally, the sensitivity of the camera position is essential.
[27] 2024 Public data set Web Camera Drowsiness CNN and LSTM The study combined IoT technology DL to develop an effective and unobtrusive drowsiness detection system. This integration allows real-time monitoring and notifications, which are critical for avoiding accidents. Standard binarization algorithms struggle with dark skin tones, and limited variability in training data can affect their effectiveness. Testing conditions restricting rapid head movements may not accurately reflect real-world driving situations, impacting drowsiness detection.
[28] 2024 YAWDD dataset Web Camera Drowsiness Deep Transfer
Learning
The proposed method is intended to reduce false detections by employing ensemble learning and deep transfer learning models to ensure that only genuine drowsiness states are detected. The accuracy of the DDD system depends on high-quality image processing. Factors like wearing sunglasses, changes in lighting, and camera-to-driver distance affect detection performance.
[29] 2024 Driving simulator Public dataset Web Camera Drowsiness lightweight convolutional
neural
networks
The suggested method employed lightweight CNN to ensure the system performed seamlessly on embedded devices such as Jetson Nano. This is essential for real-time applications. The dataset has diverse photos; however, black-and-white images and varied lighting in the YALEB dataset can be challenging for lightweight models. The suggested network is lightweight but slightly more computationally complex than other networks.
[30] 2024 Driving simulator 37 subjects and two more
online datasets
Web Camera Drowsiness two-branch
multi-head
attention architecture (TB-MHA)
The research presented a method for improving discrimination between various classes of samples (drowsy vs. awake) using a spatial filtering technique based on the Common Spatial Pattern (CSP) algorithm. The technique detected fatigue mainly using facial features like landmarks and local areas. Still, its performance may be limited if facial visibility is compromised, such as when a driver wears sunglasses or has a partially concealed face.
[31] 2023 Actual driving 25 Subjects RGB Camera Drowsiness Embedded system, edge computing, cloud computing modules The study described an IoT-based automated approach for detecting Driver drowsiness This framework combined several components, such as an embedded platform, edge computing, and cloud computing, to give a solution for drowsiness detection and monitoring. Reflections from drivers' spectacles can cause misclassifications and Reduced detection accuracy. Changing environmental conditions can also impact the system's performance, and there are potential data privacy risks with the cloud-hosted database for real-time monitoring.
[32] 2023 Driving simulator Web Camera Drowsiness CNN model The study proposed a method to detect driver fatigue by analyzing eye and mouth movements using a camera and a Convolutional Neural Network (CNN). The current methods may fail at night or in poor conditions. Additionally, the driver's head posture can affect detection accuracy. The proposed model needs more real-world testing to assess the system's efficacy.
[33] 2023 Simulation Multiple public data sets Web Camera Drowsiness CNN model This study examined the difficulty of achieving high precision on low-cost embedded devices like the Nvidia Jetson Nano. This makes the technology more practical for car use, improving safety. The suggested method mainly detected signs of fatigue via the eyes and face. It did not look at other signs of fatigue, like posture and overall behavior, which could give a complete picture of a driver's condition.
[34] 2023 Actual driving 30 subjects Web Camera Yawning detection,
fatigue detection
3D deep learning network and Bi-directional long short-term memory (LSTM) This research addresses a few critical issues in yawning detection, particularly driver weariness, such as head posture variability, redundant frames, and lighting differences. The detection accuracy depends on video data quality, including resolution, frame rate, and camera angle. It may also disregard minor yawning gestures between frames, resulting in false negatives.
[35] 2023 Driving simulator 1 h Event camera Yawning detection lightweight deep
learning models
It introduced event cameras, a neuromorphic sensing technology, for monitoring and verifying seatbelt fastening and unfastening. This approach used the unique capabilities of event cameras to give real-time analysis of dynamic behaviors that regular RGB or NIR cameras may not adequately capture. The deployment of the proposed models within the limits of embedded hardware commonly seen in DMS is recognized as a difficulty. This constraint could impact the practical application of the system in actual automobiles, where computer resources may be limited.
[36] 2023 Simulator Web Camera Drowsiness
And Fatigue
Threshold and DL based The critical issue is the requirement for real-time monitoring of driver drowsiness. The suggested method provides instant feedback based on the driver's eye state, which is critical for prompt interventions to prevent accidents caused by drowsiness. Drivers wearing sunglasses impair the model's effectiveness by preventing visual landmark identification and eye blink measurement, which are crucial for detecting fatigue. Facial landmark detection might also mislead the tracking driver if other objects block the face.
[37] 2022 Simulator Multiple public data sets Web Camera Drowsiness Various CNN models The study introduced a real-time driver disturbance monitoring approach based on Convolutional Neural Networks (CNN). This technology is designed to assess driver drowsiness and fatigue, two crucial elements in road safety. Implementing the proposed system in real-time applications may provide hurdles, notably in processing speed and computational requirements. The demand for high-performance hardware may limit technology accessibility in regular vehicles.
[38] 2022 Simulator 5 subjects Camera Fatigue Machine learning and Resnet-50 models This research aimed to develop a system for detecting driver fatigue by evaluating changes in facial features, particularly the eyes and mouth, in real-time. The results showed that although some classifiers worked well, others did not. This variability indicates the system's performance may be inconsistent across different subjects or conditions.
[39] 2022 Actual driving 13 Subjects 1.5 h Web Camera Sleepiness generic deep feature
extraction module
The authors proposed personalized methods for identifying fatigue in drivers. They demonstrated the necessity of personalizing, as different people show signs of fatigue. The authors acknowledged the limitations of their dataset, which was utilized to design and test the sleepiness detection system. They proposed that future research include independent datasets to validate these findings further.
[40] 2022 public
data set
Web Camera Drowsiness two-stream spatial–
temporal graph convolutional
network (2s-STGCN)
This paper introduces the twin-stream spatial-temporal graph convolutional network (2s-STGCN). Many current driver drowsiness detection technologies face difficulties differentiating between various driving states, such as chatting, yawning, and blinking. This strategy sets out to rectify that. The performance of the 2s-STGCN may deteriorate in challenging driving scenarios. Variations in lighting, shadows, and occlusions can all substantially impact the feature extraction process, potentially leading to misclassifications of driver states.
[41] 2022 public
data set
Web Camera Fatigue multigranularity
Deep Convolutional
Model
The research proposed the Multi-Granularity Network (MEN), which used cues from partial face regions (such as the eyes, mouth, and glabella) to improve feature representation. This method addressed pose variability and enhanced the resilience of the feature extraction process. The research acknowledged that the RF-DCM model cannot capture temporal information adequately. This limitation may impair the model's capacity to assess long-term relationships in time series data, critical for effectively diagnosing fatigue states over lengthy durations.
[42] 2022 public
data set
Web Camera Fatigue (CNN) This paper presented a systematic three-phase detection approach incorporating facial feature extraction, the Viola-Jones algorithm for identifying face traits, skin segmentation to ensure lighting invariance, and template matching for yawning and eye tracking. This study revealed that drivers who wear glasses exhibit less precision in identifying facial characteristics and eye movements. The system's precision was inferior under low light conditions compared to daylight.
[43] 2021 synthetic event-based
dataset
Neuromorphic vision
sensors
Fatigue and drowsiness
detection
(CNN) The authors aimed to improve eye blink identification and analysis by developing a synthetic event-based dataset with accurate bounding box annotations, taking advantage of event cameras' high temporal resolution to improve overall driver safety and monitoring capabilities. One crucial issue is the limited availability of event-based data, making it difficult to apply machine learning algorithms to event cameras successfully. The lack of data makes conducting thorough and statistically meaningful testing of the proposed approaches difficult.
[44] 2021 Simulator 8 Subjects Web Camera Fatigue Detection fuzzy neural network This study used an enhanced face identification approach to identify the driver's face in images collected by a CCD camera. It then used an ensemble of regression trees to identify face feature points, specifically the eyes and mouth. The variation in recognition rates depending on head movement is one major limitation. Another significant issue with real-time face recognition systems is how varying lighting conditions or partially hidden facial features affect the system's effectiveness.
[45] 2021 Multiple Public data sets Web Camera Fatigue CNN In order to improve real-time performance and detection accuracy on edge computing devices, this research suggested a driver fatigue detection system that uses an optimized face alignment algorithm and a convolutional neural network. The system's facial alignment performance may degrade under extreme conditions, such as when the driver is wearing sunglasses or in low-light situations. This can diminish detection accuracy, which is critical for real-world applications.
[46] 2021 Actual driving 10 Subjects 1 h CCD Camera Fatigue Lightweight neural network model In the context of vehicle-mounted embedded sensors with constrained memory and processing power, this paper attempted to address the problem of real-time driver fatigue detection using deep learning techniques on face video data. Although the study claims to reach a detection speed of 27 FPS, more is needed in some real-time applications, particularly high-speed driving scenarios. Variations in the processing power of different vehicle-mounted embedded devices may alter the model's performance.
[47] 2021 Simulator Public
Data Set
HD camera Yawning and Fatigue
Detection
3D convolutional and BiLSTM networks This paper presents a novel keyframe selection technique that reduces computational expenses and eliminates unnecessary frames from frame sequences. Achieving rapid detection of the most relevant frames is crucial for real-time processing. The effect of low image quality on detection accuracy is one significant limitation. Additionally, significant camera vibrations might result in missing or false detections since face features are not always captured accurately.
[48] 2020 Simulator Public data set Web Camera Drowsiness A depth wise separable
3D convolutions
The study described a real-time method for detecting driver drowsiness using mobile platforms. The research emphasizes the usefulness of depth-wise separable 3D convolutions, which enable spatial and temporal data integration. The method demanded significant processing capacity, notably for inference of 10-frame sequences. This high demand may limit the viability of real-time applications on lower-powered mobile devices. Because of these computational limits, the report suggests that other ways may be faster right now.
[49] 2020 Simulator Public data set Web Camera Fatigue Multi-task
CNN model
The study suggested a Multi-task ConNN model using eye and mouth characteristics to measure driver fatigue. The model used the percentage of eye closure (PERCLOS) and the frequency of yawning as essential markers to determine the fatigue level of drivers. The study used a constant frequency range and a set number of frames for analysis. This rigidity may limit the model's flexibility to diverse driving situations or surroundings, possibly affecting its real-world performance.
[50] 2020 Simulator Public data set Web Camera Fatigue a deep cascaded
convolutional
neural network (DCCNN
The research developed a Real-time and Robust Detection System (R2DS), to improve the accuracy, speed, and robustness of fatigue detection. This framework contains three main modules: facial feature extraction, ocular area extraction, and fatigue detection. While the DCCNN is intended to increase detection accuracy, the study recognizes that methods based on artificial neural networks (ANN) frequently exhibit inadequate real-time performance due to their complicated structures and the requirement for considerable training data.
[51] 2019 Simulator Public data sets surveillance
digital
camera
Fatigue and Drowsiness hybrid of CNN
and (LSTM)
The research proposed the Eye Feature Vector (EFV) and Mouth Feature Vector (MFV) as evaluation parameters for determining the driver's eye and mouth states. These vectors are critical for estimating fatigue levels based on visual cues like eye closure and yawning. The model detected fatigue primarily through facial indicators such as eye and mouth states. This concentration may neglect other signs of fatigue, such as physiological signals or behavioral patterns, which could provide a more comprehensive knowledge of a driver's status.
[52] 2019 Simulator Public data sets Web Camera Drowsiness A deep cascaded
Convolutional neural network
The research described a condition-adaptive representation learning method for detecting driver drowsiness under various driving scenarios, including varied times of day and changes in the driver's appearance. The proposed model requires a lot of labeled training data to cover various driver circumstances and scenarios appropriately. This can be a restriction when gathering such extensive data is unfeasible.
[53] 2019 Simulator Public data sets Web Camera Drowsiness convolutional
control gate based
recurrent neural network (ConvCGRNN)
The article developed a deep neural network (DNN) that can identify driver drowsiness in real time using video data. The network used CNN and ConvCGRNN, as well as a voting layer, to assess temporal relationships in facial data taken from videos. The model's performance varies depending on the scenario, such as when drivers wear sunglasses or spectacles, which can obscure face features. Furthermore, the system depended substantially on consistent and precise facial tracking.
[54] 2019 Simulator Public data sets Web Camera Drowsiness Condition adaptive representation
learning framework
based on CNN
This approach combined four major models: spatiotemporal representation learning, scene condition understanding, feature fusion, and sleepiness detection. This system is intended to handle varied driving conditions in an adaptable manner and has been validated using the NTHU drowsy driver detection video dataset. A significant limitation of this work is that the framework requires a large amount of labeled training data to handle different driving situations and scenarios adequately. Collecting such extensive data can be challenging.
[55] 2019 Simulator 10 subjects Web Camera Fatigue Multiple Convolutional
Neural Networks (CNN)-KCF (MC-KCF)
The system used a detection algorithm based on 68 critical facial locations, allowing for exact identification of facial regions crucial to drowsiness detection. This improved the system's accuracy in detecting indicators of weariness. Drivers' heights can influence the location of their faces in the camera frame, leading to detection accuracy discrepancies. This diversity can make it challenging to ensure consistent performance for a broad group of users.
[56] 2018 Simulator 9 Subjects Web Camera Fatigue and drowsiness
detection
Transfer learning classifier based on fast
wavelet transforms
and separator wavelet
networks
The first essential contribution is the development of an eyes classifier that uses two transfer learning classifier designs. The second contribution is designing a fuzzy logic decision assistance system that divides driver vigilance into five categories. Although the system is intended for real-time processing, the efficiency of the methods utilized (such as the Viola and Jones algorithm) can vary depending on the computational resources available. In resource-constrained contexts, this may impact system performance and dependability.