. 2025 Nov 21;15:41422. doi: 10.1038/s41598-025-25363-z

Table 1.

Summary of existing studies comprising methods, datasets, and key findings.

Reference Number	Objective	Method	Dataset	Measures
Islam et al.¹¹	To explore how ML and sensor technologies enhance emotion perception and activity recognition.	Multimodal Sensor Data, DL, Temporal–Spatial Behaviour Modelling, Cognitive and Affective Computing	AAL or Smart Home Sensor Dataset	Quality of Life, Emotion Recognition Accuracy, Activity Detection
Romero and Armenta¹²	To develop a real-time model for detecting and classifying seven facial emotions in children.	Camera input, CNN on Raspberry Pi 3b+, Trained CNN model	FER‑2013	Facial Emotion Identification Rate
Asha et al.¹³	To develop a modular, voice-controlled AI assistant for seamless human–machine interaction.	NLP, Voice Commands	Pre-Trained Models	Real-Time Performance, Interaction Accuracy
Pavithra et al.¹⁴	To develop a DL-based SER system for accurate and reliable emotion detection from speech.	Audio Feature Extraction, DL, RNN	Labelled Emotional Speech Samples	Accuracy, Reliability
Brilli et al.¹⁵	To develop AIris, an AI-powered wearable device for visually impaired users.	Object Recognition, Scene Interpretation, NLP, Real-Time Auditory Feedback	Real-World Visual Data	Accuracy, Usability
Bertacchini et al.¹⁶	To explore the use of a Pepper robot integrated with ChatGPT.	Pepper–ChatGPT Integration, Simulated Interaction Scenarios, Social Robotics Dialogue	Simulated ASD Interaction Scenarios	Feasibility, Acceptability, Effectiveness
Reddy et al.¹⁷	To develop an assistive system to support individuals with paralyzed hands.	HGR, Real-time Voice Output, Sensor-based Input Processing	Custom Gesture Dataset	Accuracy, Responsiveness
Begum et al.¹⁸	To develop an end-to-end system to aid communication for hearing-impaired individuals.	Quantized YOLOv4-Tiny Detection, Character to Text Generation, LSTM-based Text Model	BdSL 49 Dataset	mAP, Accuracy
Kandula et al.¹⁹	To bridge communication gaps and memorize complex sign systems.	Webcam hand gesture capture, Gesture recording pipeline, Model training and testing	Custom Webcam Gesture Data	Accuracy
Di Luzio, Rosato, and Panella²⁰	To enhance emotion classification via video by optimizing facial landmark inputs.	Facial Landmarks Detection, Binary DNN, Improved Integrated Gradient	Facial Video Dataset	Higher Accuracy, Reduced Cost
Slade et al.²¹	To enhance SER accuracy and robustness.	AST with CSO, Optimizable 1D‑CNN, BiLSTM, CNN‑BiLSTM with Attention, NTKM	EMO‑DB, SAVEE, TESS	Accuracy, mAP
Neeraja et al.²²	To develop an accurate and real-time system for detecting driver somnolence, thereby improving road safety.	CV, Physiological Signal Monitoring, ML	Driver Drowsiness Datasets	Accuracy, Real-Time Performance
Ali and Hughes²³	To develop a model for emotion recognition using self-supervised pretraining techniques.	UBVMT, Self-supervised pretraining, masked autoencoding, contrastive modelling, transformer architecture	CMU-MOSEI, Public Biosensor Datasets	Accuracy, Memory Efficiency
Paul et al.²⁴	To develop a real-time attendance system for enhanced accuracy and practical deployment.	ResNet-50 Face Recognition, ViT Emotion Detection, Dual-Path Architecture, Web Integration	Custom Real-Time Dataset	Accuracy, AUC-ROC Score
Choi, Zhang, and Watkins²⁵	To enhance audio classification for improved performance.	SSAST, Multi-Layer Feature Fusion, Patch-Wise Pooling, Self-Supervised Learning, Dual Representation	CREMA-D, TESS, RAVDESS, Speech Emotion Classification, Isolated Urban Events, CornellBirdCall	Accuracy Rates
Wang and Chai²⁶	To optimize personalized learning paths and learning efficiency.	LSTM Behaviour Capture, Transformer Self-Attention, DL Integration	Learner Behaviour Sequences	Knowledge Mastery, Learning Time, Satisfaction
Ramani et al.²⁷	To accurately detect human emotions without manual feature engineering.	Deep Bidirectional LSTM, Multimodal Sensor Fusion, Iterative DL	On-Body, Ambient, Geographical Sensors	Accuracy, Effectiveness
Prithi and Tamizharasi²⁸	To enhance CRM for accurate emotion analysis.	FFDMLC, Feature Fusion, DL, COOT	CK+, FER2013	Recognition Rate, Accuracy
Selvaraju et al.²⁹	To develop a real-time ISL gesture-to-text subtitle system for individuals with hearing and speaking impairments.	CNN, YOLOv5, HMM, WebRTC	ISL Gesture Video Dataset	Accuracy, Latency, Usability
Ghadami, Taheri, and Meghdari³⁰	To develop a transformer-based DL system for improved communication and learning.	Early And Late Fusion Transformers, GS, Keypoint Feature Extraction, Multi-Task Learning	101 Iranian Sign Language Words	Accuracy, Real-Time Feedback
Khanum et al.³¹	To develop an IoT-based wearable device to enhance women’s safety, including offline functionality for evidence preservation.	IoT Wearable Device, Real-Time Audio Tracking, Location Tracking, Emergency Alert System	Not Specified/Real-Time User Data	Response Time, Alert Accuracy
Siju and Selvam³²	To develop a system for accurate and efficient SLR compatible with edge devices.	Google Mediapipe Landmarks, DNN, Tensorflow Training, Live Webcam Testing	Hand Gesture Images (Peace, Okay, Stop)	Accuracy, Latency
Naik et al.³³	To develop a robust multimodal real-time emotion recognition system.	Text: BERT + TF-IDF, Audio: CNN + Augmentation, Video: CNN + OpenCV	Four Kaggle Datasets (audio, video, text)	Audio and Video Accuracy of 99.44% and 97.66%, Audio and Video Validation of 94.71% and 65.38%
Liu et al.³⁴	To enhance sentiment analysis accuracy for short texts.	TF-IDF, CSO, SVM, AdaBoost Soft Voting	Six Real Polar Sentiment Analysis Datasets	Accuracy Improvement: >4.5%
Filahi et al.³⁵	To improve e-commerce decision-making using IoT data and ML models.	LR, NB, SVM, RF, AdaBoosting, GRU, LSTM	Customer Behaviour and Preference Data Collected Via IoT Devices	Accuracy of 88%, F1-Score of 0.927, Precision of 0.908, Recall-0 of 0.569 and Recall-1 of 0.947
Sandulescu et al.³⁶	To develop an AI-driven healthcare platform.	IoMT Sensor Integration, AI Predictive Models, Emotion Detection Algorithm	Patient Sensor Data and Voice Recordings	Early Symptom Detection, Disease Progression Tracking
Muhammad et al.³⁷	To enhance emotion detection accuracy on imbalanced and limited data.	DeBERTa-v3-large + CNN, Electra + CNN, XLNet-base, RoBERTa + CNN, T5-base, Synonym Replacement Augmentation	ISEAR	Accuracy (best) of 94.94%, Accuracy (others) of 93%/69%, Improved Precision & Recall
Thiab, Alawneh, and Mohammad³⁸	To evaluate and enhance emotion classification in textual conversations.	RNN and Transformer-based Models, Ensemble via Majority Voting	SemEval 2019 Task 3 (EmotionX)	Transformer F1-Score of 75.55%, RNN F1-Score of 67.03%, Ensemble F1-Score of 77.07%
Kumar, Khan, and Choi³⁹	To develop an accurate mental health detection model from social media text.	RoBERTa + Adapter Layers, BiLSTM, AM	Filtered GoEmotions Dataset	Accuracy (binary): 92%, Accuracy (multiclass): 88%
Geethanjali and Valarmathi⁴⁰	To improve multimodal sentiment analysis during the COVID-19 pandemic.	CNN, LSTM, IChOA, Feature Fusion Strategy	GeoCoV19 Dataset	Accuracy of 97.8%
Arbaizar et al.⁴¹	To enable real-time, objective monitoring and prediction of psychiatric patients’ emotional states.	HMM, Transformer DNN, Time-Series Forecasting, Classification Algorithms	Passive and Self-Reported Data from the Evidence-Based Behaviour (eB2) App	Emotional state accuracy: 0.93, ROC AUC (valence): 0.98, ROC AUC (1-day prediction): 0.87, Accuracy (suicidal ideation): 0.9, ROC AUC (suicidal ideation): 0.77
Kohneh Shahri, Afshar Kazemi, and Pourebrahimi⁴²	To develop and evaluate a comprehensive sentiment analysis model for improved accuracy and speed.	Image, Sound and text processing, CV, NLP	Social Network Multimedia Data	Accuracy (High), Speed (Fast)