Scientific Reports. 2024 Nov 4;14:26591. doi: 10.1038/s41598-024-76286-0

Ensemble based high performance deep learning models for fake news detection

Mohammed E. Almandouh 1,2, Mohammed F. Alrahmawy 2,3,4, Mohamed Eisa 1, Mohamed Elhoseny 2,5, A. S. Tolba 2,6
PMCID: PMC11535404  PMID: 39496680

Abstract

Social media has emerged as a dominant platform where individuals freely share opinions and communicate globally. Its role in disseminating news worldwide is significant due to its easy accessibility. However, the growing use of these platforms carries a serious risk of misleading people. Our research investigates machine learning, deep learning, and ensemble learning techniques for Arabic fake news detection. We integrated FastText word embeddings with various machine learning and deep learning methods. We then leveraged advanced transformer-based models, including BERT, XLNet, and RoBERTa, optimizing their performance through careful hyperparameter tuning. The research methodology involves utilizing two Arabic news article datasets, AFND and ARABICFAKETWEETS, each categorized into fake and real subsets, and applying comprehensive preprocessing techniques to the text data. Four hybrid deep learning models are presented: CNN-LSTM, RNN-CNN, RNN-LSTM, and Bi-GRU-Bi-LSTM. The Bi-GRU-Bi-LSTM model demonstrated superior performance regarding the F1 score, accuracy, and loss metrics. The precision, recall, F1 score, and accuracy of the hybrid Bi-GRU-Bi-LSTM model are 0.97, 0.97, 0.98, and 0.98 on the AFND dataset and 0.98, 0.98, 0.99, and 0.99 on the ARABICFAKETWEETS dataset, respectively. The study’s primary conclusion is that the Bi-GRU-Bi-LSTM model outperforms the other models by a significant margin when spotting fake news in Arabic. It aids the global fight against false information by setting the stage for future research to expand fake news detection to multiple languages.

Keywords: Ensemble Learning, Deep Learning, Fake News Detection, FastText

Subject terms: Mathematics and computing, Computer science

Introduction

Digital platforms such as social media, online forums, and websites have overtaken traditional media as the primary sources of information 1. This shift signifies a substantial transformation in how we seek out and interact with information 2. Social media’s appeal lies in its unrestricted expression and immediate access to information, making it particularly popular among younger demographic groups. However, the ease of engagement and sharing on these platforms has also enabled the rapid online spread of misinformation, including fake news. The harmful effects of internet-based fake news extend beyond simply misleading audiences 3.

Intentionally fabricated and demonstrably false information, commonly known as fake news, presents a severe risk to the credibility of democratic systems. It undermines public trust in governmental institutions and significantly affects various societal sectors, including elections, economic conditions, and public perceptions of critical issues such as wars 4,5.

This research is motivated by the urgent need to fight fake news, which is becoming increasingly common and influential on social media. Given its prevalence, developing efficient detection systems is essential to maintaining trust in online information.

Existing research indicates that fake news and authentic news exhibit notable superficial differences 6. Fake news frequently leverages heightened emotional appeals and subjectivity 7, often incorporating phrases like “urgent notice” or “sharing quickly” to create a sense of urgency 8. Additionally, images linked to fake news are usually of lower quality but are designed to be visually striking 9. On the other hand, genuine news tends to be objective, more detailed, and more often accompanied by higher-quality visual imagery. Current multimodal approaches 10–12, which typically apply convolutional and recurrent neural networks (CNNs and RNNs), analyze these superficial characteristics by examining textual and visual components. Reports indicate that fake news attracts more attention on social media than factual news, a trend observable across major platforms 13,14. The high prevalence of fake news on social media presents a substantial challenge to the credibility of online information compared to other forms of misinformation. This pervasive issue underscores the urgent need to develop effective strategies to combat fake news. As data volumes grow, the rapid and efficient retrieval of relevant information becomes increasingly important. This highlights the critical role of computational linguistic techniques 15. Machine learning and deep learning methods are particularly crucial in this context, providing advanced tools to detect and counter misinformation effectively.

There is currently much interest in fake news identification because machine learning (ML), deep learning (DL), and natural language processing (NLP) have advanced considerably in recent years, leading to the development of numerous innovative research methods 16,17.

E. Hashmi et al. raised doubts about the accuracy of the information that voters are exposed to during critical political events in their paper "Advancing Fake News Detection: Hybrid Deep Learning" 18. They emphasized that, during those events, more than 19 million bot accounts were created to disseminate erroneous information about Trump and Clinton, significantly increasing the amount of misinformation the public was exposed to and its impact 19,20. Furthermore, reports indicate that fake news frequently receives greater attention on social media platforms than true news, a pattern visible on several well-known sites 21,22. The pervasive spread of fake news on social media threatens the reliability of online information more seriously than other types of disinformation. Because fake news is so prevalent, creating efficient counterstrategies is becoming increasingly important.

The rapid and efficient retrieval of pertinent information is becoming increasingly important as the quantity of data continues to increase. This emphasizes how important it is to use computational linguistic approaches 23. Machine learning and deep learning techniques are essential because they provide advanced tools for effectively identifying and refuting misinformation.

Fake news identification has garnered increased attention due to recent developments in machine learning, deep learning, and NLP, which have given rise to many creative research methodologies 24. Because so much content is available online on a wide range of topics, the task is complex, so researchers focus on creating automated techniques to detect fake news; the integrity of internet information depends on this technological advancement 25. Identifying fake news remains a significant technological challenge for multiple reasons, necessitating sophisticated approaches to guarantee the authenticity and dependability of the information shared on the internet.

This paper employs ML and DL techniques, including cutting-edge transformer-based models, to improve fake news detection. We conduct a comprehensive and detailed analysis by incorporating FastText word embeddings for efficient text data processing and applying these methods to publicly available datasets. This approach is vital for accurately identifying misinformation in online media.

The contributions of this paper are as follows:

  1. It introduces a multilayer preprocessing framework applied to two text datasets, each divided into fake news and real news subsets.

  2. The data preprocessing phase incorporates NLP techniques to prepare the data for use in word embedding.

  3. We integrated both supervised and unsupervised FastText embeddings into machine learning (ML) models, including decision tree (D.T), support vector machine (SVM), random forest (R.F), logistic regression (L.R), extreme gradient boosting (XGBoost), and CatBoost classifiers. We also devised a method to handle out-of-vocabulary (OOV) words using FastText embeddings so that our models can process phrases that have never been seen before, ensuring thorough coverage of the text data. Furthermore, we carefully optimized regularization strategies and hyperparameters across our machine-learning models to maximize performance, avoid overfitting, and generate reliable, broadly applicable findings.

  4. In addition, we used FastText embeddings in DL-based models such as long short-term memory (LSTM), gated recurrent unit (GRU), and convolutional neural network (CNN) to efficiently capture intricate contextual information and sequential dependencies inside the text data. This work also employed the most recent transformer-based models for text categorization, namely the autoregressive transformer XLNet with hyperparameter tuning, Robustly Optimized BERT (RoBERTa), and Bidirectional Encoder Representations from Transformers (BERT). We used these transformers because they have a track record of capturing complex contextual information and long-range correlations in text data, making them ideal for detecting fake news.

  5. To enhance the interpretability and accuracy of our results, we present a hybrid model (Bi-LSTM–Bi-GRU) that integrates multiple deep learning models and achieves higher classification accuracy than the other models.

To achieve our goals for this research, we used several machine learning and deep learning techniques, including state-of-the-art transformer-based models, and integrated FastText word embeddings with these models for effective text data processing. We evaluated these models by conducting experiments on two publicly available Arabic datasets, then compared and analyzed the results to find the best model for detecting fake Arabic news. This approach is essential for recognizing false information in online media and ultimately supports the distribution of more dependable and trustworthy information.

The organization of this paper is as follows: Section 2 reviews related research. Section 3 explains the proposed model architecture and methodology. Section 4 presents the results and insights, compares models within categories such as machine learning, deep learning, transformer models, ensemble learning, and hybrid models, and contrasts our results with those of other studies using the same dataset. The conclusions are presented in Section 5.

Related work

In this section, we review the current literature on identifying fake news. Numerous studies have investigated various approaches, from transformer-based to conventional ML and DL techniques.

Han et al. 26 highlighted the robust modeling capabilities of pre-trained language models, exemplified by bidirectional encoder representations from transformers (BERT). After extensive pretraining on corpora, these models have acquired significant syntactic and commonsense knowledge. Verma et al. 27 introduced word embedding over linguistic features for fake news detection (WELFake), a novel two-phase benchmark model that authenticates news content by employing machine learning classification with word embedding over linguistic features. This comprehensive approach demonstrates a notable enhancement in fake news detection, with the WELFake model achieving a peak accuracy of 96.73%. This performance surpasses traditional methods such as BERT and CNN models by up to 4.25%, underscoring the effectiveness of integrating linguistic features with advanced embedding techniques. Additionally, the study contributes a novel dataset comprising approximately 72,000 articles, thereby bolstering the model’s reliability and generalizability across diverse datasets.

Shu et al. 28 introduced FakeNewsNet, a repository to facilitate research on fake news detection in social media. This repository includes two detailed datasets rich in news content, social context, and spatiotemporal information, aiming to overcome the limitations of existing datasets. The comprehensive Analysis of FakeNewsNet illuminates its potential applications in detecting fake news, addressing the challenges posed by the scarcity of multifaceted fake news datasets. This initiative represents a significant stride toward improving the accuracy and effectiveness of fake news detection mechanisms.

Truică and Apostol 29 introduced an innovative methodology utilizing document embeddings to develop multiple models that accurately classify news articles as trustworthy or fake. Their evaluation encompassed a range of machine learning (ML) models, including naive Bayes (N.B.) and gradient boosting, deep learning (DL) models like long short-term memory (LSTM) and gated recurrent unit (GRU), as well as three transformer-based models: pre-trained BERT 29, bidirectional and autoregressive transformers (BART) 30, and RoBERTa 31.

These evaluations were conducted across five datasets containing fake news articles, employing various word embeddings such as TF-IDF, Word2Vec 32, and FastText 33. In a study by Nanade and Kumar 34, a transformer-based method utilizing the BERT base model for Twitter fake news detection achieved an accuracy of 77.29%. Verma et al. 35 introduced a binary classification framework for fake news detection that combines bidirectional encoder representations from transformers (BERT) to capture global text semantics and convolutional neural networks (CNN) to leverage N-gram features for local text semantics. Their experiments were conducted on four publicly available datasets. Guo et al. 36 proposed a similar approach using DL-based models and a pre-trained transformer-based BERT model for the same purpose. The results from both studies offer valuable insights into the effectiveness of these methods in fake news detection.

The study by Praseed et al. 37 focused on leveraging ensemble techniques with pre-trained transformer models such as XLM-RoBERTa 38, mBERT, and ELECTRA 39 to combat the proliferation of fake news, specifically in Hindi. The authors’ fine-tuning process tailored these models to discern misleading information across the linguistic nuances of Hindi effectively. This approach was validated on the CONSTRAINT dataset 40, which contains more than 8000 online posts delineated between nonhostile and hostile content, offering a nuanced understanding of the landscape of misinformation.

Wu et al. 41 introduced graph-based semantic structure mining with contrastive learning (GETRAL), a groundbreaking framework for semantic structure mining based on graphs coupled with contrastive learning. This innovation significantly enhances the identification of evidence-based fake news, outperforming existing models notably on the Snopes 42 and PolitiFact 43 datasets. By representing claims and evidence as graph-structured data, GETRAL effectively captures intricate semantic relationships, overcoming the limitations of previous methodologies. Graph structure learning reduces information redundancy and improves representation learning via supervised contrastive learning with adversarial augmented examples. On the Snopes dataset, GETRAL achieves an F1-Macro score of 80.61% and an F1-Micro score of 85.12%. On PolitiFact, it records an F1-Macro of 69.53% and an F1-Micro of 69.81%, demonstrating its superior performance in tackling the challenges of fake news detection by integrating advanced techniques for more precise and interpretable analysis.

Soga et al. 44 focused on detecting fake news on social media by analyzing stance similarity and employing graph neural networks (GNNs). Their approach revolves around assessing the opinion similarity between users based on their stances toward news articles and interactions within user posts. Leveraging graph transformer networks, their method effectively extracts both global structural information and interactions of similar stances, addressing stance analysis challenges in microblogs while mitigating the impact of poorly represented stance features.

To lessen the effects of misinformation, especially in light of Russia’s aggression against Ukraine, Pilkevych et al. 48 investigated the detection of fake news using GNNs. In a thorough analysis, they stress the use of GNNs in online media monitoring to quickly identify and evaluate fake news, suggesting that GNNs are powerful tools for the automated identification of damaging information.

Their method uses knowledge graphs (K.G.s) to map relationships and recognize entities in textual information, focusing on identifying indicators of harmful psychological influence. Among the models tested, GraphSAGE performed the best, attaining impressive accuracy scores of 98.01% on the Gossipcop dataset and 89.78% on the Politifact dataset when trained on data exhibiting indicators of detrimental psychological impact.

This study emphasizes how important it is to use advanced machine-learning approaches to combat misinformation. It also shows how GNNs may be used to improve the precision and efficacy of fake news detection systems.

Ying et al. 49 proposed enhanced representation from knowledge integration (ERNIE), a knowledge-enhanced semantic representation model that uses multilayer transformers 50,51 as the fundamental encoders for modeling contextual information through self-attention mechanisms. This model has a structure similar to that of BERT.

In contrast to BERT, ERNIE masks semantic units such as words and entities and expands pretraining on knowledge-rich word corpora. This enables improved modeling of prior semantic knowledge and entity concepts, ultimately improving the model’s capacity for semantic representation. In addition to being a context encoder for producing sentence expressions, ERNIE can also function as a knowledge repository, creating sentences implicitly from a large quantity of recorded factual knowledge. As a result, ERNIE serves as the feature extractor of textual modalities, concurrently capturing the text’s surface and semantic properties. Dahou et al. 52 investigated how linguistic features, particularly Named Entity Recognition (NER), help identify false news. Two models were created: a token classification model for NER features and an AraBERT multi-task learning (MTL) model for identifying fake news. Machine learning techniques and an embedding fusion methodology were used to integrate the embedding vectors of these models, and a feature selection algorithm, RLTTAO, was devised to select pertinent features and improve performance. Findings indicated that adding NER features increased detection accuracy by 1.62% on average across 5 out of 7 datasets. Dahou et al. 53 used a pre-trained Transformer model to extract features from Arabic social media posts using several advanced approaches, such as multi-task learning (MTL), with a customized Nutcracker Optimization Algorithm to enhance these features. Accuracies of 87% for binary classification and 69% for multi-class classification demonstrate the framework’s effectiveness and its advantage over traditional techniques. This helpful instrument gives the public trustworthy information and aids the fight against fake news.

Alotaibi and Al-Dossari 54 noted that fake news detection in Arabic is less advanced than in English and provided an overview of Arabic fake news research, detailing feature extraction methods and machine learning and deep learning algorithms.

The following are the present challenges in identifying fake news, based on the overview of the related work:

1. Limited research on Arabic fake news: Although the body of work on detecting fake news is growing, little research focuses specifically on Arabic-language content 55. This reflects a high demand for Arabic-language solutions in this domain.

2. Emphasis on real-world applications: Although accuracy is the primary concern of numerous existing studies, research on the practical use of fake news detection models in real-world scenarios is needed, taking user acceptability, scalability, and computational efficiency into account.

3. Explainable models are necessary: Many current models, particularly those based on deep learning, are regarded as "black boxes." Building confidence in model judgments and spotting potential biases depend on understanding the rationale behind them.

4. Handling the evolving nature of fake news: Since fake news strategies are ever-changing, detection techniques must adjust to new developments. Creating models that generalize well to many forms of fake news is crucial.

5. Variability and sophistication: Differentiating fake news from real news based only on outward appearances can be challenging because fake news frequently imitates real news in style and presentation. Misinformation strategies are becoming increasingly sophisticated and require sophisticated detection methods that can adjust to new patterns.

6. Linguistic nuances and Contextual Understanding: A thorough grasp of linguistic nuances and the capacity to perceive context are essential for successfully identifying fake news. The great range of languages and the unique cultural contexts in which news is distributed make this problematic.

7. Subjectivity and Bias: Finding biases and subjective claims in news articles is challenging without restricting free speech or adding biases to the detection process.

8. Scalability and generalizability: Making detection algorithms scalable to handle enormous amounts of data on several platforms and generalizable to various subjects and languages is challenging.

The published literature makes clear that many researchers have addressed fake news identification using both conventional ML and DL-based algorithms, and it also emphasizes the difficulties the field currently faces. These include the complex methods used to produce and spread false information, the speed at which false information spreads, and the challenge of achieving high detection accuracy while preserving interpretability and generalizability.

Our goal in this research, therefore, is to expand the knowledge base in fake news detection for the Arabic language by focusing on Arabic news articles and developing language-specific models that can effectively address the unique challenges of Arabic text, such as complex linguistic structures and limited available resources, while remaining general enough to work on different Arabic datasets. These models include ML, transformer-based, and DL models. We integrate both supervised and unsupervised FastText word embeddings with these models to improve the generalizability and accuracy of fake news detection, along with various regularization strategies and hyperparameter tuning techniques.

Proposed methodology

In this work, we propose different types of classifiers to identify fake text. We then evaluate all these classifiers through a set of experiments on two different Arabic datasets to find the best classifier for detecting fake news in Arabic. With all these classifiers, the FastText library is applied to provide efficient word representations that enhance text classification. As shown in Fig. 1, the proposed framework comprises three primary consecutive stages: the Data Collection and Preprocessing stage, the Textual Representation and Feature Extraction stage, and finally, the Modelling of Fake News Classifiers stage. The details of these stages are explained next.

Fig. 1. Methodology diagram for fake news detection.

A. Dataset collection and preprocessing stage

We adopted the binary classification problem in our study, where fake news is represented by 0 and real news by 1. Two publicly accessible datasets were utilized. The first dataset, AFND 56, comprises 134 distinct Arabic online news sources, providing 606,912 Arabic articles. The dataset is divided into "real news," with 52 sources and 207,310 articles, and "fake news," with 51 sources and 167,233 articles. The second dataset is Arabic Pocket Sheets 57, created by acquiring 61,228 fake news sentences, originally written in Arabic, from an Arabic Twitter dataset provided by Mendeley 58. Web scraping was used to obtain 66,977 real news instances from the well-known No Rumors website repository. We named this dataset “ARABICFAKETWEETS.” The details of both datasets are shown in Table 1.

Table 1.

Number of instances in the used datasets.

Dataset No. of fake articles No. of real articles Total no. of articles
AFND 167,233 207,310 374,543
ARABICFAKETWEETS 61,228 66,977 128,205

Improving the performance of deep learning (DL) and machine learning (ML) models requires efficient data preprocessing. This process involves removing irrelevant text from the dataset and ensuring the data is formatted correctly and succinctly. Our paper focused on two key columns: "text," which included all the news comments, and "label," which identified the news as true or fake.

The primary purpose of text preprocessing is to significantly improve the effectiveness of learning algorithms. By carefully preparing the data, we can boost the quality and relevance of the information used for training and analysis. We execute crucial steps to preprocess the “text” column. The text undergoes several preprocessing steps before being fed into the classifier, significantly influencing the classification model’s overall performance.

First, we preprocessed the text by eliminating all non-Unicode letters and punctuation. Sentences are then tokenized, stop words are removed, and tokens are extracted. By encoding the sentences into numerical sequences, we determine the unique token count and the maximum sentence length, ensuring that all sentences are standardized to the same length. Labels are then encoded via one-hot encoding. This process utilizes the Arabic-Stop-Words and Natural Language Toolkit (NLTK) 59 libraries; stop words are removed through five different Python modules.

When dealing with high-dimensional data, dimensionality reduction techniques are essential for mitigating the risk of overfitting. The dataset is divided into two main groups, excluding uncertain news: “not credible” for fake news and “credible” for real news. The preprocessing operations include punctuation correction, tokenization, stemming, stop word removal, URL removal, and the elimination of names, all essential for refining the dataset.
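As a rough illustration, the preprocessing pipeline described above might be wired up as follows. This is a minimal sketch, assuming NLTK’s Arabic stop-word list and the Keras text utilities; `raw_texts` and `raw_labels` are placeholders for the loaded "text" and "label" columns, and the vocabulary size (5000) and sequence length (200) reuse the values given later for the hybrid model.

```python
import re
import nltk
from nltk.corpus import stopwords
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.utils import to_categorical

nltk.download('stopwords')
arabic_stops = set(stopwords.words('arabic'))

def clean_text(text):
    # Drop URLs, mentions, and retweet markers, then keep only Arabic letters.
    text = re.sub(r'http\S+|@\w+|\bRT\b', ' ', text)
    text = re.sub(r'[^\u0621-\u064A\s]', ' ', text)
    # Tokenize on whitespace and remove stop words.
    return ' '.join(t for t in text.split() if t not in arabic_stops)

texts = [clean_text(t) for t in raw_texts]            # "text" column (placeholder)
tokenizer = Tokenizer(num_words=5000)                 # vocabulary size
tokenizer.fit_on_texts(texts)
sequences = pad_sequences(tokenizer.texts_to_sequences(texts), maxlen=200)
labels = to_categorical(raw_labels)                   # one-hot encoded "label" column
```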

Tables 2 and 3 present samples of the AFND and ARABICFAKETWEETS datasets: Table 2 shows the original AFND text before and after preprocessing, and Table 3 shows the ARABICFAKETWEETS text before and after preprocessing. Table 4 gives English translations of samples taken from the Arabic datasets.

Table 2.

The text of AFND Dataset before and after preprocessing.

S-No Freq Text Label
The text of the AFND dataset A. Before Preprocessing 1 254 {“articles”: [{“title”: “مجلة بلاي بوي الإباحية تشعر بالخجل من وضعها  Fake
2 1499 {“articles”: [{“title”: “شاب متكبر يضع النفايات في سلة المهملات Fake
3 285 {“articles”: [{“title”: “أب يخير ابنه بين البربيش والخيزرانة ليعطيه مالا  Fake
4 517 {“articles”: [{“title”: “الزعيم يوز ع المكرمات كاريكاتير محمد عفيفة  Fake
5 425 {“articles”: [{“title”: “شاب يقاطع دكانا لم يعطه صاحبه البخيل كيس بيض  Fake
6 379 {“articles”: [{“title”: “قرد يصاب بالقلق بعد سماعه إشاعات عن إمكانية تحوله لانسان  Fake
7 616 {“articles”: [{“title”: “دراسة المواطن العربي يفني ربع عمره في البحث عن وظيفة  Fake
8 639 {“articles”: [{“title”: “توق عات بارتفاع أرباح شركة الجامعة الأردنية العالمية  Fake
9 1014 {“articles”: [{“title”: “ليبرمان النبي موسى أخطأ حين جاء بنا إلى الارض  Fake
10 1254 {“articles”: [{“title”, “دليل أربع عقد نفسية تؤه لك لتصبح مديرا يطمح Fake
B. After Preprocessing 1 254 مجلة بلاي بوي الإباحية تشعر بالخجل من وضعها Fake
2 1499 شاب متكبر يضع النفايات في سلة المهملات Fake
3 285 أب يخير ابنه بين البربيش والخيزرانة ليعطيه مالا Fake
4 517 الزعيم يوز ع المكرمات كاريكاتير محمد عفيفة Fake
5 425 شاب يقاطع دكانا لم يعطه صاحبه البخيل كيس بيض Fake
6 379 قرد يصاب بالقلق بعد سماعه إشاعات عن إمكانية تحوله لانسان Fake
7 616 دراسة المواطن العربي يفني ربع عمره في البحث عن وظيفة Fake
8 639 توقعات بارتفاع أرباح شركة الجامعة الأردنية العالمية Fake
9 1014 ليبرمان النبي موسى أخطأ حين جاء بنا إلى الأرض Fake
10 1254 دليل أربع عقد نفسية تؤه لك لتصبح مديرا يطمح Fake

Table 3.

The text of ARABICFAKETWEETS dataset before and after preprocessing.

S-No Freq Text Label
The text of ARABICFAKETWEETS dataset A. Before Preprocessing 1 211 صاحب سوبرماركت وضع لافته مضحكة على شفتيه press: Fake
2 145 \nRT @a_s_h1234567 شاب سعودى انقذ طفل كان يرضع من الامم المتحدة  Fake
3 256 nRT @Amer_AlQah6ani: هبوط صاروخ باليستى بين مكة وجدة  Fake
4 237 @qertyui5521 هزة ارضية بالمدينة المنورة تحدث انفجار ضخم  Fake
5 345 @ALsiasi14\nRT السعودية تقرر منع تشغيل الموسيقى في المطاعم و المقاهي ب الرياض Fake
6 423 \nRT @Dr__Hussein غرامه وسجن عدم حمل البطاقة الشخصية فى اليمن  Fake
B. After Preprocessing 1 211 صاحب سوبرماركت وضع لافته مضحكة على شفتيه Fake
2 145 شاب سعودى انقذ طفل كان يرضع من الامم المتحدة Fake
3 256 هبوط صاروخ باليستى بين مكة وجدة Fake
4 237 هزة ارضية بالمدينة المنورة تحدث انفجار ضخم Fake
5 345 السعودية تقرر منع تشغيل الموسيقى في المطاعم و المقاهي بالرياض Fake
6 423 غرامه وسجن عدم حمل البطاقة الشخصية فى اليمن Fake

Table 4.

The texts used in the dataset that were translated into the English language.

Text in Arabic Text Translated into English
مجلة بلاي بوي الإباحية تشعر بالخجل من وضعها Playboy porn magazine feels ashamed of its situation
شاب متكبر يضع النفايات في سلة المهملات Arrogant Young Man Puts Trash in the Trash Bin
أب يخير ابنه بين البربيش والخيزرانة ليعطيه مالا Father Gives His Son a Choice Between a Hose and a Stick to Give Him Money
الزعيم يوز ع المكرمات كاريكاتير محمد عفيفة The Leader Distributes Gifts (Cartoon by Mohammed Afifa)
شاب يقاطع دكانا لم يعطه صاحبه البخيل كيس بيض Young Man Boycotts a Shop Because the Stingy Owner Didn’t Give Him a Bag of Eggs
قرد يصاب بالقلق بعد سماعه إشاعات عن إمكانية تحوله لانسان A monkey becomes anxious after hearing rumors about the possibility of him turning into a human.
دراسة المواطن العربي يفني ربع عمره في البحث عن وظيفة Study: The Arab citizen spends a quarter of his life looking for a job
توقعات بارتفاع أرباح شركة الجامعة الأردنية العالمية Expectations of an increase in the profits of the Jordanian International University Company
ليبرمان النبي موسى أخطأ حين جاء بنا إلى الأرض Lieberman: The Prophet Moses made a mistake when he brought us to earth
دليل أربع عقد نفسية تؤه لك لتصبح مديرا يطمح A guide to four psychological complexes that prepare you to become an aspiring manager
صاحب سوبرماركت وضع لافته مضحكة على شفتيه A supermarket owner put a funny sign on his lips.
شاب سعودى انقذ طفل كان يرضع من الامم المتحدة A young Saudi man saved a child who the United Nations was breastfeeding.
هبوط صاروخ باليستى بين مكة وجدة A ballistic missile landed between Mecca and Jeddah.
هزة ارضية بالمدينة المنورة تحدث انفجار ضخم An earthquake in Medina causes a massive explosion.
السعودية تقرر منع تشغيل الموسيقى في المطاعم و المقاهي بالرياض Saudi Arabia has decided to ban music in restaurants and cafes in Riyadh.
غرامه وسجن عدم حمل البطاقة الشخصية فى اليمن Fine and imprisonment for not carrying an I.D. card in Yemen

B. Textual representation and feature extraction stage

Following data preprocessing and balancing, the subsequent stage focused on data representation and feature extraction, wherein the data were transformed into vector attributes for interpretation by machine learning (ML) and deep learning (DL) models. This process yields two types of features from the dataset: word embedding-based deep features and textual features, as elaborated upon in the following sections.

Word embeddings are methods that turn words into numerical vectors so that, when visualized, related words are positioned next to one another. This technique captures words’ semantic and syntactic details from a large corpus, making it valuable for sentiment analysis, entity recognition, and part-of-speech tagging applications. Word2Vec is a well-known pre-trained word embedding that provides word vector representations using two architectures: continuous skip-gram and continuous bag of words (CBOW). While skip-gram predicts surrounding words from a center word, CBOW predicts a word’s vector based on its surrounding words. Another widely used embedding, GloVe, constructs a co-occurrence matrix of words in a corpus and then factorizes this matrix to generate word vectors.

Choosing the right word embedding technique significantly impacts the performance of a deep learning model for detecting fake news in Arabic text. While both Word2Vec and GloVe are efficient and capable of capturing primary semantic relationships, they often struggle with context and polysemy issues, which are particularly problematic in Arabic fake news detection. These methods might not effectively differentiate between neutral and emotionally charged language or unusual word combinations, commonly used in fake news to manipulate readers.

In practice, the best word embedding method for fake news classification depends on many variables, including the features of the targeted fake news, computational capacity, and data accessibility. Pre-trained contextual embeddings such as AraBERT may be best suited for fine-tuning on large datasets. However, training such embeddings is computationally costly, which makes FastText a good starting option if resources are scarce. Another critical factor is the frequency of irony and sarcasm in the targeted fake news; contextual embeddings are probably necessary in those situations. To improve a deep learning model’s ability to identify fake news in Arabic, the best word embedding strategy can be found by carefully weighing these variables and trying various methods.

In this research, we adopted the FastText tool. FastText is a word representation tool developed by Facebook’s research team that operates in both unsupervised and supervised modes. It boasts an extensive lexicon of 2 million words obtained from Common Crawl (600 billion tokens), with each word embedded in a 300-dimensional vector space. The FastText architecture, shown in Fig. 2, improves on the Word2Vec CBOW model and provides additional benefits. It represents subword information and manages out-of-vocabulary words effectively, which is particularly advantageous for languages with complex morphological structures such as Arabic. Unlike traditional word vectors, FastText considers the internal structure of words, aiding in the representation of rare or misspelled words.

Fig. 2. The word embedding architecture of FastText.

The mathematical formula for calculating FastText word embeddings is given by Eq. 1 53:

$$U_w = \frac{1}{|N|} \sum_{n \in N} x_n \tag{1}$$

Where:

U_w denotes the vector in the embedding space for a word w, representing w as a vector of real numbers in a high-dimensional space; each word in the vocabulary has a corresponding vector.

1/|N| is the averaging fraction, where |N| denotes the number of context words around the target word w.

$\sum_{n \in N} x_n$ is the summation over the vectors of all context words, with n ∈ N ranging over the context set.

x_n denotes the vector of the n-th context word.

In our research, we adopted FastText embeddings due to their proficiency in capturing semantic and contextual nuances in Arabic text data. These embeddings are expected to enhance the performance of natural language processing tasks by providing numerical representations for textual inputs, enabling machines to process and comprehend textual data more efficiently. Moreover, FastText provides embeddings that consider subword information, which is essential for managing the intricate morphology of Arabic words, whose meanings vary greatly depending on their prefixes and suffixes. Furthermore, FastText better identifies slang and misspelled terms that are common in fake or low-quality writing. However, if it is not adjusted for the particular task, it may introduce noise from subword information.

Handling out-of-vocabulary (OOV) words in Arabic presents significant challenges due to the language’s rich morphology and dialectal variations 60. To address this, we employ several strategies, illustrated with a short sketch after the list:

  1. Subword-level embeddings (character-level n-grams): breaking words down into character sequences. For instance, the word "كتب" (wrote/books) is split into "ك", "ت", "ب", "كت", "تب", and "كتب".

  2. Byte Pair Encoding (BPE): by learning subword units based on character pair frequencies. For instance, "الكتاب" (the book) might be split into "ال" and "كتاب."

  3. Morphological analysis, including stemming and lemmatization: reducing words to their root form to increase vocabulary coverage; for example, "يكتبون" (they write) is reduced to the root "كتب" (write).

  4. Hybrid Approaches: combining subword and morphological techniques, capturing both subword and morphological information.

  5. Out-of-vocabulary replacement: replacing OOV words with semantically similar in-vocabulary words, such as replacing "هاتف ذكي" (smartphone) with "جوال" (mobile phone), or initializing embeddings for OOV words randomly and fine-tuning them during training.

These strategies effectively manage the complexities of Arabic, improving the model’s performance in handling OOV words.
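To make strategy 1 concrete, the sketch below shows FastText-style character n-gram extraction together with a hypothetical `oov_vector` helper that averages the vectors of known n-grams; the function names, the `ngram_vectors` mapping, and the random-initialization fallback are illustrative assumptions, not the paper’s exact implementation.

```python
import numpy as np

def char_ngrams(word, n_min=3, n_max=6):
    """Return FastText-style character n-grams with boundary markers."""
    w = f"<{word}>"
    return [w[i:i + n]
            for n in range(n_min, n_max + 1)
            for i in range(len(w) - n + 1)]

def oov_vector(word, ngram_vectors, dim=300):
    # Approximate an OOV word by averaging the vectors of its known n-grams;
    # fall back to a small random vector to be fine-tuned during training.
    vecs = [ngram_vectors[g] for g in char_ngrams(word) if g in ngram_vectors]
    return np.mean(vecs, axis=0) if vecs else np.random.normal(0, 0.1, dim)

print(char_ngrams("كتب"))  # ['<كت', 'كتب', 'تب>', '<كتب', 'كتب>', '<كتب>']
```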

Our approach produces word embeddings for a fake news dataset using both FastText’s unsupervised and supervised models to evaluate their efficiency with the classification models.

In unsupervised learning (see Algorithm 1), FastText enhances the Word2Vec model by incorporating subword information through character n-grams (typically 3 to 6 characters). For instance, "الزعيم" (Leader) is broken down into fragments such as "زعي", "الز", and "عيم", while "المواطن" (Citizen) is dissected into "موا", "المو", and "اطن". This method aids in understanding word morphology and recognizing semantic connections between words with shared subparts. FastText utilizes extensive unlabeled text data to construct word representations, which can be applied to tasks such as word similarity and analogy.

Algorithm 1. Unsupervised FastText embedding for text data.
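A minimal sketch of training such an unsupervised model with the official `fasttext` Python bindings; the corpus file name and hyperparameters shown are illustrative.

```python
import fasttext

# One preprocessed article per line in corpus.txt; character n-grams of
# 3-6 characters (minn/maxn) supply the subword information described above.
model = fasttext.train_unsupervised(
    'corpus.txt', model='skipgram',   # 'cbow' is the alternative architecture
    dim=300, minn=3, maxn=6, epoch=50,
)
# Subword composition lets us embed rare or unseen words such as "الزعيم".
vec = model.get_word_vector('الزعيم')
```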

In the supervised learning algorithm (see Algorithm 2), FastText is employed for text classification tasks. This algorithm incorporates subword information and trains on labeled datasets where each text excerpt is linked to a specific category or label. Using a hierarchical softmax function based on the Huffman coding tree, FastText accelerates training and prediction processes, efficiently handling large-scale datasets comprising millions of documents. During text classification, the model computes the average word vectors within a text to generate its representation, subsequently predicting its label. FastText’s supervised mode excels in managing extensive datasets and numerous classes.

Our research extensively explored both supervised and unsupervised FastText models and consistently observed superior performance with the supervised approach. This underscores the importance of labeled training data in text classification, where supervised learning leverages explicit category information to achieve heightened accuracy. Throughout our experiments, we trained the FastText model for approximately 50 epochs using different learning rates customized for the individual datasets. This strategic training regimen effectively harnesses the potential of FastText embeddings to enhance classification performance.

Algorithm 2. Supervised FastText embeddings for text data.
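A corresponding sketch for the supervised mode; the `__label__` prefix is fastText’s labeling convention, and the learning rate shown is only a placeholder for the dataset-specific values mentioned above.

```python
import fasttext

# Each line of train.txt reads '__label__fake <text>' or '__label__real <text>'.
clf = fasttext.train_supervised(
    'train.txt', dim=300, epoch=50, lr=0.5,   # lr tuned per dataset in practice
    minn=3, maxn=6, loss='hs',                # hierarchical softmax, as described
)
labels, probs = clf.predict('هبوط صاروخ باليستى بين مكة وجدة')
```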

C. Modelling of fake news classifiers stage

This section first outlines the machine learning (ML) and boosting models used, together with their hyperparameter configurations. It then presents the architectures of the deep learning (DL) models and of four ensemble-based deep learning classification models, thoroughly examining each model’s architecture and how it fits into our research approach, and introduces the hybrid ensemble-based techniques used to achieve high performance.

1- Machine learning models

In our experiments, we used six common ML models: Naïve Bayes (N.B), Logistic Regression (L.R), Linear SVC, Random Forest (R.F), Support Vector Machine (SVM), and Decision Tree (D.T). The boosting models used were Gradient Boosting, XGB, CatBoost, and AdaBoost. All these models employed FastText embeddings as input. Several regularization strategies and hyperparameters were implemented to maximize the performance of these ML models, as explained next.

1–1. Hyperparameter tuning for ML models

Table 5 lists the hyperparameters used to optimize the machine-learning models. The alpha parameter is used with Naïve Bayes for smoothing; it is a positive constant added to the frequency counts of features to ensure that no probability is zero. The n_estimators parameter in the Random Forest (R.F.) model specifies the number of trees in the ensemble. The C parameter regulates the degree of regularization and helps to minimize overfitting in the Logistic Regression (L.R.), Support Vector Machine (SVM), and Linear SVC classifiers; lower values of C indicate more regularization, which limits the model to simpler decision boundaries. The min_samples_split parameter of the Decision Tree (D.T) classifier specifies the minimal number of samples required to split a node, influencing the tree’s complexity and its risk of overfitting. The GridSearchCV function can be used to automate the tuning of these hyperparameters, as sketched after Table 5.

Table 5.

The configuration of Machine learning models.

Model Hyperparameter values
Naïve Bayes alpha: [0.1, 0.5, 1.0, 1.5, 2.0]
Logistic Regression C: [0.01, 0.1, 1, 10, 100]
Linear SVC C: [0.01, 0.1, 1, 10, 100]
Random Forest n_estimators: [50, 100, 200]
SVM C: [0.1, 1, 10, 100]
Decision Tree min_samples_split: [2, 5, 9]
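As an illustration of the GridSearchCV tuning mentioned above, here is a minimal scikit-learn sketch for the Logistic Regression grid in Table 5, assuming `X` holds the FastText document vectors and `y` the binary labels prepared earlier.

```python
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

param_grid = {'C': [0.01, 0.1, 1, 10, 100]}    # grid from Table 5
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid, cv=5, scoring='f1',
)
search.fit(X, y)                                # X: FastText vectors, y: 0/1 labels
print(search.best_params_, search.best_score_)
```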

1–2. Hyperparameter tuning for boosting models

The hyperparameters used for the different boosting models are listed in Table 6. Gradient Boosting used a subsample ratio of 0.7 to lessen overfitting and computational load, a maximum depth of 3 to avoid overfitting while still capturing complicated patterns, and a learning rate of 0.1 to strike a compromise between training speed and accuracy. XGBoost (XGB) likewise employs 100 rounds and a learning rate of 0.1, with the same subsample ratio of 0.7 and depth of 3 to improve robustness. CatBoost uses a deeper tree depth of five, a learning rate of 0.1, a subsample ratio of 0.7, and 100 boosting rounds. AdaBoost, on the other hand, employs 100 rounds and a far greater learning rate of 1.0, reflecting a more aggressive weight adjustment strategy.

Table 6.

The configuration for boosting algorithms.

Model Round Learning rate Depth Sample
Gradient Boosting 100 0.1 3 0.7
XGB 100 0.1 3 0.7
CatBoost 100 0.1 5 0.7
AdaBoost 100 1.0 None None
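A sketch of how the Table 6 settings map onto a boosting library, here XGBoost’s scikit-learn wrapper; `X_train`, `y_train`, and `X_test` are assumed from an earlier train/test split.

```python
from xgboost import XGBClassifier

xgb = XGBClassifier(
    n_estimators=100,     # boosting rounds from Table 6
    learning_rate=0.1,
    max_depth=3,
    subsample=0.7,        # subsample ratio from Table 6
)
xgb.fit(X_train, y_train)
preds = xgb.predict(X_test)
```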

2. Deep learning models

Deep learning models used in our research include:

  1. Transformer models: These include the following common NLP transformer models:

     - BERT (Bidirectional Encoder Representations from Transformers), a transformer-based model for natural language understanding.

     - RoBERTa (Robustly optimized BERT approach), an improved variant of BERT with more training data and optimized hyperparameters.

     - mBERT (Multilingual BERT), which extends BERT to support multiple languages.

     - XLNet, a generalized autoregressive pretraining method that outperforms BERT on several NLP tasks.

  2. Ensembles of common deep learning models: These include ensembles of the following three common DL models:

     - CNN (Convolutional Neural Network), commonly applied in text classification and NLP.

     - RNN (Recurrent Neural Network), designed for sequence data and time series, with variants like LSTM (Long Short-Term Memory) and Bi-LSTM (Bidirectional LSTM) addressing long-term dependency issues.

     - GRU (Gated Recurrent Unit), another RNN variant that simplifies the LSTM architecture; Bi-GRU (Bidirectional GRU) processes data forward and backward to capture context from both sequence ends.

In the following, we explain the architecture of each of the Ensemble models used in this paper.

2–1 Ensemble-based models architectures

This paper examined four different ensemble-based deep learning models, beginning with the RNN-CNN model. This model integrates a convolutional neural network (CNN) with four layers of recurrent neural networks (RNNs). The embedding layers’ outputs are initially processed through two convolutional layers to extract features for the subsequent RNN layers. The CNN layers apply max-pooling to the features, which are then concatenated. In the final stage, a fully connected dense layer predicts the probability of each class label, as illustrated in Fig. 3.

Fig. 3. The hybrid model of RNN with CNN layers.

The second model we used is a CNN-LSTM sequential deep learning architecture designed specifically to identify Arabic fake news (AFN). Because they can successfully manage different gap lengths, LSTM networks perform well in categorization and sequence-processing tasks. As depicted in Fig. 4, our CNN-LSTM model starts with a convolutional layer that extracts features from the text embeddings. This layer uses 64 filters, each with a kernel size of 3. A max-pooling layer follows, reducing the dimensionality of the feature maps. Subsequently, an LSTM layer with 64 units and a dropout rate of 0 processes these features as a time series, enabling the model to capture temporal dependencies in the data.

Fig. 4. The hybrid model of LSTM with a CNN layer.
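A minimal Keras sketch of the CNN-LSTM just described (64 filters, kernel size 3, max-pooling, a 64-unit LSTM); the embedding settings reuse the vocabulary size of 5000 and sequence length of 200 given later for the hybrid model, and the softmax output layer follows the configuration in Table 8.

```python
from tensorflow.keras import layers, models

cnn_lstm = models.Sequential([
    layers.Input(shape=(200,)),               # padded sequence length
    layers.Embedding(5000, 300),              # FastText-sized embeddings
    layers.Conv1D(64, kernel_size=3, activation='relu'),
    layers.MaxPooling1D(pool_size=2),
    layers.LSTM(64, dropout=0.0),             # dropout rate of 0, as described
    layers.Dense(2, activation='softmax'),    # fake vs. real
])
cnn_lstm.compile(optimizer='adam', loss='categorical_crossentropy',
                 metrics=['accuracy'])
```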

The third model we explored is the RNN with LSTM model, illustrated in Fig. 5. This architecture integrates an embedding layer within a recurrent neural network (RNN) framework, connected to three LSTM layers and several one-dimensional convolutional layers with varying kernel sizes. These convolutional layers create diverse feature maps, enhancing the model’s interpretation of the data. The max pooling layers are paired with each convolutional layer to control feature compression and reduce overfitting. The outputs from these max-pooling layers are then merged using a concatenate layer.

Fig. 5. The hybrid model of LSTM with RNN.

The proposed model, a hybrid architecture known as the Bi-LSTM-Bi-GRU model, combines bidirectional long short-term memory (Bi-LSTM) and bidirectional gated recurrent unit (Bi-GRU) networks and has demonstrated promising results in the identification of fake news. By efficiently collecting long-term relationships and contextual information, this model uses the advantages of both Bi-LSTM and Bi-GRU to increase detection accuracy. Bi-LSTM networks are particularly effective at retaining information over long sequences due to their memory cell structures, which help maintain relevant context and address the vanishing gradient problem. In contrast, with their streamlined gating mechanisms, Bi-GRU networks provide computational efficiency and are well-suited for managing shorter dependencies. By integrating these two bidirectional models, the hybrid architecture processes input sequences in both forward and backward directions, ensuring a thorough understanding of the context. This dual-direction processing allows the model to identify intricate patterns and nuances in the text, which is essential for distinguishing between real news and misinformation. The methodology of the proposed model is presented in Algorithm 3.

Figure 6 illustrates the primary architecture of the hybrid bidirectional LSTM (Bi-LSTM) and bidirectional GRU (Bi-GRU) model. The neural network has nine layers and an input size of 200, corresponding to each word’s vector size. The model includes an embedding layer with a maximum input document length of 200 words, a vocabulary size of 5000, and an embedding dimension (EMBEDDING_DIM) of 300. Both deep neural models in this hybrid architecture are 50-unit Bi-LSTMs with return_sequences set to TRUE and 50-unit Bi-GRUs with return_state and return_sequences set to TRUE. At the outputs of the Bi-LSTM and Bi-GRU models, global maximum pooling and global average pooling layers are introduced to improve robustness against positional changes in features. The outputs from these pooling layers are concatenated to create a single feature vector. One dense layer that generates a single output is the model’s final output layer.

Fig. 6. The proposed model of Bi-LSTM with Bi-GRU layers.

Algorithm 3. Bi-LSTM + Bi-GRU.
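A sketch of the Fig. 6 architecture in the Keras functional API, under the settings stated above (input length 200, vocabulary 5000, embedding dimension 300, 50-unit bidirectional layers, global max and average pooling concatenated into one feature vector); the dropout rate and the sigmoid single-output head are assumptions where the text leaves details open.

```python
from tensorflow.keras import layers, Model

MAX_LEN, VOCAB, EMB_DIM = 200, 5000, 300

inp = layers.Input(shape=(MAX_LEN,))
x = layers.Embedding(VOCAB, EMB_DIM)(inp)
x = layers.Bidirectional(layers.LSTM(50, return_sequences=True))(x)
x = layers.Bidirectional(layers.GRU(50, return_sequences=True))(x)
pooled = layers.concatenate([
    layers.GlobalMaxPooling1D()(x),       # robustness to positional changes
    layers.GlobalAveragePooling1D()(x),
])
pooled = layers.Dropout(0.2)(pooled)      # rate is an assumption
out = layers.Dense(1, activation='sigmoid')(pooled)  # single-output final layer

bi_lstm_bi_gru = Model(inp, out)
bi_lstm_bi_gru.compile(optimizer='adam', loss='binary_crossentropy',
                       metrics=['accuracy'])
```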

2–2 Deep learning model configurations

The hyperparameter optimization process for deep learning (DL) involves carefully adjusting learning parameters through targeted experimentation. We adjusted the model’s learning process through targeted experimentation for both transformer-based and ensemble-based models.

2–2-a. Hyperparameter tuning for transformer-based deep learning models

Table 7 lists the critical hyperparameters for transformer-based models: tokenizer, batch size, learning rate, and number of epochs. While mBERT, XLNet, and RoBERTa use AutoTokenizer to choose the best tokenizer for each model, BERT uses BertTokenizer to tokenize input text based on its pretraining. Tokenizers translate text into tokens and numerical values so that models can understand it. The batch size is 32 to balance computing efficiency and gradient-update stability. A learning rate of 2e-5 guarantees consistent convergence and moderate weight changes to prevent overshooting. The complete dataset is processed five times during training, which allows for enough learning without overfitting.

Table 7.

The configuration of the transformer-based models.

Model Tokenizer Batch Learning rate Epoch
BERT BertTokenizer 32 2e-5 5
ROBERTa Auto tokenizer 32 2e-5 5
MBERT Auto tokenizer 32 2e-5 5
XLNET Auto tokenizer 32 2e-5 5
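A hedged sketch of fine-tuning one of the Table 7 checkpoints with Hugging Face Transformers; the checkpoint name and the `train_ds`/`val_ds` datasets are placeholders, with the datasets assumed to be already tokenized.

```python
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)

checkpoint = "bert-base-multilingual-cased"   # placeholder: any Table 7 model
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint,
                                                           num_labels=2)
args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=32,   # batch size from Table 7
    learning_rate=2e-5,               # learning rate from Table 7
    num_train_epochs=5,               # epochs from Table 7
)
Trainer(model=model, args=args,
        train_dataset=train_ds, eval_dataset=val_ds).train()
```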

2–2-b. Hyperparameter tuning for ensemble-based deep learning models

The hyperparameters and configuration of each ensemble-based deep learning model include the number of layers of each DL type, the optimizer, the loss function, and the activation function used. For all the proposed deep learning models, the training time was set to 10 epochs to balance effective learning with the risk of overfitting, stopping early when the model loss stopped decreasing. For example, the LSTM component, consisting of two layers with 50 and 30 units, captures temporal dynamics essential for analyzing sequential data; it is followed by one dense layer and one dropout layer. The model ends with a softmax-activated dense layer suitable for classification tasks. This architecture excels in extracting spatial features and understanding temporal sequences.

Table 8 describes how the key hyperparameters and architectural components of the hybrid neural network models are configured. The RNN-CNN model has three model layers, two dense layers, two dropout layers, two pooling layers, and one flatten layer. The CNN-LSTM and RNN-LSTM models each have four model layers, two dense layers, two dropout layers, two pooling layers, and one flatten layer; they were trained over ten epochs using the softmax activation function, categorical cross-entropy loss, and the Adam optimizer. The Bi-LSTM-Bi-GRU model, on the other hand, has three model layers, three dense layers, two dropout layers, one flatten layer, and no pooling layers. It also trains for 10 epochs with categorical cross-entropy loss and the Adam optimizer but uses the ReLU activation function.

Table 8.

The configuration of the ensemble-based deep learning models.

Model Model layers Dense layers Dropout layers Pooling layers Flatten layers Epochs Activation Loss Optimizer
RNN-CNN 3 2 2 2 1 10 Softmax Categorical cross-entropy Adam
CNN-LSTM 4 2 2 2 1 10 Softmax Categorical cross-entropy Adam
RNN-LSTM 4 2 2 2 1 10 Softmax Categorical cross-entropy Adam
Bi-LSTM-Bi-GRU 3 3 2 0 1 10 ReLU Categorical cross-entropy Adam

Results and evaluations

To evaluate the fake news detection models developed in this research, we conducted experiments on both the AFND dataset and the ARABICFAKETWEETS dataset. All the models were developed in Python using the TensorFlow and Keras libraries and run on a Windows 10-based machine with a Core i7 processor and 16 GB RAM. For the evaluation, we used metrics such as accuracy, precision, recall, and F1 score to gauge a model’s efficacy in differentiating between positive and negative instances. We then performed an error analysis to assess the performance of the models by scrutinizing how well the models predict outcomes compared to actual data. Error analysis provides critical insights into where the machine learning models excel and where they falter, guiding improvements for enhanced predictive capabilities 53.

Recall 54 is the ratio of true positives to the total number of true positives and false negatives. Precision 55 quantifies the proportion of accurate positive forecasts to the overall number of positive forecasts. Accuracy 56 is a useful performance metric representing the percentage of accurately predicted observations out of all observations. The F1 score 55 combines the effects of precision and recall and is computed as their harmonic mean, providing an extensive evaluation of a model’s efficacy. All these metrics are easily calculated from the confusion matrix, which provides the values of true positive (T.P), true negative (T.N), false positive (F.P), and false negative (F.N) 46.
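Expressed with the confusion-matrix counts, these metrics are:

$$\mathrm{Recall} = \frac{TP}{TP + FN}, \qquad \mathrm{Precision} = \frac{TP}{TP + FP},$$

$$\mathrm{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \qquad F_1 = \frac{2 \cdot \mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}.$$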

These metrics are calculated twice for each model on both datasets: first with unsupervised FastText embeddings, and second with supervised FastText embeddings. We evaluated all the obtained results to determine which model is better for detecting fake news. These results are presented next, followed by an error analysis. Then, a statistical significance analysis of the results is conducted. Furthermore, an analysis of the models’ complexity is provided to assess their computational speed. Finally, a comparison with state-of-the-art fake news detectors is made.

1- Evaluation results of all models on the AFND dataset

Table 9 shows the results for the six ML models used in this paper: Naïve Bayes, Logistic Regression, Linear SVC, Decision Tree, Random Forest, and SVM. With unsupervised FastText, the Decision Tree model outperformed the other models, obtaining an accuracy of 0.77 and an F1 score of 0.76; with supervised FastText, its accuracy and F1 score are 0.77 and 0.77, respectively.

Table 9.

Results of supervised and unsupervised FastText with ML models on AFND dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
Naïve Bayes 0.42 0.43 0.43 0.44 0.44 0.45 0.45 0.45
Logistic Regression 0.45 0.44 0.44 0.45 0.47 0.46 0.46 0.47
Linear SVC 0.33 0.33 0.33 0.34 0.35 0.35 0.35 0.36
Decision Tree 0.74 0.74 0.76 0.77 0.77 0.75 0.77 0.77
Random Forest 0.66 0.67 0.68 0.68 0.68 0.68 0.69 0.69
SVM 0.71 0.72 0.72 0.71 0.73 0.73 0.73 0.73

In Table 10, we show the results using the four boosting methods: Gradient Boosting, XGB, CatBoost, and AdaBoost. For most metrics, the Gradient Boosting classifier achieved the best results. With unsupervised FastText, it achieved an accuracy of 0.58 and an F1 score of 0.60; with supervised FastText, it achieved an accuracy of 0.60 and an F1 score of 0.60.

Table 10.

Results of supervised and unsupervised FastText with boosting models on AFND dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
Gradient Boosting Classifier 0.58 0.57 0.60 0.58 0.56 0.59 0.60 0.60
XGB Classifier 0.53 0.54 0.53 0.53 0.56 0.56 0.55 0.55
CatBoost Classifier 0.45 0.43 0.46 0.44 0.47 0.46 0.48 0.46
AdaBoost Classifier 0.56 0.55 0.56 0.56 0.58 0.57 0.58 0.58
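
A corresponding sketch for the four boosting models; as in the previous sketch, the synthetic data stands in for FastText vectors, and the xgboost and catboost packages are assumed to be installed:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score, f1_score
from sklearn.ensemble import GradientBoostingClassifier, AdaBoostClassifier
from xgboost import XGBClassifier        # third-party; assumes xgboost is installed
from catboost import CatBoostClassifier  # third-party; assumes catboost is installed

# Synthetic stand-ins for FastText document vectors and fake/real labels
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 300))
y = rng.integers(0, 2, size=400)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

boosters = {
    "Gradient Boosting": GradientBoostingClassifier(),
    "XGB": XGBClassifier(n_estimators=100, eval_metric="logloss"),
    "CatBoost": CatBoostClassifier(iterations=100, verbose=0),
    "AdaBoost": AdaBoostClassifier(n_estimators=100),
}
for name, clf in boosters.items():
    pred = clf.fit(X_tr, y_tr).predict(X_te)
    print(f"{name}: acc={accuracy_score(y_te, pred):.2f} "
          f"f1={f1_score(y_te, pred):.2f}")
```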

Table 11 presents the outcomes of the four transformer-based models on the AFND dataset; each model was trained for 5 epochs. The XLNet model achieves the best performance: with unsupervised FastText it scored an accuracy of 0.87 and an F1 score of 0.86, while with supervised FastText it scored 0.88 for both F1 and accuracy.

Table 11.

Results of supervised and unsupervised FastText with transformer-based models on AFND dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
BERT 0.65 0.64 0.66 0.66 0.67 0.66 0.68 0.68
RoBERTa 0.77 0.76 0.78 0.79 0.79 0.78 0.80 0.80
mBERT 0.80 0.80 0.80 0.81 0.82 0.82 0.82 0.82
XLNet 0.86 0.86 0.86 0.87 0.88 0.88 0.88 0.88
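
A minimal fine-tuning sketch with the Hugging Face transformers library follows; the checkpoint name, sequence length, and example text are assumptions, as the paper does not specify them here:

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Hypothetical checkpoint choice; the paper does not name the exact
# pretrained weights behind its XLNet variant.
name = "xlnet-base-cased"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForSequenceClassification.from_pretrained(name, num_labels=2)

enc = tokenizer(["نص الخبر هنا"], truncation=True, padding=True,
                max_length=128, return_tensors="pt")
with torch.no_grad():
    logits = model(**enc).logits   # shape (1, 2): real/fake scores
pred = logits.argmax(dim=-1)

# Fine-tuning for the paper's 5 epochs would wrap this model in
# transformers.Trainer with TrainingArguments(num_train_epochs=5, ...).
```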

Table 12 displays the outcomes of the six DL models, CNN, RNN, GRU, Bi-GRU, LSTM, and Bi-LSTM, for both unsupervised and supervised FastText. With unsupervised FastText, the Bi-GRU model performed best, reaching an F1 score of 0.94 and an accuracy of 0.94 after 10 epochs. With supervised FastText, the Bi-GRU model was again the best, with an F1 score of 0.96 and an accuracy of 0.95 after 10 epochs.

Table 12.

Results of supervised and unsupervised FastText with DL models on AFND dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
CNN 0.52 0.52 0.53 0.53 0.54 0.54 0.55 0.55
RNN 0.82 0.83 0.83 0.83 0.84 0.85 0.85 0.85
GRU 0.93 0.93 0.93 0.93 0.94 0.93 0.95 0.95
Bi-GRU 0.93 0.93 0.94 0.94 0.94 0.95 0.96 0.95
LSTM 0.89 0.87 0.90 0.90 0.90 0.91 0.92 0.92
Bi-LSTM 0.91 0.91 0.92 0.92 0.93 0.93 0.94 0.94
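
A minimal Keras sketch of the best-performing Bi-GRU classifier follows; the layer sizes, padding length, and embedding dimension are assumptions, since the paper does not list them here:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

SEQ_LEN, EMB_DIM = 100, 300   # assumed padding length and FastText dimension

model = models.Sequential([
    layers.Input(shape=(SEQ_LEN, EMB_DIM)),   # sequences of FastText vectors
    layers.Bidirectional(layers.GRU(64)),     # Bi-GRU encoder
    layers.Dense(32, activation="relu"),
    layers.Dense(1, activation="sigmoid"),    # fake vs. real
])
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=["accuracy"])
model.summary()
# Training as in the paper: model.fit(X_train, y_train, epochs=10, ...)
```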

Table 13 shows the results for the ensemble-based models. Our proposed model (Bi-LSTM + Bi-GRU) outperformed all the other models under both embedding schemes: with unsupervised FastText it achieved an accuracy of 0.96 and an F1 score of 0.96 after 10 epochs, while with supervised FastText it achieved an F1 score of 0.98 and an accuracy of 0.98. This clearly indicates that the proposed model provides superior performance compared with all other models.

Table 13.

Results of supervised and unsupervised FastText with ensemble DL models on AFND dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
RNN-CNN 0.82 0.82 0.83 0.83 0.84 0.84 0.84 0.85
CNN-LSTM 0.83 0.83 0.84 0.84 0.86 0.85 0.86 0.86
RNN-LSTM 0.86 0.86 0.87 0.87 0.89 0.89 0.89 0.89
(Bi-LSTM + Bi-GRU) 0.95 0.95 0.96 0.96 0.97 0.97 0.98 0.98
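
One plausible Keras realization of the proposed hybrid stacks a Bi-LSTM layer on top of a Bi-GRU layer; the exact topology and hyperparameters below are assumptions, not the authors' published configuration:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

SEQ_LEN, EMB_DIM = 100, 300   # assumed padded FastText sequences

model = models.Sequential([
    layers.Input(shape=(SEQ_LEN, EMB_DIM)),
    layers.Bidirectional(layers.GRU(64, return_sequences=True)),  # Bi-GRU stage
    layers.Bidirectional(layers.LSTM(64)),                        # Bi-LSTM stage
    layers.Dropout(0.3),
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=["accuracy",
                       tf.keras.metrics.Precision(),
                       tf.keras.metrics.Recall()])
# model.fit(X_train, y_train, epochs=10, validation_split=0.1)
```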

2- Evaluation results of all models on the ARABICFAKETWEETS dataset

For the ARABICFAKETWEETS dataset, Table 14 displays the results for all the machine learning models used in this paper. As shown in the table, the Decision Tree model obtains the best results, with an accuracy of 0.81 and an F1 score of 0.81 for unsupervised FastText, and an F1 score of 0.82 and an accuracy of 0.83 for supervised FastText.

Table 14.

Results of supervised and unsupervised FastText with ML models on ARABICFAKETWEETS dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
Naïve Bayes 0.57 0.58 0.59 0.59 0.59 0.59 0.61 0.61
Logistic Regression 0.66 0.66 0.68 0.68 0.67 0.67 0.69 0.69
Linear SVC 0.46 0.47 0.48 0.48 0.48 0.48 0.50 0.51
Decision Tree 0.80 0.80 0.81 0.81 0.81 0.81 0.82 0.83
Random Forest 0.70 0.70 0.72 0.72 0.71 0.71 0.74 0.74
SVM 0.75 0.75 0.76 0.76 0.76 0.76 0.78 0.78

In Table 15, the results of applying the boosting models to the ARABICFAKETWEETS dataset are presented. The Gradient Boosting model obtained the highest scores across the board, indicating its effectiveness in classifying fake news among the boosting models. It scored an accuracy of 0.72 and an F1 score of 0.73 with unsupervised FastText, and an F1 score of 0.74 and an accuracy of 0.74 with supervised FastText.

Table 15.

Results of supervised and unsupervised FastText with boosting models on ARABICFAKETWEETS dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
Gradient Boosting 0.71 0.71 0.73 0.72 0.73 0.73 0.74 0.74
XGB 0.65 0.65 0.66 0.66 0.67 0.67 0.68 0.68
CatBoost 0.58 0.58 0.59 0.59 0.59 0.59 0.61 0.61
AdaBoost 0.68 0.68 0.70 0.70 0.70 0.70 0.72 0.72

Table 16 summarizes the performance metrics of the transformer-based models on the ARABICFAKETWEETS dataset; all models were trained for 5 epochs. XLNet achieved the highest performance, with precision and recall of 0.92, an F1 score of 0.93, and an accuracy of 0.93 with unsupervised FastText. With supervised FastText, XLNet achieved 0.93 for both precision and recall and 0.94 for both F1 and accuracy.

Table 16.

Results of supervised and unsupervised FastText with transformer models on ARABICFAKETWEETS dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
BERT 0.77 0.77 0.78 0.78 0.78 0.78 0.79 0.79
RoBERTa 0.86 0.86 0.88 0.89 0.88 0.88 0.90 0.90
mBERT 0.89 0.89 0.90 0.90 0.91 0.91 0.92 0.92
XLNet 0.92 0.92 0.93 0.93 0.93 0.93 0.94 0.94

Table 17 presents the performance metrics of the individual deep learning models on the ARABICFAKETWEETS dataset, with all models trained for 10 epochs. With unsupervised FastText, the Bi-GRU model achieved the highest performance, with a precision of 0.95, a recall of 0.95, an F1 score of 0.96, and an accuracy of 0.96. With supervised FastText, the Bi-GRU model scored 0.97 for the F1 score, 0.97 for accuracy, and 0.96 for both precision and recall.

Table 17.

Results of supervised and unsupervised FastText with DL models on ARABICFAKETWEETS dataset.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
CNN 0.61 0.62 0.63 0.63 0.63 0.63 0.64 0.64
RNN 0.88 0.88 0.89 0.89 0.90 0.90 0.91 0.91
GRU 0.94 0.94 0.95 0.95 0.95 0.95 0.96 0.96
Bi-GRU 0.95 0.95 0.96 0.96 0.96 0.96 0.97 0.97
LSTM 0.91 0.91 0.92 0.92 0.92 0.92 0.93 0.93
Bi-LSTM 0.93 0.93 0.94 0.94 0.94 0.94 0.95 0.95

Results for the ensemble deep learning models on the ARABICFAKETWEETS dataset are shown in Table 18; all models were trained for 10 epochs. The proposed model (Bi-LSTM + Bi-GRU) achieved the highest performance among all the other models: with unsupervised FastText it scored an F1 score of 0.98 and an accuracy of 0.98, and with supervised FastText it scored 0.99 for both F1 score and accuracy.

Table 18.

Results of supervised and unsupervised FastText with ensemble DL models on ARABICFAKETWEETS.

Model Unsupervised FastText Supervised FastText
Precision Recall F1 Accuracy Precision Recall F1 Accuracy
CNN-LSTM 0.91 0.91 0.92 0.92 0.92 0.92 0.93 0.93
RNN-CNN 0.88 0.88 0.88 0.89 0.90 0.90 0.91 0.91
RNN-LSTM 0.93 0.93 0.93 0.94 0.94 0.94 0.95 0.95
(Bi-LSTM + Bi-GRU) 0.97 0.97 0.98 0.98 0.98 0.98 0.99 0.99

Figures 7 and 8 compare the F1 scores of the four ensemble DL models on the AFND and ARABICFAKETWEETS datasets. The proposed hybrid Bi-LSTM + Bi-GRU model clearly outperforms the other models, and using supervised FastText significantly enhances the performance of all models.

Fig. 7. F1 scores of the ensemble DL models with unsupervised and supervised FastText on the AFND dataset.

Fig. 8. F1 scores of the ensemble DL models with unsupervised and supervised FastText on the ARABICFAKETWEETS dataset.

Figure 9 shows the confusion matrices of our proposed model (Bi-LSTM + Bi-GRU) on both the AFND and ARABICFAKETWEETS datasets with both unsupervised and supervised FastText.

Fig. 9. Confusion matrices of Bi-LSTM + Bi-GRU on both datasets.

From the above results on both the AFND and ARABICFAKETWEETS datasets, it is clear that the proposed ensemble DL model (Bi-LSTM + Bi-GRU) with supervised FastText achieved the best results.

3- Error analysis

To understand the performance behavior of the models on the two datasets, we provide an error analysis of the misclassified instances for the most efficient classifiers. This analysis is based on the results obtained by running these models on the AFND dataset, which consists of 31,774 samples, and the ARABICFAKETWEETS dataset, which contains 12,245 samples. We also analyze how the features and attributes of the Arabic samples affect the efficiency of the proposed Bi-LSTM + Bi-GRU model, which achieved the highest performance.

A- Error analysis of the misclassified instances with unsupervised FastText

For the AFND dataset, the results obtained with unsupervised FastText highlight significant differences in model performance. The RNN-CNN model, with an accuracy of 0.83, produces approximately 5,401 misclassified instances. The CNN-LSTM model, achieving an accuracy of 0.84, has around 5,086 misclassified instances. The RNN-LSTM model performed better, with an accuracy of 0.87, resulting in approximately 4,131 misclassified instances. The proposed Bi-LSTM + Bi-GRU model demonstrated superior performance with an accuracy of 0.96, resulting in only about 1,271 misclassified instances. This substantial improvement underscores the effectiveness of the Bi-LSTM + Bi-GRU architecture in accurately classifying instances within the dataset.
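
These counts follow directly from each model's accuracy and the dataset size; a small sketch reproduces them (minor offsets from the reported figures reflect rounding of the published accuracies):

```python
N_AFND = 31_774   # AFND sample count

afnd_acc = {"RNN-CNN": 0.83, "CNN-LSTM": 0.84,
            "RNN-LSTM": 0.87, "Bi-LSTM+Bi-GRU": 0.96}
for model, acc in afnd_acc.items():
    errors = round((1 - acc) * N_AFND)   # misclassified = (1 - accuracy) * N
    print(f"{model}: ~{errors} misclassified")   # ≈ 5402, 5084, 4131, 1271
```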

For the ARABICFAKETWEETS dataset, the CNN-LSTM model, with an accuracy of 0.92, results in approximately 970 misclassified instances. The RNN-CNN model, which achieves an accuracy of 0.89, has around 1,347 misclassified instances. The RNN-LSTM model performs better, with an accuracy of 0.94, resulting in approximately 735 misclassified instances. The proposed Bi-LSTM + Bi-GRU model exhibited exceptional performance with an accuracy of 0.98, leading to only approximately 245 misclassified instances.

This significant enhancement highlights the effectiveness of the Bi-LSTM-Bi-GRU architecture in accurately classifying entries within the dataset. Compared to other models, the proposed Bi-LSTM-Bi-GRU model substantially reduced the error rate, demonstrating its robustness and suitability for more reliable real-world applications. This analysis indicated that the proposed model surpassed other models in terms of accuracy and significantly minimized misclassification, making it a more efficient option for this dataset.

B- Error analysis of the misclassified instances with supervised FastText

For the AFND dataset with supervised FastText, the CNN-LSTM model reported an accuracy of 0.86, resulting in approximately 4,447 misclassified instances. The RNN-CNN model, with an accuracy of 0.85, has around 4,766 misclassified instances. The RNN-LSTM model performs better, achieving an accuracy of 0.89 and leading to approximately 3,494 misclassified instances. These figures suggest that while CNN-LSTM and RNN-CNN have comparable performance, RNN-LSTM significantly reduces the number of errors, making it a more reliable choice. The proposed Bi-LSTM + Bi-GRU model demonstrates superior performance with an accuracy of 0.98, resulting in only around 635 misclassified instances.

For the ARABICFAKETWEETS dataset with supervised FastText, a significant enhancement in model performance occurs. The CNN-LSTM model, with an accuracy of 0.93, results in approximately 849 misclassified instances. The RNN-CNN model achieves an accuracy of 0.91, leading to around 1,097 misclassified instances. The RNN-LSTM model performs better, with an accuracy of 0.95, leading to approximately 612 misclassified instances. These figures indicate that while the CNN-LSTM and RNN-CNN models have relatively higher error rates, the RNN-LSTM model notably reduces misclassification. The proposed model, combining Bi-LSTM and Bi-GRU, demonstrates exceptional performance with an accuracy of 0.99, resulting in only about 122 misclassified instances. This significant improvement underscores the effectiveness of the Bi-LSTM + Bi-GRU architecture in accurately classifying instances within the dataset.

C- Error analysis of the proposed Bi-LSTM + Bi-GRU model due to the nature of the dataset samples

Despite the superior performance of the proposed model (Bi-LSTM + Bi-GRU) in classifying tweets, some misclassifications still occur due to several inherent challenges. One primary reason is the ambiguity and overlapping features within the tweets themselves. Fake and real news tweets often share similar linguistic characteristics, making it difficult for the model to differentiate between them accurately. Additionally, tweets frequently contain informal language, slang, abbreviations, and emojis, which introduce noise and can confuse the model. Furthermore, errors in the labeled training data, where fake tweets are incorrectly labeled as real and vice versa, can also contribute to misclassifications.

Another significant factor is the Arabic language’s complexity and the tweet content’s diversity. Arabic tweets can vary widely in dialect, spelling, and colloquial expressions, challenging the model. Tweets that employ sarcasm, irony, or emotionally charged language are particularly difficult to classify correctly, as the literal content may be misleading. Moreover, the dynamic nature of news, where what is true at one point can be considered false later, adds to the difficulty. If the training data is not up-to-date or lacks comprehensive coverage of different topics and writing styles, the model may struggle with tweets that deviate from the patterns it learned during training. Addressing these challenges through improved data preprocessing, enhancing the diversity and quality of training data, and incorporating techniques to handle language nuances can help further reduce the number of misclassified instances.

4- Statistical significance analysis of the results

Because the supervised FastText techniques outperformed the unsupervised ones, we conducted a statistical significance analysis 58 of the supervised FastText results on the two datasets, AFND and ARABICFAKETWEETS. Under the framework of the t-test, the null hypothesis (H0) posited that there is no discernible difference in performance between the proposed ensemble model and the other hybrid DL models. Conversely, the alternative hypothesis (H1) asserted that there is a statistically significant difference in accuracy between the proposed Bi-LSTM + Bi-GRU model and each of the hybrid DL models.

The statistical significance analysis for the AFND and ARABICFAKETWEETS datasets using a paired t-test reveals that the proposed Bi-LSTM + Bi-GRU model significantly outperformed other classifiers. For the AFND dataset, comparisons with CNN-LSTM, RNN-CNN, and RNN-LSTM yielded t-statistics of 24.76, 26.48, and 19.05, respectively, with p-values less than 0.0001. This indicated that the proposed model’s superior accuracy is statistically significant compared to these baseline models, highlighting its effectiveness in classifying data accurately. Similarly, in the ARABICFAKETWEETS dataset, the proposed model also demonstrated substantial improvements over CNN-LSTM, RNN-CNN, and RNN-LSTM, with t-statistics of 11.33, 16.70, and 10.81, respectively, and p-values below 0.0001. These results confirmed that the Bi-LSTM + Bi-GRU model achieved markedly higher accuracy than the other classifiers, underlining its strong performance across different datasets. The consistently low p-values across all comparisons reinforced the robustness of the proposed model’s advantage.
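
For reference, a paired t-test of this kind can be carried out with SciPy; the sketch below uses hypothetical per-run accuracy samples, since the paper reports only the resulting t-statistics and p-values:

```python
from scipy.stats import ttest_rel

# Hypothetical paired accuracy samples (e.g., per run or per fold)
proposed = [0.980, 0.979, 0.981, 0.978, 0.982]   # Bi-LSTM + Bi-GRU
baseline = [0.890, 0.888, 0.892, 0.887, 0.891]   # e.g., RNN-LSTM

t_stat, p_value = ttest_rel(proposed, baseline)
# Reject H0 (no difference) when p_value falls below the chosen
# significance level, e.g. alpha = 0.05.
print(f"t = {t_stat:.2f}, p = {p_value:.6f}")
```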

5- Comparison with state-of-the-art methods

In this section, we compare the proposed Bi-LSTM + Bi-GRU model, which achieved the highest performance in this paper, against other state-of-the-art Arabic fake news detection models. For a fair comparison, we consider only models evaluated on the AFND and ARABICFAKETWEETS datasets used in this paper.

A- Comparison with models working on the AFND dataset

Table 19 compares our proposed Bi-LSTM + Bi-GRU model using supervised FastText with CAPSNET 59 and CNN-LSTM 60. The table indicates that the proposed model significantly outperforms the other models, obtaining an F1 score of 0.98, an accuracy of 0.98, a recall of 0.97, and a precision of 0.97. Our work thus surpasses the current state-of-the-art models on this dataset.

Table 19.

Comparison between our proposed model (Bi-LSTM + Bi-GRU) and other models on the same dataset (AFND).

Model Precision Recall F1 Accuracy
CAPSNET 59 0.77 0.76 0.78 0.78
CNN-LSTM 60 0.80 0.80 0.81 0.81
Bi-LSTM + Bi-GRU 0.97 0.97 0.98 0.98

B- Comparison with models working on the ARABICFAKETWEETS dataset

For the ARABICFAKETWEETS dataset, we compared our proposed model (Bi-LSTM + Bi-GRU) using supervised FastText with the state-of-the-art ARBERT 55 model; the results are shown in Table 20. Our model outperformed ARBERT, achieving 0.99 for both the F1 score and accuracy; however, ARBERT scored a slightly higher precision of 0.99 compared to our model's 0.98.

Table 20.

Comparison of (Bi-LSTM + Bi-GRU) with the ARBERT model on the ARABICFAKETWEETS dataset.

Model Precision Recall F1 Accuracy
ARBERT 55 0.99 0.98 0.98 0.98
Bi-LSTM + Bi-GRU 0.98 0.98 0.99 0.99

6- Evaluation of the computational time and space complexity of the Bi-LSTM + Bi-GRU model

In the classification performance evaluation, it was clearly shown that the ensemble models achieved superior performance compared with the other machine learning and deep learning models, and that the proposed Bi-LSTM + Bi-GRU model showed the best results. In this section, we evaluate the computational efficiency of these ensemble models by comparing their time and space complexity. In general, the complexity of a combined model such as RNN-CNN, CNN-LSTM, RNN-LSTM, or Bi-LSTM + Bi-GRU is computed by summing the complexities of its components 52. Hence, for RNN-CNN, the time complexity is O(n·m·d²) + O(n·k·d·f²), where n is the sequence length, m is the number of samples, d is the hidden state size, k is the number of filters, and f is the filter size; the space complexity is O(n·m·d) + O(n·k·d·f). For CNN-LSTM, the time complexity is O(n·k·d·f²) + O(n·m·d²), and the space complexity is O(n·k·d·f) + O(n·m·d). For RNN-LSTM, the time complexity is O(2·n·m·d²), and the space complexity is O(2·n·m·d).

For the proposed Bi-LSTM + Bi-GRU model, the time complexity is calculated by summing the time complexity of Bi-LSTM, O(3·n·m·d²), and the time complexity of Bi-GRU, O(n·m·d²), which yields a time complexity of O(4·n·m·d²). This means that d, the hidden state size, has the largest effect on the computation time, which grows quadratically with its value. However, in comparison with the other hybrid models, we note that they all share this relation with d, as they all grow proportionally to d².

Similarly, for the space complexity, we add the space complexity of Bi-LSTM, O(2·n·m·d), to the space complexity of Bi-GRU, O(n·m·d), yielding a space complexity of O(3·n·m·d). This means that the space complexity grows linearly with n, m, and d, a relation shared by the hybrid models mentioned above.
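
Restating the two sums compactly (constant factors shown before being absorbed into the big-O):

```latex
% Time complexity of the proposed hybrid: component sum, constants then absorbed
T_{\text{Bi-LSTM+Bi-GRU}} = O(3\,n\,m\,d^{2}) + O(n\,m\,d^{2}) = O(4\,n\,m\,d^{2}) = O(n\,m\,d^{2})
% Space complexity: linear in n, m, and d
S_{\text{Bi-LSTM+Bi-GRU}} = O(2\,n\,m\,d) + O(n\,m\,d) = O(3\,n\,m\,d) = O(n\,m\,d)
```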

The above complexity analysis indicates that both the time and space complexity of the Bi-LSTM + Bi-GRU model are comparable with those of the other models in terms of the time and memory needed for processing and storing data, activations, and model parameters. This analysis aids in understanding the viability and efficiency of deploying these models, particularly in the context of large datasets and real-time applications.

Discussion

Our study’s findings show that the proposed Bi-LSTM + Bi-GRU model outperforms other models in identifying fake news in the AFND and ARABICFAKETWEETS datasets. Its high precision, recall, F1 score, and accuracy demonstrate the model’s effectiveness and support its potential for real-world use in fake news identification. Consistent performance across both datasets highlights the model’s robustness and generalizability.

On the AFND and ARABICFAKETWEETS datasets, the proposed model, Bi-LSTM + Bi-GRU with supervised FastText, performed effectively, with accuracy, precision, recall, and F1-scores ranging from 0.97 to 0.99.

Implications

When considering a broader comparison among different models, including traditional machine learning, ensemble learning models, deep learning models, and hybrid deep learning models, our proposed model consistently outperformed the others. Moreover, it showed noticeable performance in detecting fake news when compared with some state-of-the-art Arabic fake news detection models. This makes our proposed model a valuable reference for detecting fake news in Arab countries. This is an essential step toward enriching the field of Arabic fake news detection, which is still in its early stages and requires significant efforts to expand it.

Limitations

Even with the remarkable outcomes of this research, there are a few possible drawbacks. The datasets’ specificity is one of the main issues: if the datasets do not represent the broader Arabic text corpus, the model may not perform well on certain text types or topics. The complexity and variety of Arabic dialects present another significant challenge. Arabic is not a homogeneous language; it contains multiple dialects that can differ significantly from Modern Standard Arabic (MSA), the variety typically used in written documents. While the model may work well on MSA, it may be less generalizable or effective in other contexts because of regional dialects and informal language, which are common on social media. Another drawback is the rich morphology of the Arabic language, which allows words to take on a wide variety of forms through prefixes, suffixes, and infixes. This complexity makes tokenization and text preparation more difficult, raising the possibility of errors that affect the model’s performance.

Furthermore, Arabic script is intrinsically ambiguous, with many words having several meanings depending on the context. This makes it challenging for the model to distinguish between different meanings correctly. For these linguistic difficulties to be overcome and to retain good performance across various datasets and real-world applications, more advanced preprocessing methods and resilient models that can handle the subtleties of the Arabic language are required.

The interpretability of the model is another limitation; deep learning models such as Bi-LSTM and Bi-GRU are frequently seen as "black boxes," making it challenging to understand how they reach their decisions. Moreover, such models usually require substantial computational resources, so not all users may have access to the sophisticated hardware they demand.

Future work

Future research should aim to create new approaches that improve the transparency and reliability of AI systems in identifying false news. It is also essential to keep updating detection techniques as disinformation strategies change.

To ensure these models remain effective over time, future research should concentrate on modifying them to identify and react to novel forms of misinformation tactics and fake news. Future research can improve the results of this study by addressing these areas and help create more transparent, flexible, and successful fake news detection systems.

Improving the quality and diversity of the training data is crucial to addressing misclassification issues in future work. Expanding the dataset to include more diverse examples, such as tweets from different regions and contexts, will help the model generalize better. Employing advanced data augmentation techniques, like synonym replacement and back-translation, can generate new training examples. Additionally, ensuring the accuracy of labeled data through rigorous annotation processes and cross-verifying labels can significantly reduce errors.

Leveraging transfer learning from larger, pre-trained models can also enhance performance. Additionally, integrating external knowledge bases and considering the social context of tweets, such as user profiles and retweet patterns, can provide valuable context for verifying information. Implementing online learning techniques to keep the model current with evolving language and trends and employing active learning to iteratively improve the model by focusing on challenging examples can ensure continuous improvement and adaptation.

The results of this study can be expanded by examining several vital topics. Combining text with multimodal data (for example, pictures and videos) may improve fake news detection even further, and subsequent research ought to investigate how merging disparate data types affects model efficacy. To enhance the resilience and relevance of fake news detection models, further studies should broaden the scope of datasets to encompass a greater variety of languages and cultural backgrounds.

This can assist in addressing the linguistic subtleties and cultural variances in fake news. A useful development would be creating real-time detection systems that can process and analyze information as it is distributed; research should concentrate on optimizing models for speed and efficiency to manage massive volumes of data in real time. Further developments in explainable AI (XAI) methods may also offer a deeper understanding of how complicated models make decisions.

Conclusion

This study addresses the urgent problem of fake news spreading on social media by providing a thorough and reliable methodology for identifying fake news. We thoroughly assessed the effectiveness of FastText embeddings in conjunction with cutting-edge ML- and DL-based methods on two different datasets, focusing on optimization tactics to prevent overfitting and address generalizability. We showed that models leveraging deep learning and machine learning, particularly those utilizing text-based linguistic features in Arabic, effectively detect fake news. While no single traditional machine learning algorithm or boosting method consistently and significantly outperforms the others, ensemble deep learning techniques achieve high performance. This paper introduced a novel integration of FastText embeddings with advanced machine learning and deep learning models, specifically tailored to enhance the understanding and processing of Arabic linguistic features. We also developed and evaluated four hybrid techniques to improve the performance of the fake news detection system: CNN + LSTM, RNN + CNN, RNN + LSTM, and Bi-LSTM + Bi-GRU. These hybrid models consistently demonstrated superior performance in fake news detection tasks. Our results indicate that the hybrid model integrating Bi-LSTM with Bi-GRU layers, enhanced with FastText embeddings, significantly outperforms the other models in accurately classifying news articles.

Additionally, the transformer-based models demonstrated their ability to capture complex syntactic structures, enhancing semantic comprehension. Our proposed model, Bi-LSTM + Bi-GRU, exhibited strong performance from the initial epochs, and its performance continued to improve with further training, ultimately achieving impressive F1 scores of 0.98 and 0.99 on the AFND and ARABICFAKETWEETS datasets, respectively. In a broader comparison spanning traditional machine learning, ensemble learning, deep learning, and hybrid deep learning models, our proposed model consistently outperformed the others. This makes it a valuable reference for detecting fake news in Arab countries. In future work, we aim to investigate several approaches to improve the model’s applicability and effectiveness. One possible approach to increasing detection accuracy is to include more Arabic-specific linguistic elements, such as dialect variations and stylistic nuances. Performance gains may also be obtained through transfer learning from pre-trained language models fine-tuned on Arabic text corpora. Creating lightweight models that preserve high accuracy while remaining computationally efficient, allowing deployment on portable and low-power devices, is another crucial direction. We also aim to extend fake news detection to multiple languages by investigating the potential of multilingual transformers such as mBERT, mT5, and GPT, and to make further progress against the global problem of fake news propagation by implementing adversarial training techniques.

Acknowledgments

We acknowledge the use of the data and results of other researchers in this field, which aided us in producing this work. We thank the professors in our department whose valuable comments and suggestions helped improve and clarify this manuscript.

Author contributions

Mohammed E. Almandouh: preparing the original draft, data curation, investigating, and writing. Mohammed F. Alrahmawy: supervision, conceptualization, investigating, and reviewing. Mohamed Eisa: resources, editing, and reviewing. Mohamed Elhoseny: sharing ideas and reviewing. A. S. Tolba: supervision, editing, and reviewing.

Funding

Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB).

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Competing interests

There are no conflicts of interest declared by the authors.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Johnson, R. M. ‘social media and free speech: A collision course that threatens democracy’. Ohio Northern Univ. Law Rev.49(2), 5 (2023). [Google Scholar]
  • 2.Rastogi, S. & Bansal, D. ‘A review on fake news detection 3T’s: Typology, time of detection, taxonomies’. Int. J. Inf. Secur.22(1), 177–212 (Feb.2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kang, M., Seo, J., Park, C. & Lim, H. Utilization Strategy of User Engagements in Korean Fake News Detection. IEEE Access10, 79516–79525. 10.1109/ACCESS.2022.3194269 (2022). [Google Scholar]
  • 4.Capuano, N., Fenza, G., Loia, V. & Nota, F. D. ‘Content-based fake news detection with machine and deep learning: A systematic review’. Neurocomputing530, 91–103 (Apr.2023). [Google Scholar]
  • 5.Miró-Llinares, F. & Aguerri, J. C. ‘Misinformation about fake news: A systematic critical review of empirical studies on the phenomenon and its status as a threat’,’. Eur. J. Criminol.20(1), 356–374 (Jan.2023). [Google Scholar]
  • 6.Saleh, H., Alharbi, A. & Alsamhi, S. H. OPCNN-FAKE: Optimized Convolutional Neural Network for Fake News Detection. IEEE Access9, 129471–129489. 10.1109/ACCESS.2021.3112806 (2021). [Google Scholar]
  • 7.Rohera, D. et al. A Taxonomy of Fake News Classification Techniques: Survey and Implementation Aspects. IEEE Access10, 30367–30394. 10.1109/ACCESS.2022.3159651 (2022). [Google Scholar]
  • 8.Guo, Y. & Song, W. A Temporal-and-Spatial Flow Based Multimodal Fake News Detection by Pooling and Attention Blocks. IEEE Access10, 131498–131508. 10.1109/ACCESS.2022.3229762 (2022). [Google Scholar]
  • 9.Shishah, W. JointBert for Detecting Arabic Fake News. IEEE Access10, 71951–71960. 10.1109/ACCESS.2022.3185083 (2022). [Google Scholar]
  • 10.Ali, H. et al. All Your Fake Detector are Belong to Us: Evaluating Adversarial Robustness of FakeNews Detectors Under Black-Box Settings. IEEE Access9, 81678–81692. 10.1109/ACCESS.2021.3085875 (2021). [Google Scholar]
  • 11.Shahid, W. et al. Are You a Cyborg, Bot or Human? —A Survey on Detecting Fake News Spreaders. IEEE Access10, 27069–27083. 10.1109/ACCESS.2022.3157724 (2022). [Google Scholar]
  • 12.Jarrahi, A. & Safari, L. ‘Evaluating the effectiveness of publishers’ features in fake news detection on social media’. Multimedia Tools Appl.82(2), 2913–2939 (Jan.2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rodríguez-Ferrándiz, R. ‘An overview of the fake news phenomenon: From untruth-driven to post-truth-driven approaches’. Media Commun.11(2), 15–29 (Apr.2023). [Google Scholar]
  • 14.M. R. Kondamudi, S. R. Sahoo, L. Chouhan, and N. Yadav, ‘‘A comprehensive survey of fake news in social networks: Attributes, features, and detection approaches,’’ J. King Saud Univ.-Comput. Inf. Sci., vol. 35, no. 6, Jun. 2023, Art. no. 101571.
  • 15.Węcel, K. et al. ‘Artificial intelligence—Friend or foe in fake news campaigns’. Econ. Bus. Rev.9(2), 41–70 (2023). [Google Scholar]
  • 16.Altheneyan, A. & Alhadlaq, A. ‘Big data ML-based fake news detection using distributed learning’. IEEE Access11, 29447–29463 (2023). [Google Scholar]
  • 17.Silverman, C. ‘This analysis shows how viral fake election news stories outperformed real news on Facebook’. BuzzFeed news16, 24 (Jan.2016). [Google Scholar]
  • 18.Sansonetti, G., Gasparetti, F., D’Aniello, G. & Micarelli, A. ‘Unreliable users detection in social media: Deep learning techniques for automatic detection’. IEEE Access8, 213154–213167 (2020). [Google Scholar]
  • 19.S. D. M. Kumar and A. M. Chacko, ‘‘A systematic survey on explainable AI applied to fake news detection,’’ Eng. Appl. Artif. Intell., vol. 122, Jun. 2023, Art. no. 106087.
  • 20.Han, B., Han, X., Zhang, H., Li, J. & Cao, X. Fighting Fake News: Two Stream Network for Deepfake Detection via Learnable SRM. IEEE Transactions on Biometrics, Behavior, and Identity Science3(3), 320–331. 10.1109/TBIOM.2021.3065735 (2021). [Google Scholar]
  • 21.Verma, P. K., Agrawal, P., Amorim, I. & Prodan, R. ‘WELFake: Word embedding over linguistic features for fake news detection’. IEEE Trans. Computat. Social Syst.8(4), 881–893 (Aug.2021). [Google Scholar]
  • 22.Shu, K., Mahudeswaran, D., Wang, S., Lee, D. & Liu, H. ‘FakeNewsNet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media’. Big Data8(3), 171–188 (Jun.2020). [DOI] [PubMed] [Google Scholar]
  • 23.Truică, C.-O. & Apostol, E.-S. ‘It’s all in the embedding! Fake news detection using document embeddings’. Mathematics11(3), 508 (Jan.2023). [Google Scholar]
  • 24.Joloudari, J. H. et al. ‘BERT-deep CNN: State of the art for sentiment analysis of COVID-19 tweets’. Social Netw. Anal. Mining13(1), 99 (Jul.2023). [Google Scholar]
  • 25.D. Antony, S. Abhishek, S. Singh, S. Kodagali, N. Darapaneni, M. Rao, and A. R. Paduri, ‘‘A survey of advanced methods for efficient text summarization,’’ in Proc. IEEE 13th Annu. Comput. Commun. Workshop Conf. (CCWC), Mar. 2023, pp. 0962–0968.
  • 26.J. Briskilal and C. N. Subalalitha, ‘‘An ensemble model for classifying idioms and literal texts using BERT and RoBERTa,’’ Inf. Process. Manage., vol. 59, no. 1, Jan. 2022, Art. no. 102756.
  • 27.Johnson, S. J., Murty, M. R. & Navakanth, I. ‘A detailed review on word embedding techniques with emphasis on word2vec’. Multimedia Tools Appl.2023, 1–29 (Oct.2023). [Google Scholar]
  • 28.Umer, M. et al. ‘Impact of convolutional neural network and FastText embedding on text classification’. Multimedia Tools Appl.82(4), 5569–5585 (Feb.2023). [Google Scholar]
  • 29.Nanade, A. & Kumar, A. ‘Combating fake news on Twitter: A machine learning approach for detecting and classifying fake tweets’. Int. J. Intell. Syst. Appl. Eng.12(1), 424–436 (2024). [Google Scholar]
  • 30.Verma, P. K., Agrawal, P., Madaan, V. & Prodan, R. ‘MCred: Multimodal message credibility for fake news detection using BERT and CNN’. J. Ambient Intell. Humanized Comput.14(8), 10617–10629 (Aug.2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Z. Guo, Q. Zhang, F. Ding, X. Zhu, and K. Yu, ‘‘A novel fake news detection model for context of mixed languages through multiscale transformer,’’ IEEE Trans. Computat. Social Syst., 2024.
  • 32.A. Praseed, J. Rodrigues, and P. S. Thilagam, ‘‘Hindi fake news detection using transformer ensembles,’’ Eng. Appl. Artif. Intell., vol. 119, Mar. 2023, Art. no. 105731.
  • 33.K. Subramanyam Kalyan, A. Rajasekharan, and S. Sangeetha, ‘‘AMMUS: A survey of transformer-based pre-trained models in natural language processing,’’ 2021, arXiv:2108.05542.
  • 34.M. Bhardwaj, M. Shad Akhtar, A. Ekbal, A. Das, and T. Chakraborty, ‘‘Hostility detection dataset in Hindi,’’ 2020, arXiv:2011.03588.
  • 35.J. Wu, W. Xu, Q. Liu, S. Wu, and L. Wang, ‘‘Adversarial contrastive learning for evidence-aware fake news detection with graph neural networks,’’ IEEE Trans. Knowl. Data Eng., 2023.
  • 36.K. Popat, S. Mukherjee, J. Strötgen, and G. Weikum, ‘‘Where the truth lies: Explaining the credibility of emerging claims on the Web and social media,’’ in Proc. 26th Int. Conf. World Wide Web Companion, 2017, pp. 1003–1012.
  • 37.A. Vlachos and S. Riedel, ‘‘Fact checking: Task definition and dataset construction,’’ in Proc. ACL Workshop Lang. Technol. Comput. Social Sci., 2014, pp. 18–22.
  • 38.Soga, K., Yoshida, S. & Muneyasu, M. ‘Exploiting stance similarity and graph neural networks for fake news detection’. Pattern Recognit. Lett.177, 26–32 (Jan.2024). [Google Scholar]
  • 39.Ying, L., Yu, H., Wang, J., Ji, Y. & Qian, S. Multi-Level Multi-Modal Cross-Attention Network for Fake News Detection. IEEE Access9, 132363–132373. 10.1109/ACCESS.2021.3114093 (2021). [Google Scholar]
  • 40.Babaei, M. et al. Analyzing Biases in Perception of Truth in News Stories and Their Implications for Fact Checking. IEEE Transactions on Computational Social Systems9(3), 839–850. 10.1109/TCSS.2021.3096038 (2022). [Google Scholar]
  • 41.Dong, X., Victor, U. & Qian, L. Two-Path Deep Semis supervised Learning for Timely Fake News Detection. IEEE Transactions on Computational Social Systems7(6), 1386–1398. 10.1109/TCSS.2020.3027639 (2020). [Google Scholar]
  • 42.I. A. Pilkevych, D. L. Fedorchuk, M. P. Romanchuk, and O. M. Naumchak, ‘‘An analysis of the approach to the fake news assessment based on the graph neural networks,’’ in Proc. CEUR Workshop, vol. 3374, 2023, pp. 56–65.
  • 43.Dahou, A. et al. ‘Linguistic feature fusion for Arabic fake news detection and named entity recognition using reinforcement learning and swarm optimization’. Neurocomputing, vol. 1, pp. 1–18, 2024.
  • 44.Dahou, A. et al. ‘Optimizing fake news detection for Arabic context: A multitask learning approach with transformers and an enhanced Nutcracker Optimization Algorithm’. Knowledge-Based Systems, vol. 1, pp. 1–15, 2023.
  • 45.Alotaibi, T. & Al-Dossari, H. A Review of Fake News Detection Techniques for Arabic Language. International Journal of Advanced Computer Science & Applications1, 1–15 (2024). [Google Scholar]
  • 46.Nassif, Ali Bou, et al. "Arabic fake news detection based on deep contextualized embedding models." Neural Computing and Applications 34.18, 2022. [DOI] [PMC free article] [PubMed]
  • 47.Khalil, A., Jarrah, M., Aldwairi, M., & Jaradat, M, “AFND: Arabic fake news dataset for the detection and classification of articles credibility,” Data in Brief, vol 42, 2022. [DOI] [PMC free article] [PubMed]
  • 48.https://data.mendeley.com/datasets/9sht4t6cpf/2, (last accessed 20–7–2024).
  • 49.Alyoubi, S., Kalkatawi, M. & Abukhodair, F. ‘The detection of fake news in Arabic tweets using deep learning’. Applied Sciences 13(14), 2023.
  • 50.NLTK: Natural Language Toolkit, (last accessed 21–5–2024).
  • 51.Habash, Nizar. "Four techniques for online handling of out-of-vocabulary words in Arabic-English statistical machine translation." Proceedings of ACL-08: HLT, Short Papers. 2008.
  • 52.Goodfellow, Ian, Yoshua Bengio, and Aaron Courville, “Deep learning,” MIT Press, pp.18–28,2016.
  • 53.Beck, C., Jentzen, A. & Kuckuck, B. Full error analysis for the training of deep neural networks. Infinite Dimensional Analysis, Quantum Probability, and Related Topics2, 1–15 (2022). [Google Scholar]
  • 54.T. Mikolov, E. Grave, P. Bojanowski, C. Puhrsch, and A. Joulin, ‘‘Advances in pre-training distributed word representations,’’ arXiv preprint arXiv:1712.09405, 2017.
  • 55.Powers, D. M. “Evaluation: from precision, recall, and F-measure to ROC, informedness, markedness, and correlation.” arXiv preprint arXiv:2010.16061, 2020.
  • 56.Brown, C. D. & Davis, H. T. Receiver operating characteristics curves and related decision measures: A tutorial. Chemometrics and Intelligent Laboratory Systems80(1), 24–38 (January 2006). [Google Scholar]
  • 57.Powers, D. M,” Evaluation: from precision, recall, and F-measure to ROC, informedness, markedness and correlation.” arXiv preprint arXiv:2010.16061, 2020.
  • 58.Mindrila, D. & Balentyne, P. ‘Tests of significance’. The Basic Practice of Statistics, vol. 6, pp. 2–12, 2013.
  • 59.Khalil, A. et al. ‘Detecting Arabic fake news using machine learning’. Second International Conference on Intelligent Data Science Technologies and Applications (IDSTA), IEEE, vol. 1, pp. 1–15, 2021.
  • 60.Sorour, S. & Adaalkader, H. AFND: Arabic Fake news detection with an ensemble deep CNN-LSTM model. Journal of Theoretical and Applied Information Technology100(14), 5072–5086 (2022). [Google Scholar]
