A novel hybrid multi-thread metaheuristic approach for fake news detection in social media

Gungor Yildirim

doi:10.1007/s10489-022-03972-9

. 2022 Sep 2;53(9):11182–11202. doi: 10.1007/s10489-022-03972-9

A novel hybrid multi-thread metaheuristic approach for fake news detection in social media

Gungor Yildirim ^1,^✉

PMCID: PMC9436741 PMID: 36068811

Abstract

In fake news detection, intelligent optimization seems to be a more effective and explainable solution methodology than the black-box methods that have been extensively used in the literature. This study takes the optimization-based method one step further and proposes a novel, multi-thread hybrid metaheuristic approach for fake news detection in social media. The most innovative feature of the proposed method is that it uses a supervisor thread mechanism, which simultaneously monitors and improves the performance and search patterns of metaheuristic algorithms running parallel. With the supervisor thread mechanism, it is possible to analyse different key attribute combinations in the search space. In addition, this study develops a software framework that allows this model to be implemented easily. It tests the performance of the proposed model on three different data sets, respectively containing news about Covid-19, the Syrian War, and daily politics. The proposed method is evaluated in comparison to the results of fifteen different well-known deep models and classification algorithms. Experimental results prove the success of the proposed model and that it can produce competitive results.

Keywords: Fake news detection, Metaheuristic, Multi-threading, Optimization

Introduction

The evolution of the print media to an online broadcasting model, together with the transformation of social networks into news producing-consuming platforms is causing an enormous flow of both news and data, regarding volume, velocity, and variety [36]. Since production and distribution is rather easy, there is nowadays an acceleration of the circulation of fake or fabricated content, as well as real content. This rapid change and transformation in communication technologies leads to people to be vulnerable to the news and data flow they are exposed to. Every day, and almost every second, huge amounts of information and news are presented to millions of users on different platforms, while most of them are unverified. Consequently, people often spread the news they encounter online without seeking any verification, causing fake/false/fabricated information and content to spread at a high speed [41, 42]. This spread has major negative effects on individuals and societies, which include fear, hopelessness, anger, and prejudice. From a holistic point of view, in addition to its individual and social effects, many other negative effects are observed; from reputational and commercial losses of social media companies, to social polarization, to triggering regional and international crises, to manipulation of political and commercial activities, or to the creation of insecurity in the society. Nowadays, fake news has the ability to control and to shape the scientific, social, and religious realities, among other beliefs and relations. Recently, fake news started its transformation into an asymmetrical attack tool by making use of social engineering elements. As a result of the search for a solution to this problem, a new interdisciplinary research area has emerged in the last 5 years, fake news detection. It has aroused curiosity in many different disciplines, but more specifically computer science.

The content of news usually consists of text, audio, visual, and video components. Considering the dissemination speed, size, diversity, and heterogeneity of data in the online environment, it is a very difficult task to develop an inclusive and systematic tool that can detect the truth of the news. In this context, the analysis of text, which is the most dominant type of content among the components that make up the news, is of great importance. The primary point of reference for the analysis of these text contents is the Natural Language Processing (NLP) discipline. The detection of intentionally misleading news is based on the analysis of several examples of both fraudulent and truthful previously reviewed news. The spread of fake news in different channels, especially online platforms, was not yet completely stopped or reduced to some extent. This happens because there is no system that can completely check for fake news with little or no human intervention.

There are a number of different fake news detection models, which can be categorized as illustrated in Fig. 1. Expert-fact-checkers are a small group of professionals in a variety of disciplines, who have the ability to confirm the accuracy of certain news, deciding afterwards whether that information is fake or genuine. The advantage of expert verification techniques is that they are easy to manage and highly accurate [47]. However, this expert-fact-checker technique is time-consuming, especially in cases where a large amount of information is given for their verification, due to their small number and also the need for manual annotators. As an alternative to expert-fact-checker, the crowdsourcing methodology relies on the collective approval of individuals or groups [31]. The crowd is made up of ordinary people from diverse backgrounds that has little knowledge of some of the news sites; and as a result, news sites they are not familiar with are flagged as untrusted. The major strength of crowdsourcing methods is that they are based on pluralism, since they use individuals with a diverse knowledge. On the other hand, the number of users actually affected by the event needs to be verified for reliability prior to the classification.

Previous studies show that machine learning algorithms may be capable of detecting fake news, having into account the high number of cases needed to be trained for these models to properly work [28]. These artificial intelligence (AI)-based systems seem efficient methods to automatically verify reality and/or detect fake news [16, 34]. The previously cited works have used AI and human analysis together, which was able to correctly block and tag social media accounts, as well as content, in some important situations and events that was flagged as fake. They also collaborate with fact-checking organizations that carry out intensive manual processes. In recent years, many researchers have carried out important studies to develop automated solutions to detect fake news, applying different methodologies [14, 23]. At this point, it important to state that although important steps have been taken to solve the problem, it cannot be said that satisfaction levels have yet been reached. The increase in the number of news verification organizations and the number of people working around this topic [30] can be considered as an indication that suitable (or optimal) solutions have not yet been found. Black-box nature, problems with computational efficiency and low interpretability seem to be the major disadvantages of these methods. Also, these models need a high degree of optimization to get the suitable values of the entire parameters.

The recommendation systems aim to validate news content that is considered original, and afterwards recommend those news articles that are ready for consumption [10, 26]. The collaborative filtering recommendation method recommends news content based on comments and ratings from other readers/users. The reliability level of this method is good, since the rating mechanism provides pluralism. The weaknesses of recommendation systems are often scalability, change in user interest, and recency of these methods. To determine the authenticity of the news, deep learning that applies deep neural network models have also been widely used in the literature. A combination of several deep learning methods such as Long Short-Term Memory network (LSTM), Convolutional Neural Network (CNN), and Bidirectional LSTM (Bi-LSTM) was applied to a four-class label relating to news article headlines [1]. Fan et al. proposed an LSTM-based model to detect false reports in an environmental complaint system [9]. Bhattacharya et al. developed a Bi-LSTM based fake news detection model, which is an advanced version of LSTM. This model is assertive in the classification of fake news and news source detection, by using blockchain networks [4]. CNN with both different embedding models and margin loss were proposed to detect fake news with a higher accuracy in [12, 13]. A model based on Capsule Neural Network (CapsNet) was proposed for fake news detection in [12, 13]. Different levels of n-grams and different embedding models for news items with various lengths were applied in their work. Deep learning-based methods are capable of automatically learning latent textual representation, while capturing complex contextual patterns of news content. A detailed research on contextualized text representation and deep neural classification can be found in [38]. In addition, in fake news analysis, visual data may also be a part of the general analysis. Especially in recent years, multimodal models in which visual data are evaluated together with textual data stand out in this field [43, 44]. Besides this, different aspects and factors can be considered together when using advanced multitask deep learning models. In [20], this type of multitask model was developed, applying it to an automated fake news detection that takes into account textual anomaly and emotion factors. Raj and Meel used a deep model with two streams, which was named Coupled ConvNet [33]. This evaluates the results, obtained from a text-CNN and an Image-CNN that can use eight different deep models, together with weighted fusion. Lotfi et al. proposed a graph convolutional network-based approach that could detect rumours on Twitter conversations [22]. Another rumour detection model, which combines recurrent neural network (RNN) and Autoencoder deep models, was proposed by Chen et al. [7]. Cao et al. developed a deceptive reviews detection model that combines feature representations from the Gated Recurrent Unit (GRU), TextCNN, and Self-Attention deep models. [6]. Overall, the deep models are flexible and can adapt to complex interaction patterns. However, there are also several challenges associated with these methods, such as the volume of needed data, model complexity, and interpretability difficulties.

NLP methods work within automated detection algorithms, which involve powerful mechanisms, such as semantic and lexical analysis. Linguistic features are key factors for NLP, which can include both text style and content. Style and grammar detector with a syntactic analyser, such as Stanford parser, was reported in [18] with accurate results. These methods have limitations on the generalization of hand-crafted linguistic features across languages, topics, domains, and also in the use of the rich contextual and semantic information [40]. The graph network fake news detection model examines news content from homogeneous and heterogeneous networks [44]. Zhou and Zafarani [48] examined the graph network-based fake news detection, where the network is split into triads, communities, and nodes. The result proved to yield better results, since the method could detect fake news before spreading them. Hierarchical Graph Attention Network (HGAT) that employed the Heterogeneous Information Networks (HIN)-based fake news article in [35], with higher accuracy. Early detection in fake news analysis is an important factor for the minimization of damages that can occur. In [41, 42] a successful propagation network-based fake news detection technique that takes this factor was developed. In summary, the graph networks can be good at early detection, but they also suffer from high computational costs, since many hyper-parameters need to be computed for these to work.

There are also hybrid methods to work well, due to both the ambiguous nature and complexity of fake news. A hybrid machine-crowd approach was proposed by [39]. The model employed the fusion of the collective effort of humans, together with that of machine learning, leading to a higher accuracy, when compared to previous studies. Hybrid deep learning models, expert-crowdsource, machine-crowdsource, and the fusion of methods from the content-based models and social context-based algorithms were also presented to use auxiliary information from different perspectives [8]. As expected, hybrid models inherit the strengths and weaknesses of all the used methods. Intelligent optimization is another methodology that can be used to detect fake news [27]. In order to obtain a better model with respect to different metrics, only one improved version of the intelligent optimization method was recently proposed [29]. In this study, the meta-heuristic approach is used. The reason is that these approaches are more explainable and suitable for parallel and hybrid work. However, these methods also have disadvantages, due to the need of sequential execution and large population management. These disadvantages may be more evident in fake news detection problems that involve multiple features. Therefore, this study proposes a different multi-thread hybrid meta-heuristic model for fake news analysis.

The next parts of the study are planned as follows; the objective of the study will be explained in Chapter 2. In Chapter 3, the basic principles of optimization-based fake news analysis are explained. The definition of the problem and the details of the proposed method are given in Chapter 4. Chapter 5 details the framework developed and the meta-heuristic algorithm used. The experiments, discussions and conclusions are presented in Chapter 6, Chapter 7 and in Chapter 8, respectively.

Purpose and innovative aspects of the proposed method

Deep models are more successful on complex and large data sets. On the other hand, deep models trained with relatively small data sets may not perform well. In addition, the black-box nature of machine learning and deep models makes them less explainable [45, 46]. In these two issues, metaheuristic approaches come to the fore. These approaches are both more explainable and adaptable and can be more successful in small and medium-sized data sets. However, there are two important bottlenecks in metaheuristic approaches. The first is the high population and sequential work could increase the time and resource cost. The second is that the efficiency of exploration and exploitation decreases as the number of features of the search space increases. Hybrid metaheuristic approaches are an important technique used to overcome these challenges [11, 32]. The hybrid approaches proposed in the literature are generally based on using sequential or simultaneous joint solutions. In these approaches, the exploration and exploitation mechanisms of the metaheuristic methods proceed naturally, which causes a non-dynamic exploration process. To overcome these bottlenecks, this study proposes a multi-thread hybrid approach, which combines the performances of different meta-heuristic algorithms with today’s parallel working technologies. This study introduces a new approach: the supervisor thread mechanism. The basic principle of this mechanism, which is proposed for the first time in the literature as far as is known, is to simultaneously observe and improve the performances of different meta-heuristic algorithms running in parallel in different threads. Also, the supervisor thread executes a meta-heuristic algorithm that can optimize the best values shared by other threads. The study develops a software framework based on this multi-thread model and proves its performance on different data sets. The original and innovative aspects of this study can be briefly summarized as follows:

It could be more efficient in small and medium-sized fake news data sets than deep models.
It proposes for the first time the use of the supervisor thread, which can observe and improve the meta-heuristic algorithm threads running in parallel.
Furthermore, this method can also be used in general-purpose solution search strategies.
Thanks to the supervisor thread, it enables combinations of unsearched or untested attributes in the search space for fake news detection.
It uses the parallel-hybrid optimization technique for the first time in fake news detection.
It separately observes both the single and multi-thread performances of different swarm-based meta-heuristic algorithms for fake news analysis.
It offers a software framework that ensures the easy applicability of the proposed model.

Basic principles of optimization-based fake news detection

Optimization-based fake news analysis considers the relevant unstructured textual data set as a search space. The optimization-based method requires a binary search space. To obtain this, the relevant textual data set is pre-processed. In the pre-processing stage, first, word roots are found by applying case conversion and some filtering operations (filtering number, N char, punctuation, etc.). Then the weights (W_i) of each word are calculated. This calculation uses the number of repetitions of each word in the data set. Thus, the weight of a word is calculated with $W_{i} = \frac{R_{i}}{R_{\max}}$ . Here, R_i is the number of repetitions of the ith word, and R_max is the maximum number of repetitions. Since the inclusion of words with very low weights in the optimization process will adversely affect performance, the search space includes words above a certain threshold value as attributes. Finally, the words included in the search space are scanned in each record in the data set. If the word is in the relevant record, it is evaluated as 1; if not, it is evaluated as 0. Thus, the dataset becomes a binary search space consisting of 1 s and 0 s.

Next, the method constructs population candidates for this search space. As an example, a swarm intelligence–based meta-heuristic algorithm has a population P containing N candidates ( $P = \{\vec{X_{1}}, \vec{X_{2}}, . . \vec{X_{N}}\}$ ). The variables of each candidate in the population take values between [0,1] ( $\vec{X_{j}} = \{f_{1}, f_{2}, . f_{i} . . f_{K}\}, f_{i} \in [0, 1], K = the number of the attributes, i \in [1, N] and i \in Z$ ). While calculating the fitness values of the candidates, each candidate is evaluated for each record in the edited data set. The fitness evaluations take into account two criteria. The first criterion is whether the similarity ratio between the candidate values and the related record is greater than a predefined threshold value (τ). For this, similarity functions such as Jaccard Similarity, given in Eq. 1, can be used. While performing the similarity test, the continuous value of the candidate can be used [17], or it can be converted to binary form. This study used binary representations of candidate values in similarity controls due to better performance. The second criterion is whether the class of the relevant record is the same as the candidate class. These two criteria are considered together, as shown in Table 1. Thus, the candidate can calculate the current true-positive (TP), true-negative (TN), false-positive (FP), and false-negative (FN) values. After this process is repeated for all records in the data set, Eq. 2 calculates the fitness value of the candidate [3]. The values of the best candidate found at the end of the iterations can provide Accuracy, Precision, and Recall metrics via (Eq. 3).

{Jaccard Value}_{\vec{X}} = \frac{\sum_{i = 1}^{K} Round (f_{i}) \times {Record}_{i}}{\sum_{i = 1}^{K} Round (f_{i}) + \sum_{i = 1}^{K} {Record}_{i} - \sum_{i = 0}^{K} Round (f_{i}) \times {Record}_{i}}

F = \frac{k_{1} \times TP \times TN}{(TP + FN) (TN + FP)} + \frac{k_{2} \times TP}{TP + FP} + \frac{k_{3} \times TN}{TN + FN}

Acuracy = \frac{TP + TN}{TP + TN + FP + FN} Precision = \frac{TP}{TP + FP} Recall = \frac{TP}{TP + FN}

Table 1.

Updating of TP, FP, FN, and TN for each record in the dataset

Condition	Updating
If ${Jaccard Value}_{\vec{X}} \geq τ$ and the Class searched = = the Record Class in the data set	Increase TP by 1
If ${Jaccard Value}_{\vec{X}} \geq τ$ and the Class searched! = the Record Class in the data set	Increase FP by 1
If ${Jaccard Value}_{\vec{X}} < τ$ and the Class searched = = the Record Class in the data set	Increase FN by 1
If ${Jaccard Value}_{\vec{X}} < τ$ and the Class searched! = the Record Class in the data set	Increase TN by 1

Sync.	Func. Type	Function Name
Yes	Getter / Setter	BestSolution1,2 and 3
Yes/No	Getter / Setter	Pattern1,2,..C-1
Yes	Getter / Setter	RecommendedCandidates1,2,.C-1
No	Setter / Getter	Terminated

Data set	Total Records	Training / Test Splitting Rate (%)	Attribute Words	Training data set size	Test data set size	Class
Covid-19	3119	70 / 30	134	2183 × 135 (134 + 1)	936 × 135	True/Fake
Syrian	804	70 / 30	109	563 × 110 (109 + 1)	241 × 110	True/Fake
General-news	44,858	70 / 30	136	31,401 × 137 (136 + 1)	13,457 × 137	True/Fake

Deg	Method	Acc	Pre	Rec
1	SVM (Support vector mechine)	78.8	0.785	0.788
2	HMT	74.31	0.738	0.733
3	Bi-LSTM	74.04	0.734	0.74
4	JRIP	73.7	0.726	0.737
5	FilteredClassifier	73.6	0.731	0.736
6	GWO	72.91	0.716	0.698
7	LSTM	71.474	0.706	0.715
8	Ridor	71.4	0.702	0.715
9	RNN	70.83	0.698	0.708
10	Ibk	70.4	0.717	0.704
11	CNN	70.19	0.692	0.702
12	DT (Decision Tree)	69.9	0.698	0.7
13	NB (Naive Bayes)	68.9	0.723	0.689
14	GRU	68.59	0.668	0.686
15	DrO	68.54	0.458	0.641
16	PSO	68.46	0.619	0.774
17	One	67.9	0.651	0.679
18	FFN (Feedfoward neural network)	67.31	0.661	0.673
19	RandomTree	66.4	0.662	0.665

GWO							PSO						DrO						HMT
	Fake			True			Fake			True			Fake			True			Fake			True
	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec
Best	71.500	0.638	0.569	74.700	0.793	0.961	63.200	0.464	67.000	72.000	0.715	0.981	66.300	0.750	0.412	69.900	0.700	0.971	72.970	0.723	0.494	75.000	0.762	0.966
Worst	64.500	0.476	0.082	71.600	0.714	0.783	43.800	0.327	42.100	68.500	0.686	0.940	40.400	0.222	0.003	67.800	0.682	0.921	68.056	0.551	0.230	71.261	0.707	0.905
Mean	68.799	0.567	0.385	73.215	0.749	0.898	52.095	0.374	56.945	69.620	0.696	0.959	64.665	0.491	0.028	68.420	0.689	0.949	70.730	0.651	0.312	73.130	0.731	0.939
Median	69.250	0.574	0.392	73.400	0.751	0.907	52.550	0.375	0.588	69.300	0.694	0.957	65.950	0.523	0.006	68.300	0.689	0.949	70.459	0.649	0.303	73.130	0.728	0.943
Std	1.833	0.042	0.097	0.788	0.022	0.051	5.819	0.037	0.066	0.830	0.007	0.009	0.949	0.192	0.088	1.202	0.097	0.013	1.350	0.052	0.061	0.998	0.014	0.019

	GWO - weighted			PSO - weighted			DrO- weighted			HMT- weighted
	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec
Best	72.910	0.722	0.821	68.464	0.619	0.875	68.540	0.712	0.769	74.310	0.749	0.798
Worst	69.798	0.641	0.647	61.244	0.567	0.774	58.550	0.529	0.610	71.017	0.666	0.681
Mean	71.713	0.687	0.724	63.662	0.586	0.827	67.143	0.622	0.636	72.314	0.704	0.726
Median	71.760	0.690	0.712	63.473	0.584	0.832	67.482	0.635	0.632	72.208	0.701	0.724
Std	0.817	0.023	0.049	2.056	0.014	0.024	2.005	0.067	0.032	0.951	0.024	0.026

	Sig.Level	p value for accuracy	Significant for accuracy	p value for precision	Significant for precision	p value for recall	Significant for the recall
Fake	0.05	< 1E-3	Yes	< 1E-3	Yes	< 1E-3	Yes
True	0.05	< 1E-3	Yes	< 1E-3	Yes	< 1E-3	Yes
Weighted	0.05	< 1E-3	Yes	< 1E-3	Yes	< 1E-3	Yes

Deg	Metot	Acc	Pre	Rec
1	HMT	57.47	0.575	0.562
2	LSTM	56.79	0.573	0.568
3	RNN	55.56	0.563	0.556
4	GWO	54.32	0.528	0.765
5	GRU.	54.32	0.542	0.543
6	FFN	54.32	0.559	0.543
7	DrO	54.18	0.536	0.602
8	NB	53.9	0.538	0.539
9	SVM	53.5	0.533	0.535
10	DT	52.7	0.528	0.527
11	JRIP	51.9	–	0.519
12	FilteredClassifier	51.9	–	0.519
13	PSO	50.93	0.505	0.976
14	Bi-LSTM	50.62	0.524	0.506
15	RandomTree	50.2	0.5	0.502
16	One	49	0.403	0.49
17	CNN	48.15	0.488	0.481
18	Ibk	47.7	0.477	0.477
19	Ridor	47.3	0.372	0.473

	GWO - weighted			PSO - weighted			DrO- weighted			HMT- weighted
	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec
Best	54.184	0.536	0.882	50.932	0.507	0.983	54.324	0.533	0.987	57.471	0.580	0.909
Worst	45.224	0.469	0.489	49.024	0.496	0.853	46.908	0.484	0.512	49.686	0.478	0.346
Mean	49.753	0.506	0.719	49.854	0.501	0.930	50.542	0.506	0.843	53.553	0.519	0.608
Median	49.732	0.505	0.741	50.004	0.501	0.949	50.324	0.503	0.902	53.223	0.512	0.619
Std	2.104	0.016	0.104	0.561	0.003	0.043	1.542	0.011	0.147	2.159	0.028	0.125

Deg	Method	Acc	Pre	Rec
1	LSTM	94.31	0.946	0.943
2	SVM	93.5	0.935	0.935
3	CNN	91.82	0.918	0.918
4	JRIP	91.2	0.913	0.912
5	Bi-LSTM	90.57	0.916	0.906
6	HMT	90.18	0.903	0.902
7	GRU.	90.17	0.905	0.901
8	DT	90.1	0.901	0.901
9	RNN	89.38	0.895	0.893
10	Ibk	89.2	0.892	0.892
11	FFN	88.12	0.901	0.846
12	GWO	87.4	0.896	0.834
13	Ridor	87.32	0.869	0.864
14	FilteredClassifier	86.9	0.893	0.824
15	NB	86.7	0.869	0.867
16	RandomTree	86.4	0.864	0.864
17	PSO	71.61	0.708	0.779
18	DrO	71.39	0.708	0.771
19	One	68.1	0.787	0.681

	GWO - weighted			PSO - weighted			DrO- weighted			HMT- weighted
	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec
Best	87.448	0.896	0.865	71.600	0.807	0.801	71.392	0.712	0.807	90.176	0.908	0.902
Worst	73.500	0.212	0.625	61.720	0.174	0.641	46.796	0.328	0.379	80.868	0.867	0.736
Mean	82.730	0.651	0.770	67.538	0.581	0.725	58.391	0.540	0.675	84.049	0.891	0.786
Median	82.076	0.576	0.747	67.546	0.656	0.716	57.244	0.553	0.740	83.909	0.891	0.782
Std	4.619	0.238	0.074	3.030	0.171	0.047	7.213	0.125	0.146	2.000	0.010	0.036

	GWO						PSO						DrO						HMT
	Fake			True			Fake			True			Fake			True			Fake			True
	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec
Best	53.300	0.504	0.982	55.400	0.579	0.797	47.900	0.474	0.974	55.400	0.579	0.797	55.000	0.537	0.982	54.100	0.552	0.992	56.612	0.615	0.921	58.264	0.576	0.898
Worst	45.500	0.436	0.316	44.600	0.474	0.430	45.900	0.464	0.956	44.600	0.474	0.430	45.900	0.434	0.009	47.100	0.500	0.320	47.107	0.400	0.018	49.174	0.514	0.563
Mean	47.838	0.471	0.845	51.520	0.537	0.602	46.900	0.469	0.968	51.520	0.537	0.602	47.955	0.475	0.840	52.930	0.535	0.846	52.005	0.493	0.468	53.059	0.542	0.737
Median	47.100	0.470	0.956	52.100	0.540	0.617	47.100	0.470	0.974	52.100	0.540	0.617	47.100	0.470	0.982	53.300	0.533	0.961	52.273	0.490	0.500	52.479	0.537	0.723
Std	2.326	0.014	0.209	3.023	0.027	0.100	0.482	0.241	0.008	0.030	0.027	0.100	10.029	0.021	0.289	1.418	0.209	0.204	2.795	0.048	0.225	0.023	0.018	0.098

	GWO						PSO						DrO						HMT
	Fake			True			Fake			True			Fake			True			Fake			True
	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec	Acc	Pre	Rec
Best	87.500	0.896	0.865	87.500	0.896	0.865	71.600	0.898	0.821	87.500	0.896	0.865	71.200	0.712	0.778	71.600	0.886	0.838	89.600	0.920	0.896	90.800	0.914	0.908
Worst	73.500	0.212	0.625	73.500	0.212	0.607	52.600	0.164	0.534	73.500	0.212	0.607	46.900	0.120	0.051	30.000	0.539	0.265	80.200	0.863	0.723	80.400	0.853	0.728
Mean	81.710	0.591	0.751	83.835	0.715	0.790	67.730	0.596	0.736	83.835	0.715	0.790	58.410	0.471	0.630	58.370	0.614	0.723	83.062	0.896	0.765	85.118	0.884	0.809
Median	87.100	0.872	0.834	86.400	0.863	0.834	70.550	0.705	0.763	86.400	0.863	0.834	55.900	0.556	0.730	58.700	0.552	0.749	82.709	0.896	0.750	85.126	0.886	0.806
Std	6.315	0.330	0.100	5.182	0.282	0.091	4.800	22.053	0.066	0.052	0.282	0.091	7.717	0.233	0.218	10.660	0.111	0.157	2.382	0.151	0.048	0.027	0.020	0.054

PERMALINK

A novel hybrid multi-thread metaheuristic approach for fake news detection in social media

Gungor Yildirim

Abstract

Introduction

Fig. 1.

Purpose and innovative aspects of the proposed method

Basic principles of optimization-based fake news detection

Table 1.

The problem definition and the proposed hybrid multi-thread approach

The influencing factors in optimization-based data set analysis

Fig. 2.

The proposed hybrid multi-thread model

Fig. 3.

Table 2.

The implementation and framework of the HMT

The developed framework for HMT

Fig. 4.

The metaheuristic algorithms used in the experiments

Experiments

Table 3.

Fig. 5.

The comparative results and discussion

The Covid-19 data set results

Table 4.

Table 5.

Table 6.

Table 7.

Fig. 6.

Fig. 7.

The Syrian war data set results

Table 8.

Table 9.

Table 10.

Table 11.

Fig. 8.

Fig. 9.

The general-news data set results

Table 12.

Table 13.

Table 14.

Table 15.

Fig. 10.

Fig. 11.

Fig. 12.

Conclusion

Gungor Yildirim

Appendix 1

Fig. 13.

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases