Skip to main content
. 2023 Feb 9;13(1):30. doi: 10.1007/s13278-023-01028-5

Table 6.

The features and datasets used in the news content-based approaches

Feature and metadata Datasets Reference
The average number of words in sentences, number of stop words, the sentiment rate of the news measured through the difference between the number of positive and negative words in the article Getting real about fake newsa, Gathering mediabiasfactcheckb, KaiDMML FakeNewsNetc, Real news for Oct-Dec 2016d Kapusta et al. (2019)
The length distribution of the title, body and label of the article News trends, Kaggle, Reuters Kaur et al. (2020)
Sociolinguistic, historical, cultural, ideological and syntactical features attached to particular words, phrases and syntactical constructions FakeNewsNet Vereshchaka et al. (2020)
Term frequency BuzzFeed political news, Random political news, ISOT fake news Ozbay and Alatas (2020)
The statement, speaker, context, label, justification POLITIFACT, LIARe Wang (2017)
Spatial vicinity of each word, spatial/contextual relations between terms, and latent relations between terms and articles Kaggle fake news datasetf Hosseinimotlagh and Papalexakis (2018)
Word length, the count of words in a tweeted statement Twitter dataset, Chile earthquake 2010 datasets Abdullah-All-Tanvir et al. (2019)
The number of words that express negative emotions Twitter dataset Abdullah-All-Tanvir et al. (2020)
Labeled data BuzzFeedg, PolitiFacth Mahabub (2020)
The relationship between the news article headline and article body. The biases of a written news article Kaggle: real_or_fakei, Fake news detectionj Bahad et al. (2019)
Historical data. The topic and sentiment associated with content textual. The subject and context of the text, semantic knowledge of the content Facebook dataset Del Vicario et al. (2019)
The veracity of image text. The credibility of the top 15 Google search results related to the image text Google images, the Onion, Kaggle Vishwakarma et al. (2019)
Topic modeling of text and the associated image of the online news Twitter datasetk, Weibol Amri et al. (2022)