Machine Learning Frameworks for Fake News Detection and Datasets

DOI

A web framework designed for researchers to perform comparative analysis of various machine learning algorithms in the context of fake news detection. The folder also includes several datasets for experimentation, alongside the source code.

The rise of social media has transformed the landscape of news dissemination, presenting new challenges in combating the spread of fake news. This study addresses the automated detection of misinformation within written content, a task that has prompted extensive research efforts across various methodologies. We evaluate existing benchmarks, introduce a novel hybrid word embedding model, and implement a web framework for text classification. Our approach integrates traditional frequency–inverse document frequency (TF–IDF) methods with sophisticated feature extraction techniques, considering linguistic, psychological, morphological, and grammatical aspects of the text. Through a series of experiments on diverse datasets, applying transfer and incremental learning techniques, we demonstrate the effectiveness of our hybrid model in surpassing benchmarks and outperforming alternative experimental setups. Furthermore, our findings emphasize the importance of dataset alignment and balance in transfer learning, as well as the utility of incremental learning in maintaining high detection performance while reducing runtime. This research offers promising avenues for further advancements in fake news detection methodologies, with implications for future research and development in this critical domain.

Identifier
DOI https://doi.org/10.34894/CUCITF
Related Identifier IsCitedBy https://doi.org/10.3390/fi16100352
Metadata Access https://dataverse.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34894/CUCITF
Provenance
Creator Mohsen, Fadi ORCID logo; Chaushi, Bedir; Abdelhaq, Hamed; Wang, Kevin
Publisher DataverseNL
Contributor Groningen Digital Competence Centre; Mohsen, Fadi
Publication Year 2024
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Groningen Digital Competence Centre (Groningen University); Mohsen, Fadi (Groningen University)
Representation
Resource Type Dataset
Format application/x-rar-compressed; text/markdown
Size 133821784; 6091
Version 1.0
Discipline Other