Abstract:
The rapid spread of misinformation on platforms like Twitter, and Facebook, and in news
headlines highlights the urgent need for effective ways to detect it. Currently, researchers
are increasingly using machine learning (ML) and deep learning (DL) techniques to tackle
misinformation detection (MID) because of their proven success. However, this task is
still challenging due to the complexity of deceptive language, digital editing tools, and the
lack of reliable linguistic resources for non-English languages. This paper provides a
comprehensive analysis of relevant research, providing insights into advanced techniques
for MID. It covers dataset assessments, the importance of using multiple forms of data
(multimodality), and different language representations. By applying the Preferred
Reporting Items for Systematic Review and Meta-Analysis (PRISMA) methodology, the
study identified and analyzed literature from 2019 to 2024 across five databases: Google
Scholar, Springer, Elsevier, ACM, and IEEE Xplore. The study selected thirty-one papers
and examined the effectiveness of various ML and DL approaches with a focal point on
performance metrics, datasets, and false or misleading information detection challenges.
The findings indicate that most current MID models are heavily dependent on DL
techniques, with approximately 81% of studies preferring these over traditional ML
methods. In addition, most studies are text-based, with much less attention given to audio,
speech, images, and videos. The most effective models are mainly designed for highresource
languages, with English datasets being the most used (67%), followed by Arabic
(14%), Chinese (11%), and others. Less than 10% of the studies focus on low-resource
languages (LRLs). Therefore, the study highlighted the need for robust datasets and interpretable, scalable MID models for LRLs. It emphasizes the critical need to prioritize
and advance MID research for LRLs across all data types, including text, audio, speech,
images, videos, and multimodal approaches. This study aims to support ongoing efforts
to combat misinformation and promote a more informed understanding of underresourced
African languages.