Social Media Bangla Fake News Detection Using Deep and Machine Learning Algorithms

Social Media Bangla Fake News Detection Using Deep and Machine Learning Algorithms

  IJETT-book-cover           
  
© 2024 by IJETT Journal
Volume-72 Issue-5
Year of Publication : 2024
Author : Nipa Rani Das, Md. Mahbubul Alam, Asif Pervez Polok, Mirza Raquib, Abdullah Al Mamun, Md Jakir Hossen
DOI : 10.14445/22315381/IJETT-V72I5P135

How to Cite?

Nipa Rani Das, Md. Mahbubul Alam, Asif Pervez Polok, Mirza Raquib, Abdullah Al Mamun, Md Jakir Hossen, "Social Media Bangla Fake News Detection Using Deep and Machine Learning Algorithms," International Journal of Engineering Trends and Technology, vol. 72, no. 5, pp. 346-354, 2024. Crossref, https://doi.org/10.14445/22315381/IJETT-V72I5P135

Abstract
In this current world, media and online news publications are spreading rapidly. The dissemination of inaccurate information on social media platforms is increasingly becoming a concern. The proliferation of fake news is due to the ease with which data can be accessed and shared as a result of the mobile technology revolution. Like many countries, fake news is spreading very fast in Bangladesh. The situation gets worse with the spreading of misinformation about epidemics like COVID-19. Created a novel dataset of the Bengali language and achieved to goal by using LSTM and machine learning models. Now, other algorithms are used, but the LSTM and machine learning models have good performance. This program's algorithm to select the attribute, a text feature based on TF-IDF and Word Embedding was used. Focused LSTM-base model and machine learning models, especially the Bangla-LSTM-base model and machine learning models. Finally, add a dense layer as a summary layer responsible for generating summary sentences to text. According to all of the evaluations performed above, the additional Trees Classifier outperformed the other six Machine Learning methods. The accuracy rate for identifying false news in news headline data is roughly 86.14%. The second-best accuracy provided by the Random Forest Classifier algorithm is close to 85%. The third-best accuracy provided by the Decision Tree Classifier method is approximately 84%. Moreover, seeing that deep learning algorithms outperformed machine learning ones. Furthermore, LSTM has a 96.14% training accuracy rate and an 86% testing accuracy rate for identifying false news in news headline data.

Keywords
LSTM, Word Embedding, Dataset, Headline, TF-IDF, Accuracy.

References
[1] David M.J. Lazer et al., “The Science of Fake News,” Science, vol. 359, no. 6380, pp. 1094-1096, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Tasnuba Sraboni et al., “FakeDetect: Bangla Fake News Detection Model Based On Different Machine Learning Classifiers,” Brac University, pp. 1-45, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Shafayat Bin Shabbir Mugdha, Sayeda Muntaha Ferdous, and Ahmed Fahmin, “Evaluating Machine Learning Algorithms For Bengali Fake News Detection,” 2020 23rd International Conference on Computer and Information Technology (ICCIT), Dhaka, Bangladesh, pp. 1-6, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Soroush Vosoughi, Deb Roy, and Sinan Aral, “The Spread of True and False News Online,” Science, vol. 359, no. 6380, pp. 1146-1151, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Benjamin Riedel et al., “A Simple but Tough-to-Beat Baseline for the Fake News Challenge Stance Detection Task,” arXiv, pp. 1-6, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Hadeer Ahmed, Issa Traore, and Sherif Saad, “Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques,” Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments, Vancouver, BC, Canada, pp. 127-138, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Mykhailo Granik, and Volodymyr Mesyura, “Fake News Detection Using Naive Bayes Classifier,” 2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON), Kyiv, Ukraine, pp. 900-903, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Anjali Jain et al., “A Smart System for Fake News Detection Using Machine Learning,” 2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT), Ghaziabad, India, pp. 1-4, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Marco L. Della Vedova et al., “Automatic Online Fake News Detection Combining Content and Social Signals,” 2018 22nd Conference of Open Innovations Association (FRUCT), Jyvaskyla, Finland, pp. 272-279, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Rakib ul Haque et al., “A Complete Bengali Stop Word Detection Mechanism,” 2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV), Spokane, WA, USA, pp. 103-107, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Aswini Thota et al., “Fake News Detection: A Deep Learning Approach,” SMU Data Science Review, vol. 1, no. 3, pp. 1-20, 2018.
[Google Scholar] [Publisher Link]
[12] Verónica Pérez-Rosas et al., “Automatic Detection of Fake News,” arXiv, pp. 1-10, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Md Zobaer Hossain et al., “Banfakenews: A Dataset for Detecting Fake News in Bangla,” arXiv, pp. 1-10, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Tasnuba Sraboni et al., “FakeDetect: Bangla Fake News Detection Model Based on Different Machine Learning Classifiers,” Thesis, Brac University, pp. 1-45, 2021.
[Google Scholar] [Publisher Link]
[15] David H. Wolpert, “Stacked Generalization,” Neural Networks, vol. 5, no. 2, pp. 241-259, 1992.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Arnab Sen Sharma, Maruf Ahmed Mridul, and Md Saiful Islam, “Automatic Detection of Satire in Bangla Documents: A CNN Approach Based on Hybrid Feature Extraction Model,” arXiv, pp. 1-5, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Kai Shu, Deepak Mahudeswaran, and Huan Liu, “Fake News Tracker: A Tool for Fake News Collection, Detection, and Visualization,” Computational and Mathematical Organization Theory, vol. 25, pp. 60-71, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Mohamed Torky, Emad Nabil, and Wael Said, “Proof of Credibility: A Blockchain Approach for Detecting and Blocking Fake News in Social Networks,” International Journal of Advanced Computer Science and Applications, vol. 10, no. 12, pp. 321-327, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Kai Shu, Suhang Wang, and Huan Liu, “Exploiting Tri-Relationship for Fake News Detection,” arXiv, pp. 1-10, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Marco L. Della Vedova et al., “Automatic Online Fake News Detection Combining Content and Social Signals,” 2018 22nd Conference of Open Innovations Association (FRUCT), Jyvaskyla, Finland, pp. 272-279, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Arjun Roy et al., “A Deep Ensemble Framework for Fake News Detection and Multi-Class Classification of Short Political Statements,” Proceedings of the 16th International Conference on Natural Language Processing, International Institute of Information Technology, Hyderabad, India, pp. 9-17, 2019.
[Google Scholar] [Publisher Link]