Abstract
Social media can render content circulating to reach millions with a knack to influence people, despite the questionable authencity of the facts. Internet sources are the most convenient and easy approach to obtain any information these days. Fake news has become the topic of interest for academicians and the rest of society. This kind of propaganda has the power to influence the general perception, offering political groups the ability to control the results of democratic affairs such as elections. Automatic identification of fake news has emerged as one of the significant problems due to the high risks involved. It is challenging in a way because of the complexity levels of accurately interpreting the data. An extensive search has already been performed on English language news data. Our work presents a comparative analysis of fake news classifiers on the low resource Bengali language ‘ban fake news’ dataset from Kaggle. The analysis presented compares deep learning techniques such as LSTM (Long short-term Memory) and BiLSTM (Bi-directional Long short-term Memory) and machine learning methods like Naive Bayes, Passive Aggressive Classifier (PAC), and Random Forest. The comparison has been drawn based on classification metrics such as accuracy, precision, recall, and F1 score. The deep learning method BiLSTM shows 55.92% accuracy while Random Forest, in contrast, has outperformed all the other methods with an accuracy of 62.37%. The work presented in this paper sets a basis for researchers to select the optimum classifiers for their approach towards fake news detection.
- [1] . 2016. The spreading of misinformation online. Proceedings of the National Academy of Sciences 113, 3 (2016), 554–559.Google ScholarCross Ref
- [2] . 2016. Echo chambers: Emotional contagion and group polarization on Facebook. Scientific Reports 6.Google Scholar
- [3] . 2020. A survey of fake news: Fundamental theories, detection methods, and opportunities. ACMComput. Surv 1, 1,
Article 1 .Google Scholar - [4] . 2017. Fake news mitigation via point process-based intervention. arXiv preprint arXiv:1703.07823.Google Scholar
- [5] . 2017. Social media and fake news in the 2016 election. Journal of Economic Perspectives 31, 2 (2017), 211–236.Google ScholarCross Ref
- [6] . Characterizing the life cycle of online news stories using social media reactions. In CSCW'14.Google Scholar
- [7] . 2016. Mining misinformation in social media. Big Data in Complex and Social Networks 123–152.Google Scholar
- [8] . 2017. Social media and fake news in the 2016 election. Technical Report, National Bureau of Economic Research.Google Scholar
- [9] . 2014. When fake news becomes real: Combined exposure to multiple news sources and political attitudes of inefficacy, alienation, and cynicism. Communication Research 41, 3 (2014), 430–454.Google ScholarCross Ref
- [10] . 2016. Social bots distort the 2016 US presidential election online discussion. First Monday 21, 11.Google Scholar
- [11] . Anyone can become a troll: Causes of trolling behavior in online discussions. In CSCW’17.Google Scholar
- [12] . Information credibility on Twitter. In WWW'11.Google Scholar
- [13] . 2015. Misleading online content: Recognizing clickbait as false news. In Proceedings of the 2015 ACM on Workshop on Multimodal Deception Detection 15–19. ACM.Google ScholarDigital Library
- [14] . 2012. Detecting automation of Twitter accounts: Are you a human, bot, or cyborg? IEEE Transactions on Dependable and Secure Computing 9, 6 (2012), 811–824.Google ScholarDigital Library
- [15] . 2015. Computational fact checking from knowledge networks. PloS One 10, 6 (2015), e0128193.Google Scholar
- [16] . 2017. News feature: The genuine problem of fake news. Proceedings of the National Academy of Sciences 114, 48 (2017), 12631–12634.Google Scholar
- [17] . 2018. Fake news: A definition. Informal Logic 38, 1 (2018), 84–117.Google ScholarCross Ref
- [18] . 2015. Unsupervised sentiment analysis for social media images. In IJCAI 2378–2379.Google ScholarDigital Library
- [19] . Gleaning wisdom from the past: Early detection of emerging rumors in social media. In SDM'17.Google Scholar
- [20] . 2017. Detection and resolution of rumours in social media: A survey. arXiv preprint arXiv:1704.00656.Google Scholar
- [21] . 2013. The impact of real news about fake news: Intertextual processes and political satire. International Journal of Public Opinion Research 25, 3 (2013), 323–343.Google ScholarCross Ref
- [22] . 1997. Naive realism in everyday life: Implications for social conflict and misunderstanding. Values and Knowledge 103–135.Google Scholar
- [23] . 2015. Automatic deception detection: Methods for finding fake news. Proceedings of the Association for Information Science and Technology 52, 1 (2015), 1–4.Google ScholarCross Ref
- [24] . Improving generalizability of fake news detection methods using propensity score matching. arXiv – CS – Social and Information Networks.Google Scholar
- [25] . 2012. Automatic detection of rumor on Sina Weibo. In Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics 13. ACM.Google ScholarDigital Library
- [26] . Densely connected CNN with multi-scale feature attention for text classification. Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI-18).Google Scholar
- [27] . Machine learning from theory to algorithms: An overview. Journal of Physics: Conference Series 1142, Second National Conference on Computational Intelligence.Google Scholar
- [28] . 2019. Automatic detection of satire in Bangla documents: A CNN approach based on hybrid feature extraction model. 2019 International Conference on Bangla Speech and Language Processing (ICBSLP), Sylhet, Bangladesh.Google ScholarCross Ref
- [29] . 2019. A benchmark study on machine learning methods for fake news detection. arXiv:1905.04749v1 [cs.CL] 12 May.Google Scholar
- [30] . 2015. A C-LSTM neural network for text classification. arXiv 2015.Google Scholar
- [31] . 2017. Yan Liu CSI: A hybrid deep model for fake news detection. CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management 2017, 797–806.Google ScholarDigital Library
- [32] . 8 amazing secrets for getting more clicks: Detecting clickbaits in news streams using article informality. In AAAI'16.Google Scholar
- [33] . 2015. Click bait: Forward-reference as lure in online news headlines. Journal of Pragmatics 76, (2015), 87–100.Google ScholarCross Ref
- [34] . Stop clickbait: Detecting and preventing clickbaits in online news media. In ASONAM'16.Google Scholar
- [35] . Adaptive spammer detection with sparse group modeling. In ICWSM'17.Google Scholar
- [36] . Weak supervision for fake news detection via reinforcement learning. arXiv – CS – Machine Learning.Google Scholar
- [37] . 2016. Diversity-based boosting algorithm. International Journal of Advanced Computer Science and Applications 7, 5 (2016).Google Scholar
- [38] . 2019. Deep refinement: Capsule network with attention mechanism-based system for text classification. Springer-Verlag London Ltd., part of Springer Nature.Google Scholar
- [39] . 2020. Paraphrase identification using collaborative adversarial networks. Journal of Intelligent and Fuzzy systems 1–12, September 2020.
DOI: DOI: 10.3233/JIFS-191933Google ScholarDigital Library - [40] . 2020. BanFakeNews: A dataset for detecting fake news in Bangla. Proceedings of the 12th Language Resources and Evaluation Conference.Google Scholar
- [41] . 2005. Detection of Bangla fake news using MNB and SVM CLASSIFIER. arXiv:2005.14627v1.Google Scholar
- [42] . 2009. Feature selection for text classification with Naïve Bayes. Expert Systems with Applications.Google Scholar
- [43] . 2018. Genre classification using word embeddings and deep learning. 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Bangalore, 2018, 2142–2146,
DOI: DOI: 10.1109/ICACCI.2018.8554816Google ScholarCross Ref - [44] . 2019. Sarcasm detection using soft attention-based bidirectional long short-term memory model with convolution network. IEEE Access 7, 23319–23328, 2019,
DOI: DOI: 10.1109/ACCESS.2019.2899260Google ScholarCross Ref - [45] . 2016. Text classification improved by integrating bidirectional LSTM with two-dimensional max pooling. arXiv preprint arXiv:1611.06639 (2016).Google Scholar
- [46] . 1998. A study using n-gram features for text categorization. Austrian Research Institute for Artificial Intelligence 3(1998), 1–10.Google Scholar
- [47] . Understanding probabilistic classifiers. ECML'01.Google Scholar
- [48] 2000. Ensemble methods in machine learning. Multiple Classifier Systems 1857, (2000) 1–15.Google ScholarCross Ref
- [49] . 2014. Online passive aggressive active learning and its applications. JMLR: Workshop and Conference Proceedings 39, (2014) 266–282.Google Scholar
- [50] . Recurrent neural network for text classification with multi-task learning. Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16).Google Scholar
- [51] . 2019. Sentiment analysis of comment texts based on BiLSTM. In IEEE Access 7, 51522–51532, 2019,
DOI: DOI: 10.1109/ACCESS.2019.2909919Google ScholarCross Ref - [52] . 2018. Genre classification using feature extraction and deep learning techniques. 2018 10th International Conference on Knowledge and Systems Engineering (KSE), Ho Chi Minh City 2018, 175–180,
DOI: 10.1109/KSE.2018.8573325.Google ScholarCross Ref
Index Terms
- Fake News Classification: A Quantitative Research Description
Recommendations
Fake News Detection Using Hybrid Deep Learning Method
AbstractThe growth of online social networks platforms in recent years has resulted in the widespread dissemination of social news such as commercial adverts, political news, celebrity gossip, and more. This proliferation has resulted in distribution of ...
A Closer Look at Fake News Detection: A Deep Learning Perspective
ICAAI '19: Proceedings of the 3rd International Conference on Advances in Artificial IntelligenceThe increasingly rapid pace of spreading fake news is considered a problem in conjunction with the increasing number of people who are relying upon social media to get news. That earns widespread attention from research communities due to the negative ...
Deep learning for fake news detection on Twitter regarding the 2019 Hong Kong protests
AbstractThe dissemination of fake news on social media platforms is an issue of considerable interest, as it can be used to misinform people or lead them astray, which is particularly concerning when it comes to political events. The recent event of Hong ...
Comments