skip to main content
10.1145/3484824.3484909acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdsmlaiConference Proceedingsconference-collections
research-article

Deep Learning Driven System to Analyze Reliability of COVID Infodemic and News Articles

Published: 13 January 2022 Publication History

Abstract

In the modern era, social media and electronic news play an important role in the dissemination of information across the world. The ease of sharing news articles and information over the electronic media has made it challenging to prevent the outspread of fake news articles. Thus, it has been observed that in the present situation of a pandemic the misleading news specifically about COVID-19 are increasing day by day. Multiple research groups identified this challenge and proposed machine learning-based binary classifiers for categorizing news articles into fake and true classes. But, none of them addressed the challenge of identifying the misleading news. Also, the existing research works do not focus on examining the reliability of the news about COVID-19. Moreover, there is a lack of complete system that provide an integration of front-end and back-end where the users can check the reliability of news article and get recommendation of the sources for validation of news articles. The authors in this manuscript propose the Deep Learning-based tool with multiclass classifier for classifying the news articles into fake, true and misleading. This tool is an integration of front-end and back-end that is equally effective in assessing the reliability of news about COVID-19 and other events. The classifier of the tool has been trained on the integrated dataset comprising general new articles from the globe as well as news articles about COVID-19. The trained classifier reported an accuracy of 88% in classifying the news into Fake, Misleading, and True classes. The higher accuracy than state-of-the-art models validate the acceptance of the tool for real-life applications.

References

[1]
WHO. 2020. Managing the COVID-19 infodemic. Retrieved November 3, 2021 from https://www.who.int/news/item/23-09-2020-managing-the-covid-19-infodemic-promoting-healthy-behaviours-and-mitigating-the-harm-from-misinformation-and-disinformation
[2]
IndiaToday. 2020. Fake news on lockdown duration triggered large scale migration (September 2020). Retrieved November 3, 2021 from https://www.indiatoday.in/india/story/fake-news-triggered-large-scale-migration-during-lockdown-mha-1722102-2020-09-15
[3]
PolitiFact. 2020. COVID-19 is not a simulation. Retrieved November 3, 2021 from https://www.politifact.com/factchecks/2020/dec/17/facebook-posts/no-covid-19-not-simulation/
[4]
Geeta Rani and Monika Agarwal. 2020. Contrast Enhancement Using Optimum Threshold Selection. International Journal of Software Innovation 8, 3 (2020), 96--118. DOI10.4018/IJSI.2020070107
[5]
Monika Agarwal, Geeta Rani, and Vijaypal Singh Dhaka. 2020. Optimized contrast enhancement for tumor detection. International Journal of Imaging Systems and Technology 30, 3 (2020), 687--703.
[6]
Geeta Rani and Anita Jindal. 2019. Real-Time Object Detection and Tracking Using Velocity Control. Smart Systems and IoT: Innovations in Computing Smart Innovation, Systems and Technologies (2019), 767--778.
[7]
Vijaypal Singh Dhaka, Geeta Rani, Meet Ganpatlal Oza, Tarushi Sharma, and Ankit Misra. 2021. A deep learning model for mass screening of COVID-19. International Journal of Imaging Systems and Technology 31, 2 (2021), 483--498.
[8]
Savita, Ganga Sharma, Geeta Rani, and Vijaypal Singh Dhaka. 2020. A Review on Machine Learning Techniques for Prediction of Cardiovascular Diseases. 2020 Sixth International Conference on Parallel, Distributed and Grid Computing (PDGC) (2020).
[9]
Nitesh Pradhan, Geeta Rani, Vijaypal Singh Dhaka, and Ramesh Chandra Poonia. 2020. Diabetes prediction using artificial neural network. Deep Learning Techniques for Biomedical and Health Informatics (2020), 327--339.
[10]
Savita, Ganga Sharma, Geeta Rani, and Vijaypal Singh Dhaka. 2021. Efficient Predictive Modelling for Classification of Coronary Artery Diseases Using Machine Learning Approach. IOP Conference Series: Materials Science and Engineering 1099, 1 (2021), 012068.
[11]
Meet Ganpatlal Oza, Geeta Rani, and Vijaypal Singh Dhaka. 2021. Glaucoma Detection Using Convolutional Neural Networks. Handbook of Research on Disease Prediction Through Data Analytics and Machine Learning (2021), 1--7.
[12]
Nidhi Kundu, Geeta Rani, and Vijaypal Singh Dhaka. 2020. Machine Learning and IoT based Disease Predictor and Alert Generator System. 2020 Fourth International Conference on Computing Methodologies and Communication (ICCMC) (2020).
[13]
Tariq Alhindi, Savvas Petridis, and Smaranda Muresan. 2018. Where is Your Evidence: Improving Fact-checking by Justification Modeling. In Proceedings of the First Workshop on Fact Extraction and VERification (FEVER) (2018).
[14]
Gongde Guo, Hui Wang, David Bell, Yaxin Bi, and Kieran Greer. 2003. KNN Model-Based Approach in Classification. On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE Lecture Notes in Computer Science (2003), 986--996.
[15]
Parth Patwa, Shivam Sharma, Srinivas Pykl, Vineeth Guptha, Gitanjali Kumari, Md Shad Akhtar, Asif Ekbal, Amitava Das, and Tanmoy Chakraborty. 2021. Fighting an Infodemic: COVID-19 Fake News Dataset. Combating Online Hostile Posts in Regional Languages during Emergency Situation Communications in Computer and Information Science (2021), 21--29.
[16]
Pranav Bharadwaj and Zongru Shao. 2019. Fake news detection with semantic features and text mining. International Journal on Natural Language Computing 8, 3 (2019), 17--22.
[17]
Theodoros Evgeniou and Massimiliano Pontil. 2001. Support Vector Machines: Theory and Applications. Machine Learning and Its Applications Lecture Notes in Computer Science (2001), 249--257.
[18]
Tianqi Chen and Carlos Guestrin. 2016. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016), 785--794.
[19]
Adnan Hussein, Farzana Ahmad, and Siti Kamaruddin. 2019. Content-Social Based Features for Fake News Detection Model from Twitter. International Journal of Advanced Trends in Computer Science and Engineering 8, 6 (2019), 2806--2810.
[20]
Somya Ranjan Sahoo and B.b. Gupta. 2021. Multiple features based approach for automatic fake news detection on social networks using deep learning. Applied Soft Computing 100 (2021), 106983.
[21]
Samir Bajaj. 2017. "The Pope Has a New Baby!" Fake News Detection Using Deep Learning. (Winter 2017). Retrieved November 3, 2021 from https://web.stanford.edu/class/archive/cs/cs224n/cs224n.1174/reports/2710385.pdf
[22]
Fariba Sadeghi, Amir Bidgoly, and Hossein Amirkhani. Fake News Detection on Social Media Using A Natural Language Inference Approach. (2020)
[23]
Julio C.S. Reis, Andre Correia, Fabricio Murai, Adriano Veloso, Fabricio Benevenuto, and Erik Cambria. 2019. Supervised Learning for Fake News Detection. IEEE Intelligent Systems 34, 2 (2019), 76--81.
[24]
Giovanni Santia and Jake Williams. 2018. BuzzFace: A News Veracity Dataset with Facebook User Commentary and Egos. In Proceedings of the International AAAI Conference on Web and Social Media 12, 1 (2018), 531--540.
[25]
Shlok Gilda. 2017. Evaluating machine learning algorithms for fake news detection. IEEE 15th Student Conference on Research and Development (SCOReD) (2017), 110--115.
[26]
David Corney, Dyaa Albakour, Miguel Martinez and Samir Moussa. 2016. "What do a million news articles look like?". In Proceedings of the First International Workshop on Recent Trends in News Information Retrieval co-located with 38th European Conference on Information Retrieval (2016), Padua, Italy, 42--47.
[27]
Vishal Soni. 2018. Prediction of Geniunity of News using advanced Machine Learning and Natural Language processing Algorithms. International Journal of Innovative Research in Science Engineering and Technology 7, 5 (2018), 6349--6354.
[28]
Kaggle. 2017. Fake News Detection Dataset. Retrieved November 3, 2021 from https://www.kaggle.com/jruvika/fake-news-detection
[29]
Arvinder Pal Singh Bali, Mexson Fernandes, Sourabh Choubey, and Mahima Goel. 2019. Comparative Performance of Machine Learning Algorithms for Fake News Detection. Communications in Computer and Information Science Advances in Computing and Data Sciences (2019), 420--430.
[30]
Sam Hagen. 2017. OpenSource Dataset. GitHub. Retrieved November 3, 2021 from https://github.com/BigMcLargeHuge/opensources
[31]
George McIntire. 2017. Fake Real News Dataset. Retrieved May 29, 2021 from https://www.kdnuggets.com/2017/04/machine-learning-fake-news-accuracy.html
[32]
Vikas Ukani. 2020. News Data Set - Fake OR Real. Kaggle. Retrieved November 3, 2021 from https://www.kaggle.com/vikasukani/news-data-set-fake-news-with-python
[33]
Meg Risdal. 2016. Getting Real About Fake News. Kaggle. Retrieved November 3, 2021 from https://www.kaggle.com/mrisdal/fake-news
[34]
Signal Media. 2016. The Signal Media One-Million News Articles Dataset. Retrieved November 3, 2021 from http://research.signalmedia.co/newsir16/signal-dataset.html
[35]
William Yang Wang. 2017. "Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (2017), 422--426.
[36]
Shivangi Singhal, Rajiv Ratn Shah, Tanmoy Chakraborty, Ponnurangam Kumaraguru, and Shinichi Satoh. 2019. SpotFake: A Multi-modal Framework for Fake News Detection. 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM) (2019), 39--47.
[37]
Multimedia Knowledge and Social Analytics Lab. 2017. Image Verification Corpus. GitHub. Retrieved November 3, 2021 from https://github.com/MKLab-ITI/image-verification-corpus/tree/master/mediaeval2015
[38]
Fariba Sadeghi, Amir Bidgoly and Hossein Amirkhani. 2020. FNID: Fake News Inference Dataset. IEEE Dataport.
[39]
Parth Patwa. 2021. COVID-19 Fake News Detection Dataset. GitHub. Retrieved November 3, 2021 from https://github.com/parthpatwa/covid19-fake-news-detection
[40]
Apurva Wani, Isha Joshi, Snehal Khandve, Vedangi Wagh, and Raviraj Joshi. 2021. Evaluating Deep Learning Approaches for Covid19 Fake News Detection. Combating Online Hostile Posts in Regional Languages during Emergency Situation Communications in Computer and Information Science (2021), 153--163.
[41]
Anmol Kumar. 2020. Fake News Content Detection. Kaggle. Retrieved November 3, 2021 from https://www.kaggle.com/anmolkumar/fake-news-content-detection
[42]
Shahzad Qaiser and Ramsha Ali. 2018. Text Mining: Use of TF-IDF to Examine the Relevance of Words to Documents. International Journal of Computer Applications 181, 1 (2018), 25--29.
[43]
Nitesh Pradhan, Vijaypal Singh Dhaka, Geeta Rani, and Himanshu Chaudhary. 2020. Machine Learning Model for Multi-View Visualization of Medical Images. The Computer Journal (2020).
[44]
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global Vectors for Word Representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014), 1532--1543.
[45]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North (2019), 4171--4186.
[46]
Sklearn.metrics. 2021. Retrieved November 3, 2021 from https://scikit-learn.org/stable/modules/generated/sklearn.metrics.precision_recall_fscore_support.html

Index Terms

  1. Deep Learning Driven System to Analyze Reliability of COVID Infodemic and News Articles
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Information & Contributors

            Information

            Published In

            cover image ACM Other conferences
            DSMLAI '21': Proceedings of the International Conference on Data Science, Machine Learning and Artificial Intelligence
            August 2021
            415 pages
            ISBN:9781450387637
            DOI:10.1145/3484824
            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            Published: 13 January 2022

            Permissions

            Request permissions for this article.

            Check for updates

            Author Tags

            1. BERT
            2. COVID-19
            3. Deep Learning
            4. Fake News
            5. SVM

            Qualifiers

            • Research-article
            • Research
            • Refereed limited

            Conference

            DSMLAI '21'

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • 0
              Total Citations
            • 64
              Total Downloads
            • Downloads (Last 12 months)4
            • Downloads (Last 6 weeks)0
            Reflects downloads up to 02 Mar 2025

            Other Metrics

            Citations

            View Options

            Login options

            View options

            PDF

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader

            Figures

            Tables

            Media

            Share

            Share

            Share this Publication link

            Share on social media