Abstract
We explore the use of machine learning techniques to classify whether a news source produces unreliable news. Since the advent of the Internet, unreliable news and hoaxes have deceived users. Social media and news outlets spread false information to increase viewership or as part of psychological competition. In this paper, we present an ensemble classifier trained on a set of labeled true and bogus news articles. We develop a text-based classification approach that uses SVM, Random Forest, Naïve Bayes, and Decision Tree as base learners in Bagging and AdaBoost. The aim of this work is to arrive at a solution that enables users to classify and filter out some of the false material. We show that the best-performing classifiers were AdaBoost-LinearSVM and AdaBoost-Random Forest, with 90.70% and 80.17% accuracy, respectively.
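As a rough illustration of the kind of ensemble described above, the following minimal sketch boosts a linear SVM and a random forest over text features. It is not the authors' implementation: the use of scikit-learn, the TF-IDF features, the helper name `boosted_text_classifier`, the hyperparameters, and the tiny toy corpus are all assumptions made for this example.

```python
import inspect

from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy corpus invented for illustration only; it is not the paper's data set.
texts = [
    "Central bank announces interest rate decision after scheduled meeting",
    "City council approves budget for road maintenance next year",
    "University publishes peer reviewed study on regional rainfall",
    "Shocking miracle cure doctors do not want you to know about",
    "You will not believe what this celebrity secretly did last night",
    "Secret government plot exposed by anonymous insider click now",
]
labels = [0, 0, 0, 1, 1, 1]  # 0 = reliable, 1 = unreliable


def boosted_text_classifier(base_learner, n_rounds=10):
    """Hypothetical helper: TF-IDF features followed by AdaBoost over base_learner."""
    kwargs = {"n_estimators": n_rounds, "random_state": 0}
    # LinearSVC has no predict_proba, so scikit-learn releases that still
    # expose the `algorithm` parameter need the discrete SAMME variant.
    # Releases that removed the parameter use SAMME by default.
    if "algorithm" in inspect.signature(AdaBoostClassifier).parameters:
        kwargs["algorithm"] = "SAMME"
    # The base learner is passed positionally because its keyword name
    # differs between scikit-learn versions.
    return make_pipeline(
        TfidfVectorizer(),
        AdaBoostClassifier(base_learner, **kwargs),
    )


models = {
    "AdaBoost-LinearSVM": boosted_text_classifier(LinearSVC()),
    "AdaBoost-RandomForest": boosted_text_classifier(
        RandomForestClassifier(n_estimators=10, random_state=0)
    ),
}

new_texts = [
    "Council publishes budget study",
    "Shocking secret cure exposed",
]

predictions = {}
for name, model in models.items():
    model.fit(texts, labels)
    predictions[name] = model.predict(new_texts)
    print(name, predictions[name])
```

A real evaluation would replace the toy corpus with a labeled data set, hold out a test split, and compare accuracy across base learners; swapping `AdaBoostClassifier` for `BaggingClassifier` in the helper would give the Bagging variants.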








Cite this article
Khan, M.Z., Alhazmi, O.H. Study and analysis of unreliable news based on content acquired using ensemble learning (prevalence of fake news on social media). Int J Syst Assur Eng Manag 11 (Suppl 2), 145–153 (2020). https://doi.org/10.1007/s13198-020-01016-4
DOI: https://doi.org/10.1007/s13198-020-01016-4