Skip to main content

Advertisement

Log in

Advancements in Fake News Detection: A Comprehensive Machine Learning Approach Across Varied Datasets

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Fake news has become a major social problem in the current period, controlled by modern technology and the unrestricted flow of information across digital platforms. The deliberate spread of inaccurate or misleading information jeopardizes the public's ability to make educated decisions and seriously threatens the credibility of news sources. This study thoroughly examines the intricate terrain of identifying false news, utilizing state-of-the-art tools and creative approaches to tackle this crucial problem at the nexus of information sharing and technology. The study uses advanced machine learning (ML) models comprising multinomial Naive Bayes (MNB), linear support vector classifiers (SVC), random forests (RF), logistic regression (LR), gradient boosting (GB), decision trees (DT), and to discern and identify instances of fake news. The research shows remarkable performance using publicly available datasets, achieving 94% accuracy on the first dataset and 84% on the second. These results underscore the model's efficacy in reliably detecting fake news, thereby contributing substantially to the ongoing discourse on countering misinformation in the digital age. The research not only delves into the technical intricacies of employing diverse ML models but also emphasizes the broader societal implications of mitigating the impact of fake news on public discourse. The findings highlight the pressing need for proactive measures in developing robust systems capable of effectively identifying and addressing the propagation of false information. As technology evolves, the insights derived from this research serve as a foundation for advancing strategies to uphold the integrity of information sources and safeguard the public's ability to make well-informed decisions in an increasingly digitalized world.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The dataset is publicly available (given in Ref [9]).

References

  1. Abdulrahman A, Baykara M. Fake news detection using machine learning and deep learning algorithms. In: 2020 international conference on advanced science and engineering (ICOASE). 2020. https://doi.org/10.1109/icoase51841.2020.9436605.

  2. Ahmed H, Traore I, Saad S. Detection of online fake news using N-gram analysis and machine learning techniques. 2017

  3. Ahmed H, Traore I, Saad S. Detecting opinion spams and fake news using text classification. J Secur Priv. 2018;1(1) (Wiley).

  4. Amazeen MA. Checking the fact-checkers in 2008: predicting political ad scrutiny and assessing consistency. J Political Mark. 2016;15(4):433–64.

    Article  Google Scholar 

  5. Arora P, Pandey M. Fake news detection using machine learning: a survey. 2020.

  6. Baptista JP, Gradim A. Understanding fake news consumption: a reviaew. MDPI. 2020. https://doi.org/10.3390/socsci9100185

  7. Beutel I, Kirschler O, Kokott S. How do fake news and hate speech affect political discussion and target persons and how can they be detected? Central and Eastern European eDem and eGov Days. 2022;342:37–81.

    Article  Google Scholar 

  8. Bharadwaj P, Shao Z. Detecting fake news: a deep learning approach. arXiv preprint arXiv:1903.05781. 2019

  9. Bozku E. Fake news detection datasets. Kaggle. 2022. https://www.kaggle.com/datasets/emineyetm/fake-news-detection-datasets

  10. Capuano N, Fenza G, Loia V, Nota FD. Content based fake news detection with machine and deep learning: a systematic review. Neurocomputing. 2023.

  11. Chelmis C, Cambria E, SI L. Fake news detection on social media: a data mining perspective. ACM Trans Data Sci. 2019.

  12. Chen L, Zhang T, Li T. Gradient boosting model for unbalanced quantitative mass spectra quality assessment. In: 2017 International conference on security, pattern analysis, and cybernetics (SPAC). 2017. https://doi.org/10.1109/spac.2017.8304311

  13. Chen Y, Zhang J, Wu L, Yin D, Zhou X. Fake news detection with human-in-the-loop. 2021.

  14. Wu C, Dongol B, Sifa R, Winslett M. Fake news detection on social media using a multimodal approach. In: Proceedings of the 2020 ACM multimedia conference. 2020.

  15. Da San Martino G, Strapparava C. Leveraging linguistic patterns for fake news detection and mitigation. In: Proceedings of the 27th International conference on computational linguistics. 2018.

  16. Ghenai A, Davey ARH. Real-time classification of news into fact-checked categories. In: IEEE/ACM International conference on advances in social networks analysis and mining. 2018.

  17. Granik M, Mesyura V. Fake news detection using naive Bayes classifier. In 2017 IEEE first Ukraine conference on electrical and computer engineering (UKRCON). IEEE. 2017. pp. 900–903.

  18. Granik M, Mayer KJ, Sukthankar G. Fake news detection on social media: a data mining perspective. In: ACM transactions on data science. 2018.

  19. Islam F, Barbier G, Rokne J. Fake news detection in social media using linguistic metaproperties. ACM Trans Inf Syst. 2018.

  20. Jiang B, Shi Y, Song Y, Li C, Zhang W. Fake news detection with attention mechanism. 2020.

  21. Kabalci Y, Koçabaş B. Fake news detection using machine learning: an ensemble learning approach. J Ambient Intell Humaniz Comput. 2020.

  22. Kaliyar RK. A comprehensive survey on fake news detection techniques. In Handbook of research on applications and implementations of machine learning and artificial intelligence. 2021.

  23. Kaliyar RK, Goswami A, Narang P. Multiclass fake news detection using ensemble machine learning. In: 2019 IEEE 9th international conference on advanced computing (IACC). 2019. https://doi.org/10.1109/iacc48062.2019.8971579

  24. Kaliyar RK, Goswami A, Narang P. Fakebert: fake news detection in social media with a Bert-based deep learning approach. Multimed Tools Appl. 2021;80(8):11765–88. https://doi.org/10.1007/s11042-020-10183-2.

    Article  Google Scholar 

  25. Karimi H, Tang J. Fake news detection: a hierarchical discourse-level structure (HDSF). 2021.

  26. Kaur S, Kaur P. A multilevel fake news detection system using ensemble learning. Int J Mach Learn Cybern. 2019.

  27. Kaur S, Kaur P. Fake news detection using hybrid technique of machine learning classifiers. In Soft Computing and Signal Processing. 2021.

  28. Kudarvalli H, Fiaidhi J. Detecting fake news using machine learning algorithms. 2020. https://doi.org/10.36227/techrxiv.12089133

  29. Li Z, Zhang Y, Wang S (2021) Fake news detection with transfer learning.

  30. Liu J, Cao Y, Lin C, Shu K, Wang S. Fake news detection with multimodal analysis. 2020.

  31. Liu Y, Wu YFB. Early detection of fake news on social media through propagation path classification with recurrent and Convolutional Networks. New Jersey Institute of Technology. https://researchwith.njit.edu/en/publications/early-detection-of-fake-news-on-social-media-through-propagation-. 1970

  32. Mandical RR, Thampi GT, George G, Mathew G (2019) Fake news classification using machine learning: a comprehensive evaluation. Fut Gen Comput Syst.

  33. Mishra R, Bhatia K, Singh A. Fake news detection with recurrent neural networks. 2019. arXiv preprint arXiv:1903.08989.

  34. Olan F, Jayawickrama U, Arakpogun EO, Suklan J, Liu S. Fake news on social media: the Impact on Society. Inf Syst Front. 2022; 1–16.

  35. Potthast, M., Kiesel, J., Reinartz, K., Bevendorff, J., Stein, B., & Holzinger, A. (2017). A Stylometric Inquiry into Hyperpartisan and Fake News.

  36. Preotiuc-Pietro D, Gaman M, Aletras N. Automatically identifying complaints in social media. 2019. arXiv.org. https://arxiv.org/abs/1906.03890

  37. Puri H, Apte V. Detecting fake news on social media using geolocation information. 2020.

  38. Qian F, Gong C, Sharma K, Liu Y. Neural user response generator: Fake news detection with collective user intelligence. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence. 2018. https://doi.org/10.24963/ijcai.2018/533

  39. Rasheed J. Analyzing the effect of filtering and feature-extraction techniques in a machine learning model for identification of infectious disease using radiography imaging. Symmetry. 2022;14(7):1398. https://doi.org/10.3390/sym14071398.

    Article  Google Scholar 

  40. Shu K, Mahudeswaran D, Wang S, Lee D, Liu H. Fake news detection on social media: a data mining perspective. ACM Comput. 2019.

  41. Shu K, Sliva A, Wang S, Tang J, Liu H. Fake news detection on social media. ACM SIGKDD Explor Newsl. 2017;19(1):22–36. https://doi.org/10.1145/3137597.3137600.

    Article  Google Scholar 

  42. Shu K, Sliva A, Wang S, Tang J, Liu H. Fake news detection on social media: a data mining perspective. 2017. arXiv.org. https://arxiv.org/abs/1708.01967

  43. Singh V, Khamparia A, Gudnavar S. Fake news detection using support vector machine and linguistic inquiry word count. In: Advanced data analytics for improved business outcomes. 2020.

  44. Tahir T, Gence S, Rasool G, Umer T, Rasheed J, Yeo SF, Cevik T. Early software defects density prediction: training the international software benchmarking cross projects data using supervised learning. IEEE Access. 2023. https://doi.org/10.1109/ACCESS.2023.3339994.

    Article  Google Scholar 

  45. Verma A, Agrawal P, Hussain M, Varma V. Fake news detection with adversarial learning. 2020

  46. Wang X, Yu F, Jiang L, Meng D. Fake news detection with meta-learning. 2021.

  47. Waziry S, Wardak AB, Rasheed J, Shubair RM, Rajab K, Shaikh A. Performance comparison of machine learning driven approaches for classification of complex noises in quick response code images. Heliyon. 2023. https://doi.org/10.1016/j.heliyon.2023.e15108.

    Article  Google Scholar 

  48. Wu H, Yang P, Zhang C, Wu W, Yu Y (2020) Fake news detection with fact-checking

  49. Zhang S, Dou D, Liu N, Tang J, Lin CY, Cheng X. Combating fake news: a survey on identification and mitigation techniques. ACM Trans Intell Syst Technol (TIST). 2019.

  50. Zhang Y, Huang W, Liu Y (2021) Fake news detection with crowdsourcing.

  51. Zhao H, Chen X, Nguyen T, Huang JZ, Williams G, Chen H. Stratified over-sampling bagging method for random forests on imbalanced data. Intell Secur Inform. 2016. https://doi.org/10.1007/978-3-319-31863-9_5.

    Article  Google Scholar 

Download references

Acknowledgements

All the authors appreciate the valuable contributions of Dr. Fahad Mahmoud Ghabban to the study.

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jawad Rasheed.

Ethics declarations

Conflict of Interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Aslam, A., Abid, F., Rasheed, J. et al. Advancements in Fake News Detection: A Comprehensive Machine Learning Approach Across Varied Datasets. SN COMPUT. SCI. 5, 583 (2024). https://doi.org/10.1007/s42979-024-02943-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-024-02943-w

Keywords

Navigation