Enhancing Low-Resource Bangla Fake News Detection through Deep Convolutional Neural Networks

  • Conference paper
Intelligent Computing and Optimization (ICO 2023)

Abstract

Detecting false information is critical in today’s digital landscape, especially for languages such as Bangla that lack abundant natural language processing (NLP) resources. Misinformation spreads rapidly through online platforms, particularly within Bangla-speaking communities, and the limited availability of Bangla NLP tools has made it difficult to build reliable models for identifying deceptive content. Researchers have nonetheless made notable progress in classifying Bangla news with deep learning techniques and language models such as BERT. This study presents a detailed exploration of a deep convolutional neural network (CNN) model tailored to classifying Bangla news articles as authentic or counterfeit. Integrating a BERT-style language model (Bangla Electra) into the architecture achieved an accuracy of 94.33%, and our proprietary model surpassed this at 94.5%. To ensure reliable results, several NLP techniques were applied during data preprocessing: data cleansing, tokenization, stop word and punctuation removal, and stemming. Feature extraction combined TF-IDF and Bag of Words representations. The dataset, obtained from Kaggle, comprised 7,000 genuine and 1,000 counterfeit news texts. Overall, this research contributes to Bangla news classification by demonstrating that deep CNN models can accurately distinguish legitimate from fabricated news articles.
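The preprocessing and feature extraction steps named in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the stop-word list, the suffix list used for stemming, the smoothed IDF formula, and the choice to combine Bag of Words and TF-IDF by concatenating the two vectors are all assumptions made for illustration, and every function name is hypothetical.

```python
import math
import unicodedata
from collections import Counter

# Hypothetical, tiny resources for illustration only; a real system would
# use a curated Bangla stop-word list and a proper stemmer.
STOP_WORDS = {"এবং", "ও", "এই"}
SUFFIXES = ("গুলো", "দের", "টি")


def stem(token, suffixes=SUFFIXES):
    """Strip the first matching suffix, keeping at least two characters."""
    for suffix in suffixes:
        if token.endswith(suffix) and len(token) > len(suffix) + 1:
            return token[: -len(suffix)]
    return token


def preprocess(text, stop_words=STOP_WORDS):
    """Clean, tokenize, drop stop words and punctuation, then stem."""
    # Replace every Unicode punctuation character (including the danda)
    # with a space. Bangla vowel signs are marks, not punctuation, so
    # they are preserved.
    cleaned = "".join(
        " " if unicodedata.category(ch).startswith("P") else ch for ch in text
    )
    tokens = [tok for tok in cleaned.split() if tok not in stop_words]
    return [stem(tok) for tok in tokens]


def bag_of_words(docs):
    """Return the sorted vocabulary and one raw-count vector per document."""
    vocab = sorted({tok for doc in docs for tok in doc})
    index = {tok: i for i, tok in enumerate(vocab)}
    rows = []
    for doc in docs:
        row = [0] * len(vocab)
        for tok, count in Counter(doc).items():
            row[index[tok]] = count
        rows.append(row)
    return vocab, rows


def tfidf(counts):
    """Weight raw counts by a smoothed inverse document frequency."""
    n_docs = len(counts)
    n_terms = len(counts[0]) if counts else 0
    doc_freq = [sum(1 for row in counts if row[j] > 0) for j in range(n_terms)]
    idf = [math.log((1 + n_docs) / (1 + df)) + 1.0 for df in doc_freq]
    return [[c * w for c, w in zip(row, idf)] for row in counts]


def combined_features(docs):
    """Assumed combination: concatenate count and TF-IDF vectors."""
    vocab, counts = bag_of_words(docs)
    weighted = tfidf(counts)
    return vocab, [c + w for c, w in zip(counts, weighted)]


raw = ["খবরটি সত্য।", "এই খবর মিথ্যা!"]
docs = [preprocess(text) for text in raw]
vocab, features = combined_features(docs)
```

In this toy run, stemming maps the suffixed form in the first text to the same token as the bare word in the second, so both documents share one vocabulary entry. That shared term receives a lower TF-IDF weight than the terms unique to each document, which is the behaviour that makes TF-IDF a useful complement to raw counts.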

Notes

  1. https://en.wikipedia.org/wiki/Bengali_language

  2. https://huggingface.co/monsoon-nlp/bangla-electra

Author information

Corresponding author

Correspondence to Tanjim Mahmud.

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Habiba, S.U. et al. (2024). Enhancing Low-Resource Bangla Fake News Detection through Deep Convolutional Neural Networks. In: Vasant, P., et al. Intelligent Computing and Optimization. ICO 2023. Lecture Notes in Networks and Systems, vol 1167. Springer, Cham. https://doi.org/10.1007/978-3-031-73318-5_11
