Skip to main content
Log in

Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models

  • Review Paper
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Technological improvements have increased the number of people who use online social networking sites, resulting in an increase in cyberbullying. Bullies can attack victims through a large network of online social networking platforms. Cyberbullying is an umbrella term encompassing a wide range of online abuse, including but not limited to harassment, doxing, and reputation attacks. These attacks frequently leave the victim(s) with persistent mental scars, leading to desperate measures such as depression, self-harm, and suicidal thoughts. Given the effects of cyberbullying, there is an urgent need to prosecute and prevent such crimes. This paper gives a comprehensive review as well the empirical analysis of the machine learning, ensemble based and transformer-based models for the cyberbullying detection. This paper proposes two architectures to efficiently detect cyberbullying pattern. The proposed ensemble model makes use of CNN to extract the relevant features and the classification is performed by the SVM. Another proposed architecture utilizes the pre-trained model BERT to detect cyberbullying behavior on online platforms. Both the proposed models were tested on two separate datasets and achieved maximum accuracy of 96.88 and 97.34% for ensemble and BERT models, respectively. This paper provides a thorough examination of the various methodologies used for cyberbullying detection and conducts an empirical and comparative analysis of the presented models with traditional and current algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau RJ (2011) Sentiment analysis of twitter data. In: Proceedings of the workshop on language in social media (LSM 2011), pp 30–38

  • Ahmed MF, Mahmud Z, Biash ZT, Ryen AAN, Hossain A, Ashraf FB (2021) Cyberbullying detection using deep neural network from social media comments in bangla language. arXiv preprint arXiv:2106.04506

  • Al-Ajlan MA, Ykhlef M (2018) Optimized twitter cyberbullying detection based on deep learning. In: 2018 21st Saudi Computer Society National Computer Conference (NCC). IEEE, pp 1–5

  • Al-Garadi MA, Varathan KD, Ravana SD (2016) Cybercrime detection in online communications: the experimental case of cyberbullying detection in the Twitter network. Comput Hum Behav 63:433–443

    Article  Google Scholar 

  • Almutiry S, Abdel Fattah M (2021) Arabic cyberbullying detection using arabic sentiment analysis. Egypt J Lang Eng 8(1):39–50

    Article  Google Scholar 

  • Ates EC, Bostanci E, Guzel MS (2021) Comparative performance of machine learning algorithms in cyberbullying detection: using turkish language preprocessing techniques. arXiv preprint arXiv:2101.12718

  • Banerjee V, Telavane J, Gaikwad P, Vartak P (2019) Detection of cyberbullying using deep neural network. In: 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS), IEEE. pp 604–607

  • Bauman S, Cross D, Walker J (2013) Principles of cyberbullying research. In: definition, methods, and measures, p 2013

  • Bengio Y, Ducharme R, Vincent P (2000) A neural probabilistic language model. In: Advances in neural information processing systems, vol 13

  • Chawla NV (2009) Data mining for imbalanced datasets: an overview. In: Data mining and knowledge discovery handbook, pp.875–886

  • Huang Q, Singh VK, Atrey PK (2014) Cyber bullying detection using social and textual analysis. In: Proceedings of the 3rd international workshop on socially-aware multimedia, pp 3–6

  • Huang Q, Singh VK, Atrey PK (2014) Cyber bullying detection using social and textual analysis. In: Proceedings of the 3rd international workshop on socially-aware multimedia. pp 3–6

  • Jain V, Kumar V, Pal V, Vishwakarma DK (2021) Detection of cyberbullying on social media using machine learning. In: 2021 5th International Conference on Computing Methodologies and Communication (ICCMC). IEEE, pp 1091–1096

  • Maher D (2008) Cyberbullying: an ethnographic case study of one Australian upper primary school class. Youth Stud Australia 27(4):50–57

    MathSciNet  Google Scholar 

  • Mangaonkar A, Hayrapetian A, Raje R (2015) Collaborative detection of cyberbullying behavior in Twitter data. In: 2015 IEEE international conference on electro/information technology (EIT), IEEE, pp 611–616

  • Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, vol 26

  • Nandhini BS, Sheeba JI (2015) Cyberbullying detection and classification using information retrieval algorithm. In: Proceedings of the 2015 international conference on advanced research in computer science engineering & technology (ICARCSET 2015), pp 1–5

  • Perera A, Fernando P (2021) Accurate cyberbullying detection and prevention on social media. Procedia Comput Sci 181:605–611

    Article  Google Scholar 

  • Raj C, Agarwal A, Bharathy G, Narayan B, Prasad M (2021) Cyberbullying detection: hybrid models based on machine learning and natural language processing techniques. Electronics 10(22):2810

    Article  Google Scholar 

  • Reynolds K, Kontostathis A, Edwards L (2011) Using machine learning to detect cyberbullying. In: 2011 10th International conference on machine learning and applications and workshops, vol 2. IEEE, pp 241–244

  • Rosa H, Pereira N, Ribeiro R, Ferreira PC, Carvalho JP, Oliveira S, Coheur L, Paulino P, Simão AV, Trancoso I (2019) Automatic cyberbullying detection: a systematic review. Comput Hum Behav 93:333–345

    Article  Google Scholar 

  • Roy PK, Singh A, Tripathy AK, Das TK (2022) Cyberbullying detection: an ensemble learning approach. Int J Comput Sci Eng 25(3):315–324

    Google Scholar 

  • Sahni A, Raja N (2017) Analyzation and detection of cyberbullying: a Twitter based Indian case study. In: International Conference on Recent Developments in Science, Engineering and Technology, Springer, Singapore, pp 484–497.

  • Saravanaraj A, Sheeba JI, Devaneyan SP (2016) Automatic detection of cyberbullying from twitter. Int J Comput Sci Inf Technol Secur 6(6):26–31

    Google Scholar 

  • Singh VK, Huang Q, Atrey PK (2016) Cyberbullying detection using probabilistic socio-textual information fusion. In: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), IEEE, pp 884–887

  • Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, vol 30

  • Wang J, Fu K, Lu CT (2020) SOSNet: a graph convolutional network approach to fine-grained cyberbullying detection. In: IEEE international conference on big data (Big Data). Atlanta, GA, USA, pp 1699–1708. https://doi.org/10.1109/BigData50022.2020.9378065

  • Zhao R, Mao K (2016) Cyberbullying detection based on semantic-enhanced marginalized denoising auto-encoder. IEEE Trans Affect Comput 8(3):328–339

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

[HS]: "Conceived and designed the experiments," "Performed the experiments," "Analyzed the data," "Wrote the paper," [Himashri Mehra]: "Conceived and designed the experiments," "Performed the experiments," "Analyzed the data," "Wrote the paper," [RR]: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise. GJ: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise AS: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise AD: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise

Corresponding author

Correspondence to Ritu Rani.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interest related to this study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Saini, H., Mehra, H., Rani, R. et al. Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models. Soc. Netw. Anal. Min. 14, 1 (2024). https://doi.org/10.1007/s13278-023-01158-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-023-01158-w

Keywords

Navigation