Abstract
Technological improvements have increased the number of people who use online social networking sites, resulting in an increase in cyberbullying. Bullies can attack victims through a large network of online social networking platforms. Cyberbullying is an umbrella term encompassing a wide range of online abuse, including but not limited to harassment, doxing, and reputation attacks. These attacks frequently leave the victim(s) with persistent mental scars, leading to desperate measures such as depression, self-harm, and suicidal thoughts. Given the effects of cyberbullying, there is an urgent need to prosecute and prevent such crimes. This paper gives a comprehensive review as well the empirical analysis of the machine learning, ensemble based and transformer-based models for the cyberbullying detection. This paper proposes two architectures to efficiently detect cyberbullying pattern. The proposed ensemble model makes use of CNN to extract the relevant features and the classification is performed by the SVM. Another proposed architecture utilizes the pre-trained model BERT to detect cyberbullying behavior on online platforms. Both the proposed models were tested on two separate datasets and achieved maximum accuracy of 96.88 and 97.34% for ensemble and BERT models, respectively. This paper provides a thorough examination of the various methodologies used for cyberbullying detection and conducts an empirical and comparative analysis of the presented models with traditional and current algorithms.
Similar content being viewed by others
References
Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau RJ (2011) Sentiment analysis of twitter data. In: Proceedings of the workshop on language in social media (LSM 2011), pp 30–38
Ahmed MF, Mahmud Z, Biash ZT, Ryen AAN, Hossain A, Ashraf FB (2021) Cyberbullying detection using deep neural network from social media comments in bangla language. arXiv preprint arXiv:2106.04506
Al-Ajlan MA, Ykhlef M (2018) Optimized twitter cyberbullying detection based on deep learning. In: 2018 21st Saudi Computer Society National Computer Conference (NCC). IEEE, pp 1–5
Al-Garadi MA, Varathan KD, Ravana SD (2016) Cybercrime detection in online communications: the experimental case of cyberbullying detection in the Twitter network. Comput Hum Behav 63:433–443
Almutiry S, Abdel Fattah M (2021) Arabic cyberbullying detection using arabic sentiment analysis. Egypt J Lang Eng 8(1):39–50
Ates EC, Bostanci E, Guzel MS (2021) Comparative performance of machine learning algorithms in cyberbullying detection: using turkish language preprocessing techniques. arXiv preprint arXiv:2101.12718
Banerjee V, Telavane J, Gaikwad P, Vartak P (2019) Detection of cyberbullying using deep neural network. In: 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS), IEEE. pp 604–607
Bauman S, Cross D, Walker J (2013) Principles of cyberbullying research. In: definition, methods, and measures, p 2013
Bengio Y, Ducharme R, Vincent P (2000) A neural probabilistic language model. In: Advances in neural information processing systems, vol 13
Chawla NV (2009) Data mining for imbalanced datasets: an overview. In: Data mining and knowledge discovery handbook, pp.875–886
Huang Q, Singh VK, Atrey PK (2014) Cyber bullying detection using social and textual analysis. In: Proceedings of the 3rd international workshop on socially-aware multimedia, pp 3–6
Huang Q, Singh VK, Atrey PK (2014) Cyber bullying detection using social and textual analysis. In: Proceedings of the 3rd international workshop on socially-aware multimedia. pp 3–6
Jain V, Kumar V, Pal V, Vishwakarma DK (2021) Detection of cyberbullying on social media using machine learning. In: 2021 5th International Conference on Computing Methodologies and Communication (ICCMC). IEEE, pp 1091–1096
Maher D (2008) Cyberbullying: an ethnographic case study of one Australian upper primary school class. Youth Stud Australia 27(4):50–57
Mangaonkar A, Hayrapetian A, Raje R (2015) Collaborative detection of cyberbullying behavior in Twitter data. In: 2015 IEEE international conference on electro/information technology (EIT), IEEE, pp 611–616
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, vol 26
Nandhini BS, Sheeba JI (2015) Cyberbullying detection and classification using information retrieval algorithm. In: Proceedings of the 2015 international conference on advanced research in computer science engineering & technology (ICARCSET 2015), pp 1–5
Perera A, Fernando P (2021) Accurate cyberbullying detection and prevention on social media. Procedia Comput Sci 181:605–611
Raj C, Agarwal A, Bharathy G, Narayan B, Prasad M (2021) Cyberbullying detection: hybrid models based on machine learning and natural language processing techniques. Electronics 10(22):2810
Reynolds K, Kontostathis A, Edwards L (2011) Using machine learning to detect cyberbullying. In: 2011 10th International conference on machine learning and applications and workshops, vol 2. IEEE, pp 241–244
Rosa H, Pereira N, Ribeiro R, Ferreira PC, Carvalho JP, Oliveira S, Coheur L, Paulino P, Simão AV, Trancoso I (2019) Automatic cyberbullying detection: a systematic review. Comput Hum Behav 93:333–345
Roy PK, Singh A, Tripathy AK, Das TK (2022) Cyberbullying detection: an ensemble learning approach. Int J Comput Sci Eng 25(3):315–324
Sahni A, Raja N (2017) Analyzation and detection of cyberbullying: a Twitter based Indian case study. In: International Conference on Recent Developments in Science, Engineering and Technology, Springer, Singapore, pp 484–497.
Saravanaraj A, Sheeba JI, Devaneyan SP (2016) Automatic detection of cyberbullying from twitter. Int J Comput Sci Inf Technol Secur 6(6):26–31
Singh VK, Huang Q, Atrey PK (2016) Cyberbullying detection using probabilistic socio-textual information fusion. In: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), IEEE, pp 884–887
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, vol 30
Wang J, Fu K, Lu CT (2020) SOSNet: a graph convolutional network approach to fine-grained cyberbullying detection. In: IEEE international conference on big data (Big Data). Atlanta, GA, USA, pp 1699–1708. https://doi.org/10.1109/BigData50022.2020.9378065
Zhao R, Mao K (2016) Cyberbullying detection based on semantic-enhanced marginalized denoising auto-encoder. IEEE Trans Affect Comput 8(3):328–339
Author information
Authors and Affiliations
Contributions
[HS]: "Conceived and designed the experiments," "Performed the experiments," "Analyzed the data," "Wrote the paper," [Himashri Mehra]: "Conceived and designed the experiments," "Performed the experiments," "Analyzed the data," "Wrote the paper," [RR]: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise. GJ: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise AS: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise AD: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest related to this study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Saini, H., Mehra, H., Rani, R. et al. Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models. Soc. Netw. Anal. Min. 14, 1 (2024). https://doi.org/10.1007/s13278-023-01158-w
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-023-01158-w