Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models

Saini, Hiteshi; Mehra, Himashri; Rani, Ritu; Jaiswal, Garima; Sharma, Arun; Dev, Amita

doi:10.1007/s13278-023-01158-w

Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models

Review Paper
Published: 02 December 2023

Volume 14, article number 1, (2024)
Cite this article

Social Network Analysis and Mining Aims and scope Submit manuscript

Hiteshi Saini¹,
Himashri Mehra¹,
Ritu Rani¹,
Garima Jaiswal²,
Arun Sharma¹ &
…
Amita Dev¹

372 Accesses
Explore all metrics

Abstract

Technological improvements have increased the number of people who use online social networking sites, resulting in an increase in cyberbullying. Bullies can attack victims through a large network of online social networking platforms. Cyberbullying is an umbrella term encompassing a wide range of online abuse, including but not limited to harassment, doxing, and reputation attacks. These attacks frequently leave the victim(s) with persistent mental scars, leading to desperate measures such as depression, self-harm, and suicidal thoughts. Given the effects of cyberbullying, there is an urgent need to prosecute and prevent such crimes. This paper gives a comprehensive review as well the empirical analysis of the machine learning, ensemble based and transformer-based models for the cyberbullying detection. This paper proposes two architectures to efficiently detect cyberbullying pattern. The proposed ensemble model makes use of CNN to extract the relevant features and the classification is performed by the SVM. Another proposed architecture utilizes the pre-trained model BERT to detect cyberbullying behavior on online platforms. Both the proposed models were tested on two separate datasets and achieved maximum accuracy of 96.88 and 97.34% for ensemble and BERT models, respectively. This paper provides a thorough examination of the various methodologies used for cyberbullying detection and conducts an empirical and comparative analysis of the presented models with traditional and current algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Detection and Prevention of Cyberbullying Using Ensemble Classifier

Smart Cyberbullying Detection with Machine Learning

A Robust Ensemble Learning Model for Fine-Grained Detection of Cyber Harassment

References

Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau RJ (2011) Sentiment analysis of twitter data. In: Proceedings of the workshop on language in social media (LSM 2011), pp 30–38
Ahmed MF, Mahmud Z, Biash ZT, Ryen AAN, Hossain A, Ashraf FB (2021) Cyberbullying detection using deep neural network from social media comments in bangla language. arXiv preprint arXiv:2106.04506
Al-Ajlan MA, Ykhlef M (2018) Optimized twitter cyberbullying detection based on deep learning. In: 2018 21st Saudi Computer Society National Computer Conference (NCC). IEEE, pp 1–5
Al-Garadi MA, Varathan KD, Ravana SD (2016) Cybercrime detection in online communications: the experimental case of cyberbullying detection in the Twitter network. Comput Hum Behav 63:433–443
Article Google Scholar
Almutiry S, Abdel Fattah M (2021) Arabic cyberbullying detection using arabic sentiment analysis. Egypt J Lang Eng 8(1):39–50
Article Google Scholar
Ates EC, Bostanci E, Guzel MS (2021) Comparative performance of machine learning algorithms in cyberbullying detection: using turkish language preprocessing techniques. arXiv preprint arXiv:2101.12718
Banerjee V, Telavane J, Gaikwad P, Vartak P (2019) Detection of cyberbullying using deep neural network. In: 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS), IEEE. pp 604–607
Bauman S, Cross D, Walker J (2013) Principles of cyberbullying research. In: definition, methods, and measures, p 2013
Bengio Y, Ducharme R, Vincent P (2000) A neural probabilistic language model. In: Advances in neural information processing systems, vol 13
Chawla NV (2009) Data mining for imbalanced datasets: an overview. In: Data mining and knowledge discovery handbook, pp.875–886
Huang Q, Singh VK, Atrey PK (2014) Cyber bullying detection using social and textual analysis. In: Proceedings of the 3rd international workshop on socially-aware multimedia, pp 3–6
Huang Q, Singh VK, Atrey PK (2014) Cyber bullying detection using social and textual analysis. In: Proceedings of the 3rd international workshop on socially-aware multimedia. pp 3–6
Jain V, Kumar V, Pal V, Vishwakarma DK (2021) Detection of cyberbullying on social media using machine learning. In: 2021 5th International Conference on Computing Methodologies and Communication (ICCMC). IEEE, pp 1091–1096
Maher D (2008) Cyberbullying: an ethnographic case study of one Australian upper primary school class. Youth Stud Australia 27(4):50–57
MathSciNet Google Scholar
Mangaonkar A, Hayrapetian A, Raje R (2015) Collaborative detection of cyberbullying behavior in Twitter data. In: 2015 IEEE international conference on electro/information technology (EIT), IEEE, pp 611–616
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, vol 26
Nandhini BS, Sheeba JI (2015) Cyberbullying detection and classification using information retrieval algorithm. In: Proceedings of the 2015 international conference on advanced research in computer science engineering & technology (ICARCSET 2015), pp 1–5
Perera A, Fernando P (2021) Accurate cyberbullying detection and prevention on social media. Procedia Comput Sci 181:605–611
Article Google Scholar
Raj C, Agarwal A, Bharathy G, Narayan B, Prasad M (2021) Cyberbullying detection: hybrid models based on machine learning and natural language processing techniques. Electronics 10(22):2810
Article Google Scholar
Reynolds K, Kontostathis A, Edwards L (2011) Using machine learning to detect cyberbullying. In: 2011 10th International conference on machine learning and applications and workshops, vol 2. IEEE, pp 241–244
Rosa H, Pereira N, Ribeiro R, Ferreira PC, Carvalho JP, Oliveira S, Coheur L, Paulino P, Simão AV, Trancoso I (2019) Automatic cyberbullying detection: a systematic review. Comput Hum Behav 93:333–345
Article Google Scholar
Roy PK, Singh A, Tripathy AK, Das TK (2022) Cyberbullying detection: an ensemble learning approach. Int J Comput Sci Eng 25(3):315–324
Google Scholar
Sahni A, Raja N (2017) Analyzation and detection of cyberbullying: a Twitter based Indian case study. In: International Conference on Recent Developments in Science, Engineering and Technology, Springer, Singapore, pp 484–497.
Saravanaraj A, Sheeba JI, Devaneyan SP (2016) Automatic detection of cyberbullying from twitter. Int J Comput Sci Inf Technol Secur 6(6):26–31
Google Scholar
Singh VK, Huang Q, Atrey PK (2016) Cyberbullying detection using probabilistic socio-textual information fusion. In: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), IEEE, pp 884–887
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, vol 30
Wang J, Fu K, Lu CT (2020) SOSNet: a graph convolutional network approach to fine-grained cyberbullying detection. In: IEEE international conference on big data (Big Data). Atlanta, GA, USA, pp 1699–1708. https://doi.org/10.1109/BigData50022.2020.9378065
Zhao R, Mao K (2016) Cyberbullying detection based on semantic-enhanced marginalized denoising auto-encoder. IEEE Trans Affect Comput 8(3):328–339
Article Google Scholar

Download references

Author information

Authors and Affiliations

Indira Gandhi Delhi Technical University for Women, New Delhi, India
Hiteshi Saini, Himashri Mehra, Ritu Rani, Arun Sharma & Amita Dev
Bennett University, Greater Noida, India
Garima Jaiswal

Authors

Hiteshi Saini
View author publications
You can also search for this author in PubMed Google Scholar
Himashri Mehra
View author publications
You can also search for this author in PubMed Google Scholar
Ritu Rani
View author publications
You can also search for this author in PubMed Google Scholar
Garima Jaiswal
View author publications
You can also search for this author in PubMed Google Scholar
Arun Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Amita Dev
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

[HS]: "Conceived and designed the experiments," "Performed the experiments," "Analyzed the data," "Wrote the paper," [Himashri Mehra]: "Conceived and designed the experiments," "Performed the experiments," "Analyzed the data," "Wrote the paper," [RR]: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise. GJ: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise AS: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise AD: Supervision, "Analyzed the data," "Wrote the paper,” “Reviewed and edited the paper", Provided technical expertise

Corresponding author

Correspondence to Ritu Rani.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interest related to this study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Saini, H., Mehra, H., Rani, R. et al. Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models. Soc. Netw. Anal. Min. 14, 1 (2024). https://doi.org/10.1007/s13278-023-01158-w

Download citation

Received: 07 June 2023
Revised: 09 August 2023
Accepted: 25 October 2023
Published: 02 December 2023
DOI: https://doi.org/10.1007/s13278-023-01158-w

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models

Abstract

Access this article

Similar content being viewed by others

Detection and Prevention of Cyberbullying Using Ensemble Classifier

Smart Cyberbullying Detection with Machine Learning

A Robust Ensemble Learning Model for Fine-Grained Detection of Cyber Harassment

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Enhancing cyberbullying detection: a comparative study of ensemble CNN–SVM and BERT models

Abstract

Access this article

Similar content being viewed by others

Detection and Prevention of Cyberbullying Using Ensemble Classifier

Smart Cyberbullying Detection with Machine Learning

A Robust Ensemble Learning Model for Fine-Grained Detection of Cyber Harassment

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation