Skip to main content
Log in

Performance analysis of annotation detection techniques for cyber-bullying messages using word-embedded deep neural networks

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

In recent times, online harassment due to cyber-bullying is significantly increased with the growth of social media users. Cyber-bullying is a technique to harass users using electronic messages. Many researchers attack this problem using natural language processing. Most of them detect whether a message is a bully or not. In this paper, multiple deep learning models are introduced to detect not only bullying messages but also the annotation of cyber-bullying. Annotation detection of cyber-bullying assigns a proper description in which category a message belongs. The advantage of annotation detection is to warn the user by giving an alert message with proper annotation when the user sends or posts a message on social media. If this feature is combined with popular social network sites like Facebook, Twitter, WhatsApp, etc., this can be an additional filter to alert the user that they are going to post or send a bullied message of which type. Social media messages are unstructured as it includes text, URL link, emojis, abbreviations, etc. Most of the previous works are conducted to detect bullying messages only considering important words in the text, neglecting the other attributes in the message like URL links, emojis, and abbreviations. In this paper, an advanced pre-processing technique is proposed by considering some of the attributes in the messages like URL, abbreviation, number, emojis, etc., to detect bullying messages. In this work, six models, i.e., three deep learning models combined with two different word-embedding models have been employed for annotation detection. The performances of each of these six models are measured twice, by employing traditional pre-processing, and proposed advanced pre-processing. The experimental results show that the advanced pre-processing works better in the case of all six models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Agrawal S, Awekar A (2018) Deep learning for detecting cyberbullying across multiple social media platforms. In: European conference on information retrieval, Springer, pp 141–153

  • Al-Ajlan MA, Ykhlef M (2018) Optimized Twitter Cyberbullying Detection based on Deep Learning. In: 21st Saudi computer society national computer conference (NCC)

  • Arseneault L, Shakoor S (2009) Bullying victimization in youths and mental health problems: 'Much ado about nothing'?. Psychol Med 40

  • Banerjee V, Telavane J, Gaikwad P, Vartak P (2019) Detection of cyberbullying using deep neural network. In: 5th International conference on advanced computing and communication systems (ICACCS)

  • Bradley AE (1997) The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159

    Article  Google Scholar 

  • Bruwaene DV, Huanget Q, Inkpen D (2020) A multi-platform dataset for detecting cyberbullying in social media. Lang Resour Eval 54(4):851–874

    Article  Google Scholar 

  • Camacho-Collados J, Pileavar MT (2018) On the role of text preprocessing in neural network architectures: an evaluation study on text categorization and sentiment analysis. In: Proceedings of the 2018 EMNLP workshop blackbox NLP: analyzing and interpreting neural networks for NLP, pp 40–46

  • Campbell M, Bauman S (2018) Cyberbullying: definition, consequences, prevalence. reducing cyberbullying in schools: international evidence-based best practices, Elsevier, London, UK, pp. 3–16.

  • Chatzakou D, Leontiadis I, Blackburn J, Cristofaro ED, Stringhini G, Vakali A, Kourtellis N (2019) Detecting cyberbullying and cyberaggression in social media. ACM Trans Web (TWEB) 13(3):1–51

    Article  Google Scholar 

  • Cheng L, Li J, Silva YN, Hall DL, Liu H (2019) PIBully: personalized cyberbullying detection with peer influence. In: Proceeding of twenty-eighth international jt. conference artificial intelligence, pp 5829–5835

  • Cyberbullying data set (2020) https://data.mendeley.com/datasets/jf4pzyvnpj/1#__sid=js0

  • Dadvar M, Eckert K (2020) cyberbullying detection in social networks using deep learning based models. Springer Nature Switzerland AG 2020:245–255

    Google Scholar 

  • Ghosh S, Chaki A, Kudeshia A (2021) Cyberbully detection using 1D-CNN and LSTM. In: Proceedings of international conference on communication, circuits, and systems, Springer Singapore, pp 295–301

  • Helmy AA, Omar YMK, Hodhod R (2018) An innovative word encoding method for text classification using convolutional neural network. In: 14th International computer engineering conference (ICENCO2018), pp 42–47

  • Iwendi C, Srivastava G, Khan S, Maddikunta PK (2020) Cyberbullying detection solutions based on deep learning architectures. Springer-Verlag GmbH Germany, part of Springer Nature

  • Lu N, Wu G, Zhang Z, Zheng Y, Ren Y, ChooK R (2020) Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts. Concurrency Comput Pract Exp

  • Muneer A, Fati SM (2020) A comparative analysis of machine learning techniques for cyberbullying detection on Twitter. Future Internet 12(11):187

    Article  Google Scholar 

  • Pennington J, Socher R, Manning CD (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP).

  • Pew Research Center (2018) A majority of teens have experienced some form of cyberbullying. https://www.pewresearch.org/internet/2018/09/27/a-majority-of-teens-have-experienced-some-form-of-cyberbullying/.

  • Pew Research Center (2021) The State of Online Harassment. https://www.pewresearch.org/internet/2021/01/13/the-state-of-online-harassment/

  • Sharma D, Kishore J, Sharma N, Duggal M (2017) Aggression in schools: cyberbullying and gender issues. Asian J Psychiatr 29:142–145

    Article  Google Scholar 

  • Uysal AK, Gunal S (2014) The impact of preprocessing on text classification. Inf Process Manage 50(1):104–112

    Article  Google Scholar 

  • Zhang J, Otomo T, Li L, Nakajima S (2019) Cyberbullying detection on twitter using multiple textual features. In: IEEE 10th International Conference on Awareness Science and Technology (iCAST), Japan, pp 1–6

  • Zhang X, Tong J, Vishwamitra N, Whittaker E, Mazer J, Kowalski R, Hu H, Luo F, Macbeth J, Dillon E (2016) Cyberbullying detection with a pronunciation based convolutional neural network. IEEE Int Conf Mach Learn Appl, pp 740–745

  • Zhao Z, Gao M, Luo F, Zhang Y, Xiong Q (2020) LSHWE: improving similarity-based word embedding with locality sensitive hashing for cyberbullying detection. Int Joint Conf Neural Net (IJCNN)

Download references

Author information

Authors and Affiliations

Authors

Contributions

Both the authors wrote the manuscript, prepared tables and figures, and reviewed the manuscript.

Corresponding author

Correspondence to Siddhartha Banerjee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Giri, S., Banerjee, S. Performance analysis of annotation detection techniques for cyber-bullying messages using word-embedded deep neural networks. Soc. Netw. Anal. Min. 13, 23 (2023). https://doi.org/10.1007/s13278-022-01023-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-022-01023-2

Keywords

Navigation