Abstract
In recent times, online harassment due to cyber-bullying is significantly increased with the growth of social media users. Cyber-bullying is a technique to harass users using electronic messages. Many researchers attack this problem using natural language processing. Most of them detect whether a message is a bully or not. In this paper, multiple deep learning models are introduced to detect not only bullying messages but also the annotation of cyber-bullying. Annotation detection of cyber-bullying assigns a proper description in which category a message belongs. The advantage of annotation detection is to warn the user by giving an alert message with proper annotation when the user sends or posts a message on social media. If this feature is combined with popular social network sites like Facebook, Twitter, WhatsApp, etc., this can be an additional filter to alert the user that they are going to post or send a bullied message of which type. Social media messages are unstructured as it includes text, URL link, emojis, abbreviations, etc. Most of the previous works are conducted to detect bullying messages only considering important words in the text, neglecting the other attributes in the message like URL links, emojis, and abbreviations. In this paper, an advanced pre-processing technique is proposed by considering some of the attributes in the messages like URL, abbreviation, number, emojis, etc., to detect bullying messages. In this work, six models, i.e., three deep learning models combined with two different word-embedding models have been employed for annotation detection. The performances of each of these six models are measured twice, by employing traditional pre-processing, and proposed advanced pre-processing. The experimental results show that the advanced pre-processing works better in the case of all six models.
Similar content being viewed by others
References
Agrawal S, Awekar A (2018) Deep learning for detecting cyberbullying across multiple social media platforms. In: European conference on information retrieval, Springer, pp 141–153
Al-Ajlan MA, Ykhlef M (2018) Optimized Twitter Cyberbullying Detection based on Deep Learning. In: 21st Saudi computer society national computer conference (NCC)
Arseneault L, Shakoor S (2009) Bullying victimization in youths and mental health problems: 'Much ado about nothing'?. Psychol Med 40
Banerjee V, Telavane J, Gaikwad P, Vartak P (2019) Detection of cyberbullying using deep neural network. In: 5th International conference on advanced computing and communication systems (ICACCS)
Bradley AE (1997) The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159
Bruwaene DV, Huanget Q, Inkpen D (2020) A multi-platform dataset for detecting cyberbullying in social media. Lang Resour Eval 54(4):851–874
Camacho-Collados J, Pileavar MT (2018) On the role of text preprocessing in neural network architectures: an evaluation study on text categorization and sentiment analysis. In: Proceedings of the 2018 EMNLP workshop blackbox NLP: analyzing and interpreting neural networks for NLP, pp 40–46
Campbell M, Bauman S (2018) Cyberbullying: definition, consequences, prevalence. reducing cyberbullying in schools: international evidence-based best practices, Elsevier, London, UK, pp. 3–16.
Chatzakou D, Leontiadis I, Blackburn J, Cristofaro ED, Stringhini G, Vakali A, Kourtellis N (2019) Detecting cyberbullying and cyberaggression in social media. ACM Trans Web (TWEB) 13(3):1–51
Cheng L, Li J, Silva YN, Hall DL, Liu H (2019) PIBully: personalized cyberbullying detection with peer influence. In: Proceeding of twenty-eighth international jt. conference artificial intelligence, pp 5829–5835
Cyberbullying data set (2020) https://data.mendeley.com/datasets/jf4pzyvnpj/1#__sid=js0
Dadvar M, Eckert K (2020) cyberbullying detection in social networks using deep learning based models. Springer Nature Switzerland AG 2020:245–255
Ghosh S, Chaki A, Kudeshia A (2021) Cyberbully detection using 1D-CNN and LSTM. In: Proceedings of international conference on communication, circuits, and systems, Springer Singapore, pp 295–301
Helmy AA, Omar YMK, Hodhod R (2018) An innovative word encoding method for text classification using convolutional neural network. In: 14th International computer engineering conference (ICENCO2018), pp 42–47
Iwendi C, Srivastava G, Khan S, Maddikunta PK (2020) Cyberbullying detection solutions based on deep learning architectures. Springer-Verlag GmbH Germany, part of Springer Nature
Lu N, Wu G, Zhang Z, Zheng Y, Ren Y, ChooK R (2020) Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts. Concurrency Comput Pract Exp
Muneer A, Fati SM (2020) A comparative analysis of machine learning techniques for cyberbullying detection on Twitter. Future Internet 12(11):187
Pennington J, Socher R, Manning CD (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP).
Pew Research Center (2018) A majority of teens have experienced some form of cyberbullying. https://www.pewresearch.org/internet/2018/09/27/a-majority-of-teens-have-experienced-some-form-of-cyberbullying/.
Pew Research Center (2021) The State of Online Harassment. https://www.pewresearch.org/internet/2021/01/13/the-state-of-online-harassment/
Sharma D, Kishore J, Sharma N, Duggal M (2017) Aggression in schools: cyberbullying and gender issues. Asian J Psychiatr 29:142–145
Uysal AK, Gunal S (2014) The impact of preprocessing on text classification. Inf Process Manage 50(1):104–112
Zhang J, Otomo T, Li L, Nakajima S (2019) Cyberbullying detection on twitter using multiple textual features. In: IEEE 10th International Conference on Awareness Science and Technology (iCAST), Japan, pp 1–6
Zhang X, Tong J, Vishwamitra N, Whittaker E, Mazer J, Kowalski R, Hu H, Luo F, Macbeth J, Dillon E (2016) Cyberbullying detection with a pronunciation based convolutional neural network. IEEE Int Conf Mach Learn Appl, pp 740–745
Zhao Z, Gao M, Luo F, Zhang Y, Xiong Q (2020) LSHWE: improving similarity-based word embedding with locality sensitive hashing for cyberbullying detection. Int Joint Conf Neural Net (IJCNN)
Author information
Authors and Affiliations
Contributions
Both the authors wrote the manuscript, prepared tables and figures, and reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Giri, S., Banerjee, S. Performance analysis of annotation detection techniques for cyber-bullying messages using word-embedded deep neural networks. Soc. Netw. Anal. Min. 13, 23 (2023). https://doi.org/10.1007/s13278-022-01023-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-022-01023-2