Performance analysis of annotation detection techniques for cyber-bullying messages using word-embedded deep neural networks

Giri, Surajit; Banerjee, Siddhartha

doi:10.1007/s13278-022-01023-2

Original Article
Published: 14 January 2023

Volume 13, article number 23, (2023)
Social Network Analysis and Mining

Abstract

In recent times, online harassment through cyber-bullying has increased significantly with the growth in social media users. Cyber-bullying is the harassment of users through electronic messages. Many researchers have approached this problem using natural language processing, and most of them only detect whether or not a message is bullying. In this paper, multiple deep learning models are introduced to detect not only bullying messages but also the annotation of cyber-bullying. Annotation detection assigns each message a description of the cyber-bullying category to which it belongs. Its advantage is that users can be warned, with an alert carrying the appropriate annotation, when they send or post a message on social media. If this feature were integrated into popular social networking platforms such as Facebook, Twitter, and WhatsApp, it could act as an additional filter that alerts users that they are about to post or send a bullying message, and of which type. Social media messages are unstructured: they include text, URL links, emojis, abbreviations, and so on. Most previous work detects bullying messages by considering only the important words in the text, neglecting other attributes of the message such as URL links, emojis, and abbreviations. In this paper, an advanced pre-processing technique is proposed that takes into account some of these attributes, such as URLs, abbreviations, numbers, and emojis, to detect bullying messages. Six models, i.e., three deep learning models each combined with two different word-embedding models, are employed for annotation detection. The performance of each of these six models is measured twice: once with traditional pre-processing and once with the proposed advanced pre-processing. The experimental results show that the advanced pre-processing performs better for all six models.
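The contrast between the two pre-processing styles can be illustrated with a minimal sketch. The abstract does not state how URLs, abbreviations, numbers, and emojis are actually handled, so everything below is an assumption made for illustration: the placeholder tokens (`<url>`, `<emoji>`, `<number>`), the small `ABBREVIATIONS` table, the rough emoji ranges, and both function names are hypothetical and are not taken from the paper.

```python
import re
from typing import List

# Hypothetical abbreviation table; the paper's actual list is not given in the abstract.
ABBREVIATIONS = {"u": "you", "r": "are", "idk": "i do not know", "gr8": "great"}

# Matches http(s) links and bare www. links.
URL_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)
# Rough emoji code-point ranges (an assumption; not exhaustive).
EMOJI_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")


def traditional_preprocess(text: str) -> List[str]:
    """Baseline sketch: lowercase the text and keep alphabetic runs only."""
    return re.findall(r"[a-z]+", text.lower())


def advanced_preprocess(text: str) -> List[str]:
    """Sketch that keeps URLs, emojis, and numbers as placeholder tokens
    and expands known abbreviations instead of discarding them."""
    text = URL_RE.sub(" <url> ", text)
    text = EMOJI_RE.sub(" <emoji> ", text)
    tokens = re.findall(r"<url>|<emoji>|[a-z0-9']+", text.lower())
    out: List[str] = []
    for tok in tokens:
        if tok in ABBREVIATIONS:
            # Expand the abbreviation into its constituent words.
            out.extend(ABBREVIATIONS[tok].split())
        elif tok.isdigit():
            out.append("<number>")
        else:
            out.append(tok)
    return out


if __name__ == "__main__":
    message = "U r gr8 😀 see http://x.co/a 2 times"
    print(traditional_preprocess(message))
    print(advanced_preprocess(message))
```

In this sketch the baseline splits the URL into meaningless fragments and silently drops the emoji and the number, whereas the second function preserves each of them as a single token that a word-embedding layer could, in principle, learn a vector for.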


Author information

Authors and Affiliations

Department of Computer Science, Ramakrishna Mission Residential College (Autonomous), Narendrapur, Kolkata, India
Surajit Giri & Siddhartha Banerjee

Contributions

Both authors wrote the manuscript, prepared the tables and figures, and reviewed the manuscript.

Corresponding author

Correspondence to Siddhartha Banerjee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Giri, S., Banerjee, S. Performance analysis of annotation detection techniques for cyber-bullying messages using word-embedded deep neural networks. Soc. Netw. Anal. Min. 13, 23 (2023). https://doi.org/10.1007/s13278-022-01023-2

Received: 30 July 2022
Revised: 22 December 2022
Accepted: 31 December 2022
Published: 14 January 2023
DOI: https://doi.org/10.1007/s13278-022-01023-2
