ABSTRACT
Cyberbullying remarks based on social media platforms have seriously endangered the physical and mental health of netizens. In order to purify the network environment and create a healthy atmosphere, it needs to be identified. At present, most of the existing studies only analyze the overall features of the text, while ignoring the semantic representation of words at local locations. In order to mine the richer semantic features of text, this paper proposes a BHF model, which uses the BERT pre-training model to extract global semantic feature information, and then uses the Hierarchical Attention Network (HAN) to calculate the two-dimensional space of words and short sentences. Local semantic feature information is obtained by fusing two layers of semantic feature information to obtain three-dimensional semantic feature representation of words, short sentences and full text. The experimental results show that based on the BHF model, the semantic features of the text can be better extracted on the three types of problems of cyberbullying detection, and the accuracy of cyberbullying remarks detection can be greatly improved.
- [1] Wulczyn E, Thain N, Dixon L. Ex Machina: Personal Attacks Seen at Scale[C]. The Web Conference. International World Wide Web Conferences Steering Committee, 2017, pp. 1391–1399.Google Scholar
- [2] Zhou J, Shiren Y E, Wang H. Text Sentiment Classification Based on Deep Convolutional Neural Network Model[J]. Computer Engineering, 2019.Google Scholar
- [3] Shao L, Y Zhou, University L T. Semantic Rules and RNN Based Sentiment Classification for Online Reviews. 2019.Google Scholar
- [4] Zhou Zhihua, Machine Learning(chinese) [w] Beijing: Tsinghua University Press, 2016, pp. 30–32.Google Scholar
- [5] Yin D, Xue Z, Hong L. Detection of harassment on Web 2. 0[J]. Proceedings of the Content Analysis in the Web Workshop at Www, 2009, pp. 1–7.Google Scholar
- [6] Dinakar K, Reichart R, Lieberman H. Modeling the Detection of Textual Cyberbullying[C]. The Social Mobile Web, Papers from the 2011 ICWSM Workshop, Barcelona, Catalonia, Spain, July 21, 2011. DBLP, 2011.Google Scholar
- [7] Chavan V S, Shylaja S S. Machine learning approach for detection of cyber-aggressive comments by peers on social media network[C]. 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI). IEEE, 2015.Google ScholarCross Ref
- [8] Zhao R, Zhou A, Mao K. Automatic detection of cyberbullying on social networks based on bullying features[C]. ACM. ACM, 2016, pp. 1–6.Google Scholar
- [9] PJC Pérez, Valdez, C. MISAAC:Instant messaging tool for Ciberbullying Detection.Google Scholar
- [10] Wang Kun, Cui Yanpeng, Hu Jianwei. Cyberbullying Detection, Based on the FastText and Word Similarity Schemes[J]. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2020.Google Scholar
- [11] Iwendi C, Srivastava G, Khan S. Cyberbullying detection solutions based on deep learning architectures[J]. Multimedia Systems, 2020.Google Scholar
- [12] Pitsilis G K, Ramampiaro H, Langseth H. Detecting Offensive Language in Tweets Using Deep Learn [DB/OL]. 2019.Google Scholar
- [13] M. Mahat, “Detecting Cyberbullying Across Multiple Social Media Platforms Using Deep Learning,” 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), 2021, pp. 299–301, doi: 10.1109/ICACITE51222.2021.9404736.Google ScholarCross Ref
- [14] Karthik D, Roi R, Henry L. Modeling the detection of textual cyberbullying. In Workshop on The Social Mobile Web, ICWSM, 2020.Google Scholar
- [15] Mohit Chandra, Ashwin Pathak, Eesha Dutta, Paryul Jain, Manish Gupta, Manish Shrivastava, and Ponnurangam Kumaraguru. 2020. AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts. In Proceedings of the 28th International Conference on Computational Linguistics, Barcelona, Spain (Online). International Committee on Computational Linguistics, 2020, pp. 6277–6283.Google Scholar
- [16] Devlin J, Chang M W, Lee K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[J]. 2018.Google Scholar
- [17] Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin, 2017, pp. 6000–6010.Google Scholar
- [18] Nikolaos Pappas, Andrei Popescu-Belis. Multilingual Hierarchical Attention Networks for Document Classification. 8th International Joint Conference on Natural Language Processing, 2017, pp. 1015–1025.Google Scholar
Index Terms
- BHF: BERT-based Hierarchical Attention Fusion Network for Cyberbullying Remarks Detection
Recommendations
Parental mediation, cyberbullying, and cybertrolling
Researchers are concerned with identifying the risk and protective factors associated with adolescents' involvement in cyberharassment. One such factor is parental mediation of children's electronic technology use. Little attention has been given to how ...
Does BERT Pay Attention to Cyberbullying?
SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information RetrievalSocial media have brought threats like cyberbullying, which can lead to stress, anxiety, depression, and in some severe cases, suicide attempts. Detecting cyberbullying can help to warn/ block bullies and provide support to victims. However, very few ...
Prevalence of cyberbullying and predictors of cyberbullying perpetration among Korean adolescents
This study aimed to investigate the prevalence of cyberbullying and factors in cyberbullying perpetration with a national sample of 4000 adolescents selected through multi-stage cluster sampling. The respondents were 2166 boys (54.1%) and 1834 girls (...
Comments