Automatic Cyberbullying Detection on Twitter Using Bullying Expression Dictionary

Zhang, Jianwei; Otomo, Taiga; Li, Lin; Nakajima, Shinsuke

doi:10.1007/978-3-030-73280-6_25

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 12672)

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

Abstract

Cyberbullying has become a serious problem with the spread of personal computers, smartphones and social networking services (SNS). In this paper, for automatic cyberbullying detection on Twitter, we construct a bullying expression dictionary, which registers bullying words together with a degree indicating how strongly each word is related to bullying. The words registered in the dictionary are those that appear in the collected bullying-related tweets, and the bullying degree attached to each word is calculated using SO-PMI (semantic orientation from pointwise mutual information). We also construct models that automatically classify tweets as bullying or non-bullying by extracting multiple features, including features based on the bullying expression dictionary, and combining them with multiple machine learning algorithms. We evaluate the classification performance of the constructed models. The experimental results show that the bullying expression dictionary contributes to cyberbullying detection for most of the machine learning algorithms and that the best model achieves an evaluation score of over 90%.
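
As a rough illustration of how a bullying degree could be derived with SO-PMI, the sketch below scores each word by how much more strongly it co-occurs with bullying seed words than with neutral seed words. It is a minimal sketch under stated assumptions and not the authors' implementation: the seed sets, the tweet-level co-occurrence window, the additive smoothing, and the names `so_pmi_scores` and `tweet_bullying_score` are all hypothetical, and the paper may use a modified SO-PMI variant.

```python
import math
from collections import Counter


def so_pmi_scores(tweets, bully_seeds, neutral_seeds, smoothing=1.0):
    """Return {word: SO-PMI score} from tweet-level co-occurrence counts.

    Assumption: a word "co-occurs" with a seed set when both appear in the
    same tweet. Seed sets and smoothing are illustrative, not from the paper.
    """
    bully_seeds, neutral_seeds = set(bully_seeds), set(neutral_seeds)
    with_bully = Counter()    # tweets containing the word and a bullying seed
    with_neutral = Counter()  # tweets containing the word and a neutral seed
    n_bully = n_neutral = 0   # tweets containing any seed of each kind
    vocab = set()

    for tokens in tweets:
        words = set(tokens)
        vocab |= words
        has_bully = bool(words & bully_seeds)
        has_neutral = bool(words & neutral_seeds)
        n_bully += has_bully
        n_neutral += has_neutral
        for w in words:
            if has_bully:
                with_bully[w] += 1
            if has_neutral:
                with_neutral[w] += 1

    scores = {}
    for w in vocab:
        # PMI(w, bully) - PMI(w, neutral). In the unsmoothed formula the
        # P(w) terms cancel, leaving a ratio of co-occurrence counts.
        num = (with_bully[w] + smoothing) * (n_neutral + smoothing)
        den = (with_neutral[w] + smoothing) * (n_bully + smoothing)
        scores[w] = math.log2(num / den)
    return scores


def tweet_bullying_score(tokens, scores):
    """Hypothetical dictionary feature: sum of word scores in one tweet."""
    return sum(scores.get(w, 0.0) for w in tokens)


# Toy, already-tokenized tweets (made up for illustration only).
tweets = [
    ["you", "idiot", "loser"],
    ["idiot", "loser", "go"],
    ["nice", "day", "friend"],
    ["friend", "nice", "go"],
]
scores = so_pmi_scores(tweets, bully_seeds={"idiot"}, neutral_seeds={"nice"})
print(sorted(scores.items(), key=lambda kv: -kv[1]))
```

Words that appear mostly alongside bullying seeds receive positive scores, words that appear mostly alongside neutral seeds receive negative scores, and words seen equally with both land near zero. A per-tweet aggregate such as `tweet_bullying_score` could then serve as one feature among several for a classifier.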

Acknowledgments

This research was supported by JSPS 19K12230.

Author information

Authors and Affiliations

Iwate University, Morioka, Japan
Jianwei Zhang & Taiga Otomo
Wuhan University of Technology, Wuhan, China
Lin Li
Kyoto Sangyo University, Kyoto, Japan
Shinsuke Nakajima

Corresponding author

Correspondence to Jianwei Zhang.

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand
Suphamit Chittayasothorn
Nanyang Technological University, Singapore, Singapore
Dusit Niyato
Wrocław University of Science and Technology, Wrocław, Poland
Bogdan Trawiński

About this paper

Cite this paper

Zhang, J., Otomo, T., Li, L., Nakajima, S. (2021). Automatic Cyberbullying Detection on Twitter Using Bullying Expression Dictionary. In: Nguyen, N.T., Chittayasothorn, S., Niyato, D., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2021. Lecture Notes in Computer Science, vol 12672. Springer, Cham. https://doi.org/10.1007/978-3-030-73280-6_25

DOI: https://doi.org/10.1007/978-3-030-73280-6_25
Published: 05 April 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73279-0
Online ISBN: 978-3-030-73280-6
eBook Packages: Computer Science; Computer Science (R0)
