Sexist Hate Speech: Identifying Potential Online Verbal Violence Instances

Santana, Brenda Salenave; Vanin, Aline Aver; Wives, Leandro Krug

doi:10.1007/978-3-030-98305-5_17

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13208))

Included in the following conference series:

International Conference on Computational Processing of the Portuguese Language

752 Accesses

Abstract

Online communication provides space for content dissemination and opinion sharing. However, the limit between opinion and offense might be exceeded, characterizing hate speech. Moreover, its automatic detection is challenging, and approaches focused on the Portuguese language are scarce. This paper proposes an interface between linguistic concepts and computational interventions to support hate speech detection. We applied a Natural Language Processing pipeline involving topic modeling and semantic role labeling, allowing a semi-automatic identification of hate speech. We also discuss how such speech qualifies as a type of verbal violence widespread on social networks to reinforce a sexist stereotype. Finally, we use Twitter data to analyze information that resulted in virtual attacks against a specific person. As an achievement, this work validates the use of linguistic features to annotate data either as hate speech or not. It also proposes using fallacies as a potential additional feature to identify potential intolerant discourses.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Sexism or gender discrimination is prejudice and, sometimes, discrimination based on a person’s gender or sex.
2.
Available in https://github.com/brendasalenave/sexist_hate_speech.
3.
https://developer.twitter.com.
4.
Pejorative reference to a person who is affiliated to the Brazilian Worker’s Party, i.e., Partido dos Trabalhadores.
5.
Available in: https://www.perspectiveapi.com.
6.
There are additional metrics offered for other languages.

References

Almeida, G., Cunha, J.: curso discurso de ódio, tô fora: ferramentas para uma internet cordial (2020, unpublished)
Google Scholar
de Barros, D.L.P.: O discurso intolerante na internet: enunciação e interação. In: Proceedings of XVII CONGRESO INTERNACIONAL ASOCIACIÓN DE LINGÜÍSTICA Y FILOLOGÍA DE AMÉRICA LATINA (ALFAL 2014) (2014)
Google Scholar
Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11 (2017)
Google Scholar
Delobelle, P., Cunha, M., Cano, E.M., Peperkamp, J., Berendt, B.: Computational ad hominem detection. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pp. 203–209 (2019)
Google Scholar
Duran, M.S., Aluísio, S.M.: Propbank-Br: a Brazilian treebank annotated with semantic role labels. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey, pp. 1862–1867. European Language Resources Association (ELRA), May 2012. http://www.lrec-conf.org/proceedings/lrec2012/pdf/272_Paper.pdf
Fillmore, C.J.: Frames and the semantics of understanding. Quaderni di semantica 6(2), 222–254 (1985)
Google Scholar
Fortuna, P., Nunes, S.: A survey on automatic detection of hate speech in text. ACM Comput. Surv. (CSUR) 51(4), 1–30 (2018)
Article Google Scholar
Gallego, E.S., et al.: O ódio como política: a reinvenção das direitas no Brasil, pp. 33–40. Boitempo, São Paulo (2018)
Google Scholar
He, L., Lee, K., Levy, O., Zettlemoyer, L.: Jointly predicting predicates and arguments in neural semantic role labeling. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Melbourne, Australia, pp. 364–369. Association for Computational Linguistics, July 2018. https://doi.org/10.18653/v1/P18-2058. https://www.aclweb.org/anthology/P18-2058
MacAvaney, S., Yao, H.R., Yang, E., Russell, K., Goharian, N., Frieder, O.: Hate speech detection: challenges and solutions. PLoS One 14(8) (2019)
Google Scholar
Mills, S.: Language and Sexism. Cambridge University Press, Cambridge (2008). https://doi.org/10.1017/CBO9780511755033
Book Google Scholar
Nemer, D.: The three types of Whatsapp users getting Brazil’s Jair Bolsonaro elected. The Guardian 25 (2018)
Google Scholar
Oliveira, A.S.M.: Semantic role labeling in portuguese: improving the state of the art with transfer learning and BERT-based models. M.s. thesis, Faculdade de Ciências. Universidade do Porto, Porto, Portugal (2020). https://repositorio-aberto.up.pt/bitstream/10216/130371/2/431435.pdf
Ostrowski, D.A.: Using latent dirichlet allocation for topic modelling in twitter. In: Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015), pp. 493–497, February 2015. https://doi.org/10.1109/ICOSC.2015.7050858
Santana, B.S., Vanin, A.A.: Detecting group beliefs related to 2018’s Brazilian elections in tweets: a combined study on modeling topics and sentiment analysis. In: Proceedings of the Workshop on Digital Humanities and Natural Language Processing (DHandNLP 2020) co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2020) (2020)
Google Scholar
Warner, W., Hirschberg, J.: Detecting hate speech on the world wide web. In: Proceedings of the Second Workshop on Language in Social Media, pp. 19–26. Association for Computational Linguistics (2012)
Google Scholar
Zimmerman, S., Kruschwitz, U., Fox, C.: Improving hate speech detection with deep learning ensembles. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (2018)
Google Scholar

Download references

Acknowledgments

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001.

Author information

Authors and Affiliations

Postgraduate Program in Computing, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
Brenda Salenave Santana & Leandro Krug Wives
Department of Education and Humanities, Federal University of Health Sciences of Porto Alegre, Porto Alegre, Brazil
Aline Aver Vanin

Authors

Brenda Salenave Santana
View author publications
You can also search for this author in PubMed Google Scholar
Aline Aver Vanin
View author publications
You can also search for this author in PubMed Google Scholar
Leandro Krug Wives
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Brenda Salenave Santana .

Editor information

Editors and Affiliations

Universidade de Fortaleza, Fortaleza, Brazil
Vládia Pinheiro
CiTIUS - Universidade de Santiago de Compostela, Santiago de Compostela, Spain
Pablo Gamallo
Universidade Nova de Lisboa, Lisbon, Portugal
Raquel Amaro
University of Sheffield, Sheffield, UK
Carolina Scarton
INESC-ID, Lisbon, Portugal
Fernando Batista
Federal University of São Carlos, São Carlos, Brazil
Diego Silva
University of Lisbon, Lisbon, Portugal
Catarina Magro
Sentimonitor, Porto Alegre, Brazil
Hugo Pinto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Santana, B.S., Vanin, A.A., Wives, L.K. (2022). Sexist Hate Speech: Identifying Potential Online Verbal Violence Instances. In: Pinheiro, V., et al. Computational Processing of the Portuguese Language. PROPOR 2022. Lecture Notes in Computer Science(), vol 13208. Springer, Cham. https://doi.org/10.1007/978-3-030-98305-5_17

Download citation

DOI: https://doi.org/10.1007/978-3-030-98305-5_17
Published: 16 March 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-98304-8
Online ISBN: 978-3-030-98305-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics