Estimating Aggressiveness of Russian Texts by Means of Machine Learning

Levonevskiy, Dmitriy; Malov, Dmitrii; Vatamaniuk, Irina

doi:10.1007/978-3-030-26061-3_28

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11658))

Included in the following conference series:

International Conference on Speech and Computer

1268 Accesses
2 Citations

Abstract

This paper considers emotional assessment of texts in Russian using machine learning on the example of aggression detection. It summarizes the related work, methods, models and datasets, describes actual problems, proposes a text processing pipeline and a software system for training neural networks on heterogeneous datasets. The experiments show that neural networks trained on the annotated corpora both in Russian and English, allow to determine whether a text item in Russian contains an aggressive message. Authors thoroughly compare different assessment methods, particularly corpus-based approaches, machine learning solutions and hybrid variants. Results, obtained here, can be used to estimate the aggressiveness probability, for example, to rank messages for subsequent manual verification. They also enable feasibility studies on the possibilities of detecting a particular type of emotion in a text using corpora in other languages. The paper highlights further research directions, where different Python toolkits (NLTK, Keras) could be used for better model performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Identification and Classification of Textual Aggression in Social Media: Resource Creation and Evaluation

Aggressive Bangla Text Detection Using Machine Learning and Deep Learning Algorithms

Using Cognitive Learning Method to Analyze Aggression in Social Media Text

References

Kocharov, D.A., Menshikova, A.P.: Detection of prominent words in Russian texts using linguistic features. SPIIRAS Proc. 6, 216–236 (2017)
Article Google Scholar
Glazkova, A.V.: An approach to text classification based on age groups of addressees. SPIIRAS Proc. 3, 51–69 (2017)
Article Google Scholar
Vorobiev, V.I., Evnevich, E.L., Levonevskiy, D.K., Fatkieva, R.R., Fedorchenko, L.N.: A study and selection of cryptographic standards on the basis of text mining. SPIIRAS Proc. 5, 69–87 (2016)
Article Google Scholar
Ventirozos, F.K., Varlamis, I., Tsatsaronis, G.: Detecting aggressive behavior in discussion threads using text mining. In: Gelbukh, A. (ed.) CICLing 2017. LNCS, vol. 10762, pp. 420–431. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-77116-8_31
Chapter Google Scholar
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093–1113 (2014)
Article Google Scholar
Chatzakou, D., Kourtellis, N., Blackburn, J., De Cristofaro, E., Stringhini, G., Vakali, A..: Mean birds: detecting aggression and bullying on twitter. In Proceedings of the 2017 ACM on Web Science Conference, pp. 13–22. ACM (2017)
Google Scholar
Van Hee, C., et al.: Automatic detection of cyberbullying in social media text. PLoS One 13(10), e0203794 (2018)
Article Google Scholar
Tommasel, A., Rodriguez, J.M., Godoy, D.: Textual aggression detection through deep learning. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, TRAC-2018, pp. 177–187 (2018)
Google Scholar
Golem, V., Karan, M., Šnajder, J.: Combining shallow and deep learning for aggressive text detection. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, TRAC-2018, pp. 188–198 (2018)
Google Scholar
Escalante, H.J., Villatoro-Tello, E., Garza, S.E., López-Monroy, A.P., Montes-y-Gómez, M., Villaseñor-Pineda, L.: Early detection of deception and aggressiveness using profile-based representations. Expert Syst. Appl. 89, 99–111 (2017)
Article Google Scholar
Serrano-Guerrero, J., Olivas, J.A., Romero, F.P., Herrera-Viedma, E.: Sentiment analysis: a review and comparative analysis of web services. Inf. Sci. 311, 18–38 (2015)
Article Google Scholar
Mäntylä, M.V., Graziotin, D., Kuutila, M.: The evolution of sentiment analysis—a review of research topics, venues, and top cited papers. Comput. Sci. Rev. 27, 16–32 (2018)
Article Google Scholar
Jo, H., Kim, S.M., Ryu, J.: What we really want to find by sentiment analysis: the relationship between computational models and psychological state. arXiv preprint arXiv:1704.03407 (2017)
Smirnov, I.V., SHelmanov, A.O., Kuznecova, E.S., Hramoin, I.V.: Semantiko-sintaksicheskij analiz estestvennykh yazykov. CHast’ II. Metod semantiko-sintaksicheskogo analiza tekstov (Semantic-syntactic analysis of natural languages. Part II. Method of semantic-syntactic analysis of texts). Iskusstvennyj intellekt i prinyatie reshenij, vol. 1, pp. 11–24. ISA RAS, Moscow (2014)
Google Scholar
Plutchik, R.: A general psychoevolutionary theory of emotion. In: Theories of Emotion, pp. 3–33. Academic Press (1980)
Google Scholar
Mejova, Y., Srinivasan, P.: Exploring feature definition and selection for sentiment classifiers. In: Fifth International AAAI Conference on Weblogs and Social Media (2011)
Google Scholar
Reyes, A., Rosso, P.: Making objective decisions from subjective data: detecting irony in customer reviews. Decis. Support Syst. 53(4), 754–760 (2012)
Article Google Scholar
Bostan, L.A.M., Klinger, R.: An analysis of annotated corpora for emotion classification in text. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 2104–2119 (2018)
Google Scholar
Rubtsova, Y.: Constricting a corpus for sentiment classification training. Softw. Syst. 1(109), 72–79 (2015)
Article Google Scholar
Ekman, P.: An argument for basic emotions. Cogn. Emot. 6(3–4), 169–200 (1992)
Article Google Scholar
Levonevskii, D., SHumskaya, O., Velichko, Uzdyaev, M., Malov, D.: Methods for determination of psychophysiological condition of user within smart environment based on complex analysis of heterogeneous data. Paper presented at the 14th International Conference on Electromechanics and Robotics “Zavalishin’s Readings”, ER(ZR)-2019 (2019)
Google Scholar
Sentiment Analysis in Text. https://data.world/crowdflower/sentiment-analysis-in-text. Accessed 15 Feb 2019
Emotion, Sentiment, and Stance Labeled Data. http://saifmohammad.com/WebPages/SentimentEmotionLabeledData.html. Accessed 21 Jan 2019
Buechel, S., Hahn, U.: EMOBANK: studying the impact of annotation perspective and representation format on dimensional emotion analysis. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, vol. 2, pp. 578–585 (2017)
Google Scholar
Risch, J., Krestel, R.: Aggression identification using deep learning and data augmentation. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (co-located with COLING), pp. 150–158 (2018)
Google Scholar
Yussupova, N., Bogdanova, D., Boyko, M.: Applying of sentiment analysis for texts in Russian based on machine learning approach. In: IMMM 2012: The Second International Conference on Advances in Information Mining and Management, pp. 8–14 (2012)
Google Scholar
Neidenthal, P.M., Kranth-Gruber, S., Ric, F.: Psychology of Emotions: Interpersonal, Experiential, and Cognitive Approach. Psychology Press, New York (2006)
Google Scholar

Download references

Acknowledgment

This research is supported by the Russian Foundation for Basic Research (project No. 18-29-22061_MK).

Author information

Authors and Affiliations

St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), 14th Line, 39, 199178, St. Petersburg, Russia
Dmitriy Levonevskiy, Dmitrii Malov & Irina Vatamaniuk

Authors

Dmitriy Levonevskiy
View author publications
You can also search for this author in PubMed Google Scholar
Dmitrii Malov
View author publications
You can also search for this author in PubMed Google Scholar
Irina Vatamaniuk
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dmitriy Levonevskiy .

Editor information

Editors and Affiliations

Utrecht University, Utrecht, The Netherlands
Albert Ali Salah
St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences, St. Petersburg, Russia
Alexey Karpov
Moscow State Linguistic University, Moscow, Russia
Rodmonga Potapova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Levonevskiy, D., Malov, D., Vatamaniuk, I. (2019). Estimating Aggressiveness of Russian Texts by Means of Machine Learning. In: Salah, A., Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2019. Lecture Notes in Computer Science(), vol 11658. Springer, Cham. https://doi.org/10.1007/978-3-030-26061-3_28

Download citation

DOI: https://doi.org/10.1007/978-3-030-26061-3_28
Published: 24 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26060-6
Online ISBN: 978-3-030-26061-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Estimating Aggressiveness of Russian Texts by Means of Machine Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Identification and Classification of Textual Aggression in Social Media: Resource Creation and Evaluation

Aggressive Bangla Text Detection Using Machine Learning and Deep Learning Algorithms

Using Cognitive Learning Method to Analyze Aggression in Social Media Text

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Estimating Aggressiveness of Russian Texts by Means of Machine Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Identification and Classification of Textual Aggression in Social Media: Resource Creation and Evaluation

Aggressive Bangla Text Detection Using Machine Learning and Deep Learning Algorithms

Using Cognitive Learning Method to Analyze Aggression in Social Media Text

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation