Bi-lingual Intent Classification of Twitter Posts: A Roadmap

Adekotujo, Akinlolu Solomon; Lee, JooYoung; Enikuomehin, Ayokunle Oluwatoyin; Mazzara, Manuel; Aribisala, Segun Benjamin

doi:10.1007/978-3-030-14687-0_1

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 925)

Included in the following conference series:

International Conference in Software Engineering for Defence Applications

Abstract

A core advantage of social media platforms is the freedom they give users to express opinions and share information as they see fit on the subject under discussion. Advances in text analytics allow researchers to classify natural language text, which is produced at a rate of millions of posts per minute, into well-defined categories such as “hate” or “radicalized” content, which provide further insight into the intent of the sender. This analysis is important for social media intelligence and information security. Commercial intent classification has received considerable research attention. However, social intent classification of hate and radicalized posts has received little. The focus of this study is to develop a roadmap for a model for automatic bilingual intent classification of hate speech. This empirical model will use bi-gram words for intent classification. Feature extraction will include expected cross entropy, while topic modeling will use a supervised, context-based n-gram approach. Classification will use an ensemble-based approach that combines Naïve Bayes and Support Vector Machine classifiers. This study will also discuss the differences between fake news identification, stance identification and intent identification. We anticipate that the proposed roadmap, if implemented, will be useful for classifying intent as it relates to hate speech in bilingual Twitter posts. The proposed model has the potential to improve intent classification, which could be useful in hate speech detection and could thereby help avert social or security problems.
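
The abstract names expected cross entropy as a feature-extraction criterion over bi-gram words but does not give its formula or implementation. As a minimal sketch, assuming the definition commonly used in text categorization, ECE(t) = P(t) · Σ_c P(c|t) · log(P(c|t) / P(c)), with probabilities estimated as document-level relative frequencies, bi-grams could be scored as follows. The function names, the whitespace tokenization, and the toy posts and labels are all hypothetical and are not taken from the chapter.

```python
import math
from collections import Counter, defaultdict


def bigrams(text):
    """Lowercase, split on whitespace, and return adjacent word pairs."""
    tokens = text.lower().split()
    return list(zip(tokens, tokens[1:]))


def expected_cross_entropy(docs, labels):
    """Score each bi-gram t by P(t) * sum_c P(c|t) * log(P(c|t) / P(c)).

    Assumption: all probabilities are document-level relative frequencies.
    """
    n_docs = len(docs)
    class_counts = Counter(labels)
    term_docs = Counter()                    # number of documents containing t
    term_class_docs = defaultdict(Counter)   # same count, split by class
    for text, label in zip(docs, labels):
        for t in set(bigrams(text)):         # count each bi-gram once per document
            term_docs[t] += 1
            term_class_docs[t][label] += 1

    scores = {}
    for t, n_t in term_docs.items():
        p_t = n_t / n_docs
        total = 0.0
        # Classes with P(c|t) = 0 contribute nothing, so they are skipped.
        for c, n_tc in term_class_docs[t].items():
            p_c_given_t = n_tc / n_t
            p_c = class_counts[c] / n_docs
            total += p_c_given_t * math.log(p_c_given_t / p_c)
        scores[t] = p_t * total
    return scores


# Hypothetical toy corpus, for illustration only.
docs = [
    "we must attack them",
    "we must help them",
    "attack them now",
    "help them now",
]
labels = ["hate", "neutral", "hate", "neutral"]

scores = expected_cross_entropy(docs, labels)
for term, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(" ".join(term), round(score, 4))
```

In this toy corpus, bi-grams that appear in only one class rank highest, while bi-grams split evenly across both classes score zero. The top-ranked bi-grams could then serve as input features for a classifier such as the Naïve Bayes and Support Vector Machine ensemble the abstract proposes.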

Author information

Authors and Affiliations

Department of Computer Science, Lagos State University, Lagos, Nigeria
Akinlolu Solomon Adekotujo, Ayokunle Oluwatoyin Enikuomehin & Segun Benjamin Aribisala
Innopolis University, Innopolis, Russia
Akinlolu Solomon Adekotujo, JooYoung Lee & Manuel Mazzara
Computer, Information and Management Studies Department, The Administrative Staff College of Nigeria, Badagry, Nigeria
Akinlolu Solomon Adekotujo

Corresponding author

Correspondence to Akinlolu Solomon Adekotujo.

Editor information

Editors and Affiliations

University of Bologna, Bologna, Italy
Paolo Ciancarini
Innopolis University, Innopolis, Russia
Manuel Mazzara
Innopolis University, Innopolis, Russia
Angelo Messina
Innopolis University, Innopolis, Russia
Alberto Sillitti
Innopolis University, Innopolis, Russia
Giancarlo Succi

About this paper

Cite this paper

Adekotujo, A.S., Lee, J., Enikuomehin, A.O., Mazzara, M., Aribisala, S.B. (2020). Bi-lingual Intent Classification of Twitter Posts: A Roadmap. In: Ciancarini, P., Mazzara, M., Messina, A., Sillitti, A., Succi, G. (eds) Proceedings of 6th International Conference in Software Engineering for Defence Applications. SEDA 2018. Advances in Intelligent Systems and Computing, vol 925. Springer, Cham. https://doi.org/10.1007/978-3-030-14687-0_1

DOI: https://doi.org/10.1007/978-3-030-14687-0_1
Published: 19 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14686-3
Online ISBN: 978-3-030-14687-0
eBook Packages: Intelligent Technologies and Robotics (R0)
