Multilingual Sentiment Analysis on Twitter Data Towards Enhanced Policy Making

Manias, George; Kiourtis, Athanasios; Mavrogiorgou, Argyro; Kyriazis, Dimosthenis

doi:10.1007/978-3-031-08337-2_27

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 647))

Included in the following conference series:

IFIP International Conference on Artificial Intelligence Applications and Innovations

1016 Accesses
3 Citations

Abstract

The great expansion in the usage and popularity of social media platforms during the last decades has led to the production of an enormous real-time volume of social texts and posts, including tweets, that are being produced by users. These collections of social data can be potentially useful and provide useful insights to policymakers to adjust new user-centric policies and regulations. However, extracting and analyzing valuable information and knowledge out of these data is a challenging task as concerns the high multilingualism that describes these data. Thus, both the research and the business communities focus on the utilization of multilingual approaches and solutions to enhance the policy making procedures. To investigate a portion of these challenges this research work performs a comparative analysis of two multilingual sentiment analysis approaches. In this context, three multilingual deep learning classifiers and a zero-shot classification approach were utilized and compared. Their comparison has unveiled insightful outcomes and has a two-fold interpretation. Multilingual deep learning classifiers that have pre-trained and evaluated in monolingual data achieve high performances and transfer inference when applied afterwards in multilingual data. However, the zero-shot classification approach fails to achieve high accuracies in monolingual data as in contrary to when applied on multilingual data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Global Social Media Stats | DataReportal – Global Digital Insights. https://datareportal.com/social-media-users. Accessed 27 Feb 2022
Wiederhold, B.K.: Social media use during social distancing. Cyberpsychol. Behav. Soc. Netw. 23(5), 275–276 (2020)
Article Google Scholar
Alalwan, A.A.: Investigating the impact of social media advertising features on customer purchase intention. Int. J. Inf. Manag. 42, 65–77 (2018)
Article Google Scholar
Social media marketing - statistics & facts. https://www.statista.com/topics/1538/social-media-marketing/#dossierKeyfigures. Accessed 26 Feb 2022
Social Media Statistics For 2022 [+Infographic] | Statusbrew. https://statusbrew.com/insights/social-media-statistics. Accessed 25 Feb 2022
The Latest Twitter Stats: Everything You Need to Know | DataReportal – Global Digital Insights. https://datareportal.com/essential-twitter-stats. Accessed 26 Feb 2022
Păvăloaia, V.D., et al.: Opinion mining on social media data: sentiment analysis of user preferences. Sustainability 11(16), 4459 (2019)
Article Google Scholar
Tao, D., Yang, P., Feng, H.: Utilization of text mining as a big da-ta analysis tool for food science and nutrition. Compr. Rev. Food Sci. Food Safety 19(2), 875–894 (2020)
Article Google Scholar
Sánchez-Núñez, P., et al.: Opinion mining, sentiment analysis and emotion understanding in advertising: a bibliometric analysis. IEEE Access 8, 134563–134576 (2020)
Article Google Scholar
Manias, G., et al.: An evaluation of neural machine translation and pre-trained word embeddings in multilingual neural sentiment analysis. In: 2020 IEEE International Conference on PIC, pp. 274–283. IEEE (2020)
Google Scholar
Sadia, A., Khan, F., Bashir, F.: An overview of lexicon-based approach for sentiment analysis. In: IEEC 2018, pp. 1–6 (2018)
Google Scholar
Conneau, A., et al.: unsupervised cross-lingual representation learning at scale. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 8440–8451. Association for Computational Linguistics (2020)
Google Scholar
Aydoğan, M., Karci, A.: Improving the accuracy using pre-trained word embeddings on deep neural networks for Turkish text classification. Physica A 541, 123288 (2020)
Article Google Scholar
Schwenk, H., Li, X.: A corpus for multilingual document classification in eight languages. arXiv preprint arXiv:1805.09821 (2018)
Eriguchi, A., et al.: Zero-shot cross-lingual classification using multilingual neural machine translation. arXiv preprint arXiv:1809.04686 (2018)
Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., Liu, C.: A survey on deep transfer learning. In: Kůrková, V., Manolopoulos, Y., Hammer, B., Iliadis, L., Maglogiannis, I. (eds.) ICANN 2018. LNCS, vol. 11141, pp. 270–279. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01424-7_27
Xian, Y., et al.: Zero-shot learning—a comprehensive evaluation of the good, the bad and the ugly. IEEE Trans. Pattern Anal. Mach. Intell. 41(9), 2251–2265 (2018)
Article Google Scholar
Artetxe, M., Schwenk, H.: Massively multilingual sentence embeddings for zero-shot cross-lingual transfer and beyond. TACL 7, 597–610 (2019)
Article Google Scholar
Sarkar, A., Reddy, S., Iyengar, R.S.: Zero-shot multilingual sentiment analysis using hierarchical attentive network and BERT. In: Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval, pp. 49–56 (2019)
Google Scholar
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Conference of the NAACL-HLT 2019, pp. 4171–4186 (2019)
Google Scholar
Barbieri, F., Anke, L.E., Camacho-Collados, J.: XLM-T: A multilingual language model toolkit for twitter. arXiv preprint arXiv:2104.12250 (2021)
Gonzalez, J.A., Hurtado, L.F., Pla, F.: TWilBert: pre-trained deep bidirectional transformers for Spanish Twitter. Neurocomputing 426, 58–69 (2021)
Article Google Scholar
Pota, M., Ventura, M., Catelli, R., Esposito, M.: An effective BERT-based pipeline for Twitter sentiment analysis: a case study in Italian. Sensors 21(1), 133 (2021)
Article Google Scholar
Polignano, M., et al.: Alberto: Italian BERT language understanding model for NLP challenging tasks based on tweets. In: 6th Italian Conference on Computational Linguistics, CLiC-it 2019, vol. 2481, pp. 1–6. CEUR (2019)
Google Scholar
Wehrmann, J., Becker, W.E., Barros, R.C.: A multi-task neural network for multilingual sentiment classification and language detection on twitter. In: Proceedings of the 33rd Annual ACM Symposium on Applied Computing, pp. 1805–1812 (2018)
Google Scholar
Yang, Y., et al.: Multilingual universal sentence encoder for semantic retrieval. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 87–94 (2020)
Google Scholar
Janocha, K., Czarnecki, W.M.: On Loss functions for deep neural networks in classification. Schedae Informaticae 25 (2017)
Google Scholar
Afridi, M.J., Ross, A., Shapiro, E.M.: On automated source selection for transfer learning in convolutional neural networks. Pattern Recognit. 73, 65–75 (2018)
Article Google Scholar
Manaswi, N.K.: RNN and LSTM. In: Deep Learning with Applications Using Python, pp. 115–126. Apress, Berkeley (2018)
Google Scholar
Muhammad, P.F., Kusumaningrum, R., Wibowo, A.: Sentiment analysis using Word2vec and long short-term memory (LSTM) for Indonesian hotel reviews. Procedia Comput. Sci. 179, 728–735 (2021)
Article Google Scholar
Kumar, A., Albuquerque, V.H.C.: Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor Indian language. Trans. Asian Low Resour. Lang. Inf. Process. 20(5), 1–13 (2021)
Article Google Scholar
Hugging Face. https://huggingface.co. Accessed 26 Feb 2022
joeddav/xlm-roberta-large-xnli-Hugging Face. https://huggingface.co/joeddav/xlm-roberta-large-xnli. Accessed 02 Mar 2022
Conneau, A., et al.: XNLI: evaluating cross-lingual sentence representations. In: Proceedings of the EMNLP 2018, pp. 2475–2485 (2020)
Google Scholar
Wine Reviews | Kaggle. https://www.kaggle.com/zynicide/wine-reviews/home. Accessed 05 Mar 2022
TensorFlow. https://www.tensorflow.org/. Accessed 27 Feb 2022
Keras: the Python deep learning API. https://keras.io/. Accessed 27 Feb 2022
Google Colaboratory. https://colab.research.google.com/. Accessed 09 Mar 2022
Wine-Searcher. https://www.wine-searcher.com/wine-scores. Accessed 28 Mar 2022
Kyriazis, D., et al.: PolicyCLOUD: analytics as a service facilitating efficient data-driven public policy management. In: Maglogiannis, I., Iliadis, L., Pimenidis, E. (eds.) AIAI 2020. IAICT, vol. 583, pp. 141–150. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49161-1_13
Chapter Google Scholar

Download references

Acknowledgement

The research leading to the results presented in this paper has received funding from the European Union’s funded Project PolicyCLOUD under grant agreement no 870675.

Author information

Authors and Affiliations

University of Piraeus, Piraeus, Greece
George Manias, Athanasios Kiourtis, Argyro Mavrogiorgou & Dimosthenis Kyriazis

Authors

George Manias
View author publications
You can also search for this author in PubMed Google Scholar
Athanasios Kiourtis
View author publications
You can also search for this author in PubMed Google Scholar
Argyro Mavrogiorgou
View author publications
You can also search for this author in PubMed Google Scholar
Dimosthenis Kyriazis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to George Manias .

Editor information

Editors and Affiliations

University of Piraeus, Piraeus, Greece
Ilias Maglogiannis
Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
University of Sunderland, Sunderland, UK
John Macintyre
Universidade do Minho, Guimaraes, Portugal
Paulo Cortez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Manias, G., Kiourtis, A., Mavrogiorgou, A., Kyriazis, D. (2022). Multilingual Sentiment Analysis on Twitter Data Towards Enhanced Policy Making. In: Maglogiannis, I., Iliadis, L., Macintyre, J., Cortez, P. (eds) Artificial Intelligence Applications and Innovations. AIAI 2022. IFIP Advances in Information and Communication Technology, vol 647. Springer, Cham. https://doi.org/10.1007/978-3-031-08337-2_27

Download citation

DOI: https://doi.org/10.1007/978-3-031-08337-2_27
Published: 10 June 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08336-5
Online ISBN: 978-3-031-08337-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Federation for Information Processing (opens in a new tab)