Skip to main content

Multilingual Sentiment Analysis on Twitter Data Towards Enhanced Policy Making

  • Conference paper
  • First Online:
Artificial Intelligence Applications and Innovations (AIAI 2022)

Abstract

The great expansion in the usage and popularity of social media platforms during the last decades has led to the production of an enormous real-time volume of social texts and posts, including tweets, that are being produced by users. These collections of social data can be potentially useful and provide useful insights to policymakers to adjust new user-centric policies and regulations. However, extracting and analyzing valuable information and knowledge out of these data is a challenging task as concerns the high multilingualism that describes these data. Thus, both the research and the business communities focus on the utilization of multilingual approaches and solutions to enhance the policy making procedures. To investigate a portion of these challenges this research work performs a comparative analysis of two multilingual sentiment analysis approaches. In this context, three multilingual deep learning classifiers and a zero-shot classification approach were utilized and compared. Their comparison has unveiled insightful outcomes and has a two-fold interpretation. Multilingual deep learning classifiers that have pre-trained and evaluated in monolingual data achieve high performances and transfer inference when applied afterwards in multilingual data. However, the zero-shot classification approach fails to achieve high accuracies in monolingual data as in contrary to when applied on multilingual data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Global Social Media Stats | DataReportal – Global Digital Insights. https://datareportal.com/social-media-users. Accessed 27 Feb 2022

  2. Wiederhold, B.K.: Social media use during social distancing. Cyberpsychol. Behav. Soc. Netw. 23(5), 275–276 (2020)

    Article  Google Scholar 

  3. Alalwan, A.A.: Investigating the impact of social media advertising features on customer purchase intention. Int. J. Inf. Manag. 42, 65–77 (2018)

    Article  Google Scholar 

  4. Social media marketing - statistics & facts. https://www.statista.com/topics/1538/social-media-marketing/#dossierKeyfigures. Accessed 26 Feb 2022

  5. Social Media Statistics For 2022 [+Infographic] | Statusbrew. https://statusbrew.com/insights/social-media-statistics. Accessed 25 Feb 2022

  6. The Latest Twitter Stats: Everything You Need to Know | DataReportal – Global Digital Insights. https://datareportal.com/essential-twitter-stats. Accessed 26 Feb 2022

  7. Păvăloaia, V.D., et al.: Opinion mining on social media data: sentiment analysis of user preferences. Sustainability 11(16), 4459 (2019)

    Article  Google Scholar 

  8. Tao, D., Yang, P., Feng, H.: Utilization of text mining as a big da-ta analysis tool for food science and nutrition. Compr. Rev. Food Sci. Food Safety 19(2), 875–894 (2020)

    Article  Google Scholar 

  9. Sánchez-Núñez, P., et al.: Opinion mining, sentiment analysis and emotion understanding in advertising: a bibliometric analysis. IEEE Access 8, 134563–134576 (2020)

    Article  Google Scholar 

  10. Manias, G., et al.: An evaluation of neural machine translation and pre-trained word embeddings in multilingual neural sentiment analysis. In: 2020 IEEE International Conference on PIC, pp. 274–283. IEEE (2020)

    Google Scholar 

  11. Sadia, A., Khan, F., Bashir, F.: An overview of lexicon-based approach for sentiment analysis. In: IEEC 2018, pp. 1–6 (2018)

    Google Scholar 

  12. Conneau, A., et al.: unsupervised cross-lingual representation learning at scale. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 8440–8451. Association for Computational Linguistics (2020)

    Google Scholar 

  13. AydoÄŸan, M., Karci, A.: Improving the accuracy using pre-trained word embeddings on deep neural networks for Turkish text classification. Physica A 541, 123288 (2020)

    Article  Google Scholar 

  14. Schwenk, H., Li, X.: A corpus for multilingual document classification in eight languages. arXiv preprint arXiv:1805.09821 (2018)

  15. Eriguchi, A., et al.: Zero-shot cross-lingual classification using multilingual neural machine translation. arXiv preprint arXiv:1809.04686 (2018)

  16. Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., Liu, C.: A survey on deep transfer learning. In: Kůrková, V., Manolopoulos, Y., Hammer, B., Iliadis, L., Maglogiannis, I. (eds.) ICANN 2018. LNCS, vol. 11141, pp. 270–279. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01424-7_27

  17. Xian, Y., et al.: Zero-shot learning—a comprehensive evaluation of the good, the bad and the ugly. IEEE Trans. Pattern Anal. Mach. Intell. 41(9), 2251–2265 (2018)

    Article  Google Scholar 

  18. Artetxe, M., Schwenk, H.: Massively multilingual sentence embeddings for zero-shot cross-lingual transfer and beyond. TACL 7, 597–610 (2019)

    Article  Google Scholar 

  19. Sarkar, A., Reddy, S., Iyengar, R.S.: Zero-shot multilingual sentiment analysis using hierarchical attentive network and BERT. In: Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval, pp. 49–56 (2019)

    Google Scholar 

  20. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Conference of the NAACL-HLT 2019, pp. 4171–4186 (2019)

    Google Scholar 

  21. Barbieri, F., Anke, L.E., Camacho-Collados, J.: XLM-T: A multilingual language model toolkit for twitter. arXiv preprint arXiv:2104.12250 (2021)

  22. Gonzalez, J.A., Hurtado, L.F., Pla, F.: TWilBert: pre-trained deep bidirectional transformers for Spanish Twitter. Neurocomputing 426, 58–69 (2021)

    Article  Google Scholar 

  23. Pota, M., Ventura, M., Catelli, R., Esposito, M.: An effective BERT-based pipeline for Twitter sentiment analysis: a case study in Italian. Sensors 21(1), 133 (2021)

    Article  Google Scholar 

  24. Polignano, M., et al.: Alberto: Italian BERT language understanding model for NLP challenging tasks based on tweets. In: 6th Italian Conference on Computational Linguistics, CLiC-it 2019, vol. 2481, pp. 1–6. CEUR (2019)

    Google Scholar 

  25. Wehrmann, J., Becker, W.E., Barros, R.C.: A multi-task neural network for multilingual sentiment classification and language detection on twitter. In: Proceedings of the 33rd Annual ACM Symposium on Applied Computing, pp. 1805–1812 (2018)

    Google Scholar 

  26. Yang, Y., et al.: Multilingual universal sentence encoder for semantic retrieval. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 87–94 (2020)

    Google Scholar 

  27. Janocha, K., Czarnecki, W.M.: On Loss functions for deep neural networks in classification. Schedae Informaticae 25 (2017)

    Google Scholar 

  28. Afridi, M.J., Ross, A., Shapiro, E.M.: On automated source selection for transfer learning in convolutional neural networks. Pattern Recognit. 73, 65–75 (2018)

    Article  Google Scholar 

  29. Manaswi, N.K.: RNN and LSTM. In: Deep Learning with Applications Using Python, pp. 115–126. Apress, Berkeley (2018)

    Google Scholar 

  30. Muhammad, P.F., Kusumaningrum, R., Wibowo, A.: Sentiment analysis using Word2vec and long short-term memory (LSTM) for Indonesian hotel reviews. Procedia Comput. Sci. 179, 728–735 (2021)

    Article  Google Scholar 

  31. Kumar, A., Albuquerque, V.H.C.: Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor Indian language. Trans. Asian Low Resour. Lang. Inf. Process. 20(5), 1–13 (2021)

    Article  Google Scholar 

  32. Hugging Face. https://huggingface.co. Accessed 26 Feb 2022

  33. joeddav/xlm-roberta-large-xnli-Hugging Face. https://huggingface.co/joeddav/xlm-roberta-large-xnli. Accessed 02 Mar 2022

  34. Conneau, A., et al.: XNLI: evaluating cross-lingual sentence representations. In: Proceedings of the EMNLP 2018, pp. 2475–2485 (2020)

    Google Scholar 

  35. Wine Reviews | Kaggle. https://www.kaggle.com/zynicide/wine-reviews/home. Accessed 05 Mar 2022

  36. TensorFlow. https://www.tensorflow.org/. Accessed 27 Feb 2022

  37. Keras: the Python deep learning API. https://keras.io/. Accessed 27 Feb 2022

  38. Google Colaboratory. https://colab.research.google.com/. Accessed 09 Mar 2022

  39. Wine-Searcher. https://www.wine-searcher.com/wine-scores. Accessed 28 Mar 2022

  40. Kyriazis, D., et al.: PolicyCLOUD: analytics as a service facilitating efficient data-driven public policy management. In: Maglogiannis, I., Iliadis, L., Pimenidis, E. (eds.) AIAI 2020. IAICT, vol. 583, pp. 141–150. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49161-1_13

    Chapter  Google Scholar 

Download references

Acknowledgement

The research leading to the results presented in this paper has received funding from the European Union’s funded Project PolicyCLOUD under grant agreement no 870675.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to George Manias .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 IFIP International Federation for Information Processing

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Manias, G., Kiourtis, A., Mavrogiorgou, A., Kyriazis, D. (2022). Multilingual Sentiment Analysis on Twitter Data Towards Enhanced Policy Making. In: Maglogiannis, I., Iliadis, L., Macintyre, J., Cortez, P. (eds) Artificial Intelligence Applications and Innovations. AIAI 2022. IFIP Advances in Information and Communication Technology, vol 647. Springer, Cham. https://doi.org/10.1007/978-3-031-08337-2_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-08337-2_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-08336-5

  • Online ISBN: 978-3-031-08337-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics