Abstract
The great expansion in the usage and popularity of social media platforms during the last decades has led to the production of an enormous real-time volume of social texts and posts, including tweets, that are being produced by users. These collections of social data can be potentially useful and provide useful insights to policymakers to adjust new user-centric policies and regulations. However, extracting and analyzing valuable information and knowledge out of these data is a challenging task as concerns the high multilingualism that describes these data. Thus, both the research and the business communities focus on the utilization of multilingual approaches and solutions to enhance the policy making procedures. To investigate a portion of these challenges this research work performs a comparative analysis of two multilingual sentiment analysis approaches. In this context, three multilingual deep learning classifiers and a zero-shot classification approach were utilized and compared. Their comparison has unveiled insightful outcomes and has a two-fold interpretation. Multilingual deep learning classifiers that have pre-trained and evaluated in monolingual data achieve high performances and transfer inference when applied afterwards in multilingual data. However, the zero-shot classification approach fails to achieve high accuracies in monolingual data as in contrary to when applied on multilingual data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Global Social Media Stats | DataReportal – Global Digital Insights. https://datareportal.com/social-media-users. Accessed 27 Feb 2022
Wiederhold, B.K.: Social media use during social distancing. Cyberpsychol. Behav. Soc. Netw. 23(5), 275–276 (2020)
Alalwan, A.A.: Investigating the impact of social media advertising features on customer purchase intention. Int. J. Inf. Manag. 42, 65–77 (2018)
Social media marketing - statistics & facts. https://www.statista.com/topics/1538/social-media-marketing/#dossierKeyfigures. Accessed 26 Feb 2022
Social Media Statistics For 2022 [+Infographic] | Statusbrew. https://statusbrew.com/insights/social-media-statistics. Accessed 25 Feb 2022
The Latest Twitter Stats: Everything You Need to Know | DataReportal – Global Digital Insights. https://datareportal.com/essential-twitter-stats. Accessed 26 Feb 2022
Păvăloaia, V.D., et al.: Opinion mining on social media data: sentiment analysis of user preferences. Sustainability 11(16), 4459 (2019)
Tao, D., Yang, P., Feng, H.: Utilization of text mining as a big da-ta analysis tool for food science and nutrition. Compr. Rev. Food Sci. Food Safety 19(2), 875–894 (2020)
Sánchez-Núñez, P., et al.: Opinion mining, sentiment analysis and emotion understanding in advertising: a bibliometric analysis. IEEE Access 8, 134563–134576 (2020)
Manias, G., et al.: An evaluation of neural machine translation and pre-trained word embeddings in multilingual neural sentiment analysis. In: 2020 IEEE International Conference on PIC, pp. 274–283. IEEE (2020)
Sadia, A., Khan, F., Bashir, F.: An overview of lexicon-based approach for sentiment analysis. In: IEEC 2018, pp. 1–6 (2018)
Conneau, A., et al.: unsupervised cross-lingual representation learning at scale. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 8440–8451. Association for Computational Linguistics (2020)
AydoÄŸan, M., Karci, A.: Improving the accuracy using pre-trained word embeddings on deep neural networks for Turkish text classification. Physica A 541, 123288 (2020)
Schwenk, H., Li, X.: A corpus for multilingual document classification in eight languages. arXiv preprint arXiv:1805.09821 (2018)
Eriguchi, A., et al.: Zero-shot cross-lingual classification using multilingual neural machine translation. arXiv preprint arXiv:1809.04686 (2018)
Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., Liu, C.: A survey on deep transfer learning. In: Kůrková, V., Manolopoulos, Y., Hammer, B., Iliadis, L., Maglogiannis, I. (eds.) ICANN 2018. LNCS, vol. 11141, pp. 270–279. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01424-7_27
Xian, Y., et al.: Zero-shot learning—a comprehensive evaluation of the good, the bad and the ugly. IEEE Trans. Pattern Anal. Mach. Intell. 41(9), 2251–2265 (2018)
Artetxe, M., Schwenk, H.: Massively multilingual sentence embeddings for zero-shot cross-lingual transfer and beyond. TACL 7, 597–610 (2019)
Sarkar, A., Reddy, S., Iyengar, R.S.: Zero-shot multilingual sentiment analysis using hierarchical attentive network and BERT. In: Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval, pp. 49–56 (2019)
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Conference of the NAACL-HLT 2019, pp. 4171–4186 (2019)
Barbieri, F., Anke, L.E., Camacho-Collados, J.: XLM-T: A multilingual language model toolkit for twitter. arXiv preprint arXiv:2104.12250 (2021)
Gonzalez, J.A., Hurtado, L.F., Pla, F.: TWilBert: pre-trained deep bidirectional transformers for Spanish Twitter. Neurocomputing 426, 58–69 (2021)
Pota, M., Ventura, M., Catelli, R., Esposito, M.: An effective BERT-based pipeline for Twitter sentiment analysis: a case study in Italian. Sensors 21(1), 133 (2021)
Polignano, M., et al.: Alberto: Italian BERT language understanding model for NLP challenging tasks based on tweets. In: 6th Italian Conference on Computational Linguistics, CLiC-it 2019, vol. 2481, pp. 1–6. CEUR (2019)
Wehrmann, J., Becker, W.E., Barros, R.C.: A multi-task neural network for multilingual sentiment classification and language detection on twitter. In: Proceedings of the 33rd Annual ACM Symposium on Applied Computing, pp. 1805–1812 (2018)
Yang, Y., et al.: Multilingual universal sentence encoder for semantic retrieval. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 87–94 (2020)
Janocha, K., Czarnecki, W.M.: On Loss functions for deep neural networks in classification. Schedae Informaticae 25 (2017)
Afridi, M.J., Ross, A., Shapiro, E.M.: On automated source selection for transfer learning in convolutional neural networks. Pattern Recognit. 73, 65–75 (2018)
Manaswi, N.K.: RNN and LSTM. In: Deep Learning with Applications Using Python, pp. 115–126. Apress, Berkeley (2018)
Muhammad, P.F., Kusumaningrum, R., Wibowo, A.: Sentiment analysis using Word2vec and long short-term memory (LSTM) for Indonesian hotel reviews. Procedia Comput. Sci. 179, 728–735 (2021)
Kumar, A., Albuquerque, V.H.C.: Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor Indian language. Trans. Asian Low Resour. Lang. Inf. Process. 20(5), 1–13 (2021)
Hugging Face. https://huggingface.co. Accessed 26 Feb 2022
joeddav/xlm-roberta-large-xnli-Hugging Face. https://huggingface.co/joeddav/xlm-roberta-large-xnli. Accessed 02 Mar 2022
Conneau, A., et al.: XNLI: evaluating cross-lingual sentence representations. In: Proceedings of the EMNLP 2018, pp. 2475–2485 (2020)
Wine Reviews | Kaggle. https://www.kaggle.com/zynicide/wine-reviews/home. Accessed 05 Mar 2022
TensorFlow. https://www.tensorflow.org/. Accessed 27 Feb 2022
Keras: the Python deep learning API. https://keras.io/. Accessed 27 Feb 2022
Google Colaboratory. https://colab.research.google.com/. Accessed 09 Mar 2022
Wine-Searcher. https://www.wine-searcher.com/wine-scores. Accessed 28 Mar 2022
Kyriazis, D., et al.: PolicyCLOUD: analytics as a service facilitating efficient data-driven public policy management. In: Maglogiannis, I., Iliadis, L., Pimenidis, E. (eds.) AIAI 2020. IAICT, vol. 583, pp. 141–150. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49161-1_13
Acknowledgement
The research leading to the results presented in this paper has received funding from the European Union’s funded Project PolicyCLOUD under grant agreement no 870675.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 IFIP International Federation for Information Processing
About this paper
Cite this paper
Manias, G., Kiourtis, A., Mavrogiorgou, A., Kyriazis, D. (2022). Multilingual Sentiment Analysis on Twitter Data Towards Enhanced Policy Making. In: Maglogiannis, I., Iliadis, L., Macintyre, J., Cortez, P. (eds) Artificial Intelligence Applications and Innovations. AIAI 2022. IFIP Advances in Information and Communication Technology, vol 647. Springer, Cham. https://doi.org/10.1007/978-3-031-08337-2_27
Download citation
DOI: https://doi.org/10.1007/978-3-031-08337-2_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08336-5
Online ISBN: 978-3-031-08337-2
eBook Packages: Computer ScienceComputer Science (R0)