Abstract
The goal of this study is to learn more about how the public sees COVID-19 pandemic behaviors and to identify important themes of concern expressed by Tunisian social media users during the epidemic. Around 23K comments were collected, written in both Arabic and Latin characters in the Tunisian dialect. Native language experts manually tagged these comments for sarcasm identification (sarcastic and non-sarcastic). In addition to health, our dataset contains comments on entertainment, social, sports, religion, and politics, all of which are impacted by COVID-19. This research examines the sarcasm expressed in Tunisian social media comments regarding the novel COVID-19 from its appearance in the first half of 2020. We also provide benchmarking findings applying machine learning and deep learning algorithms for sarcasm detection. We obtained an accuracy of above 80%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alhajji, M., Al Khalifah, A., Aljubran, M., Alkhalifah, M.: Sentiment analysis of tweets in Saudi Arabia regarding governmental preventive measures to contain covid-19. In: MDPI AG (2020)
Althagafi, A., Althobaiti, G., Alhakami, H., Alsubait, T.: Arabic tweets sentiment analysis about online learning during covid-19 in Saudi Arabia. Int. J. Adv. Comput. Sci. Appl 12, 620–625 (2021)
Ameur, M.S.H., Aliane, H.: Aracovid19-ssd: Arabic covid-19 sentiment and sarcasm detection dataset. arXiv preprint arXiv:2110.01948 (2021)
Besdouri, F.Z., Mekki, A., Zribi, I., Ellouze, M.: Improvement of the cota-orthography system through language modeling. In: IEEE/ACS 18th International Conference on Computer Systems and Applications, pp. 1–7. IEEE (2021)
Collobert, R., Weston, J.: A unified architecture for natural language processing: Deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, pp. 160–167 (2008)
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
Fix, E., Hodges, J.L.: Discriminatory analysis. nonparametric discrimination: Consistency properties. Int. Stat. Rev./Rev. Internationale de Statistique 57(3), 238–247 (1989)
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)
Habbat, N., Anoun, H., Hassouni, L.: Sentiment analysis and topic modeling on Arabic twitter data during covid-19 pandemic. Indonesian J. Innov. Appl. Sci. (IJIAS) 2(1), 60–67 (2022)
Ho, T.K.: Random decision forests. In: Proceedings of 3rd International Conference on Document Analysis and Recognition, vol. 1, pp. 278–282. IEEE (1995)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Madani, Y., Erritali, M., Bouikhalene, B.: Analyzing Moroccan tweets to extract sentiments related to the coronavirus pandemic: a new classification approach. In: Fakir, M., Baslam, M., El Ayachi, R. (eds.) CBI 2021. LNBIP, vol. 416, pp. 33–42. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-76508-8_3
Mekki, A., Zribi, I., Ellouze, M., Belguith, L.H.: Treebank creation and parser generation for Tunisian social media text. In: IEEE/ACS 17th International Conference on Computer Systems and Applications, pp. 1–8. IEEE (2020)
Mekki, A., Zribi, I., Ellouze, M., Belguith, L.H.: Sentence boundary detection of various forms of Tunisian Arabic. Lang. Resour. Eval. 56(1), 357–385 (2022)
Mekki, A., Zribi, I., Ellouze Khmekhem, M., Hadrich Belguith, L.: Critical description of TA linguistic resources. In: The 4th International Conference on Arabic Computational Linguistics (ACLing 2018) & Procedia Computer Science, November 17–19 2018. Dubai, United Arab Emirates (2018)
Salzberg, S.L.: C4. 5: Programs for Machine Learning by j. Ross Quinlan. Morgan Kaufmann Publishers, Inc. Burlington (1993)
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer-Verlag, New York (1995). https://doi.org/10.1007/978-1-4757-2440-0
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mekki, A., Zribi, I., Ellouze, M., Belguith, L.H. (2022). Sarcasm Detection in Tunisian Social Media Comments: Case of COVID-19. In: Ceci, M., Flesca, S., Masciari, E., Manco, G., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2022. Lecture Notes in Computer Science(), vol 13515. Springer, Cham. https://doi.org/10.1007/978-3-031-16564-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-16564-1_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16563-4
Online ISBN: 978-3-031-16564-1
eBook Packages: Computer ScienceComputer Science (R0)