Sarcasm Detection in Tunisian Social Media Comments: Case of COVID-19

Mekki, Asma; Zribi, Inès; Ellouze, Mariem; Belguith, Lamia Hadrich

doi:10.1007/978-3-031-16564-1_5

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13515))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

767 Accesses
2 Citations

Abstract

The goal of this study is to learn more about how the public sees COVID-19 pandemic behaviors and to identify important themes of concern expressed by Tunisian social media users during the epidemic. Around 23K comments were collected, written in both Arabic and Latin characters in the Tunisian dialect. Native language experts manually tagged these comments for sarcasm identification (sarcastic and non-sarcastic). In addition to health, our dataset contains comments on entertainment, social, sports, religion, and politics, all of which are impacted by COVID-19. This research examines the sarcasm expressed in Tunisian social media comments regarding the novel COVID-19 from its appearance in the first half of 2020. We also provide benchmarking findings applying machine learning and deep learning algorithms for sarcasm detection. We obtained an accuracy of above 80%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Alhajji, M., Al Khalifah, A., Aljubran, M., Alkhalifah, M.: Sentiment analysis of tweets in Saudi Arabia regarding governmental preventive measures to contain covid-19. In: MDPI AG (2020)
Google Scholar
Althagafi, A., Althobaiti, G., Alhakami, H., Alsubait, T.: Arabic tweets sentiment analysis about online learning during covid-19 in Saudi Arabia. Int. J. Adv. Comput. Sci. Appl 12, 620–625 (2021)
Google Scholar
Ameur, M.S.H., Aliane, H.: Aracovid19-ssd: Arabic covid-19 sentiment and sarcasm detection dataset. arXiv preprint arXiv:2110.01948 (2021)
Besdouri, F.Z., Mekki, A., Zribi, I., Ellouze, M.: Improvement of the cota-orthography system through language modeling. In: IEEE/ACS 18th International Conference on Computer Systems and Applications, pp. 1–7. IEEE (2021)
Google Scholar
Collobert, R., Weston, J.: A unified architecture for natural language processing: Deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, pp. 160–167 (2008)
Google Scholar
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
MathSciNet MATH Google Scholar
Fix, E., Hodges, J.L.: Discriminatory analysis. nonparametric discrimination: Consistency properties. Int. Stat. Rev./Rev. Internationale de Statistique 57(3), 238–247 (1989)
Google Scholar
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)
Article MathSciNet Google Scholar
Habbat, N., Anoun, H., Hassouni, L.: Sentiment analysis and topic modeling on Arabic twitter data during covid-19 pandemic. Indonesian J. Innov. Appl. Sci. (IJIAS) 2(1), 60–67 (2022)
Article Google Scholar
Ho, T.K.: Random decision forests. In: Proceedings of 3rd International Conference on Document Analysis and Recognition, vol. 1, pp. 278–282. IEEE (1995)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Madani, Y., Erritali, M., Bouikhalene, B.: Analyzing Moroccan tweets to extract sentiments related to the coronavirus pandemic: a new classification approach. In: Fakir, M., Baslam, M., El Ayachi, R. (eds.) CBI 2021. LNBIP, vol. 416, pp. 33–42. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-76508-8_3
Chapter Google Scholar
Mekki, A., Zribi, I., Ellouze, M., Belguith, L.H.: Treebank creation and parser generation for Tunisian social media text. In: IEEE/ACS 17th International Conference on Computer Systems and Applications, pp. 1–8. IEEE (2020)
Google Scholar
Mekki, A., Zribi, I., Ellouze, M., Belguith, L.H.: Sentence boundary detection of various forms of Tunisian Arabic. Lang. Resour. Eval. 56(1), 357–385 (2022)
Article Google Scholar
Mekki, A., Zribi, I., Ellouze Khmekhem, M., Hadrich Belguith, L.: Critical description of TA linguistic resources. In: The 4th International Conference on Arabic Computational Linguistics (ACLing 2018) & Procedia Computer Science, November 17–19 2018. Dubai, United Arab Emirates (2018)
Google Scholar
Salzberg, S.L.: C4. 5: Programs for Machine Learning by j. Ross Quinlan. Morgan Kaufmann Publishers, Inc. Burlington (1993)
Google Scholar
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer-Verlag, New York (1995). https://doi.org/10.1007/978-1-4757-2440-0
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

ANLP Research Group, MIRACL, University of Sfax, Sfax, Tunisia
Asma Mekki, Inès Zribi, Mariem Ellouze & Lamia Hadrich Belguith

Authors

Asma Mekki
View author publications
You can also search for this author in PubMed Google Scholar
Inès Zribi
View author publications
You can also search for this author in PubMed Google Scholar
Mariem Ellouze
View author publications
You can also search for this author in PubMed Google Scholar
Lamia Hadrich Belguith
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Asma Mekki .

Editor information

Editors and Affiliations

Università degli Studi di Bari Aldo Moro, Bari, Italy
Michelangelo Ceci
Università della Calabria, Rende, Italy
Sergio Flesca
Università Federico II di Napoli, Naples, Italy
Elio Masciari
ICAR-CNR, Rende, Italy
Giuseppe Manco
University of North Carolina, Charlotte, NC, USA
Zbigniew W. Raś

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mekki, A., Zribi, I., Ellouze, M., Belguith, L.H. (2022). Sarcasm Detection in Tunisian Social Media Comments: Case of COVID-19. In: Ceci, M., Flesca, S., Masciari, E., Manco, G., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2022. Lecture Notes in Computer Science(), vol 13515. Springer, Cham. https://doi.org/10.1007/978-3-031-16564-1_5

Download citation

DOI: https://doi.org/10.1007/978-3-031-16564-1_5
Published: 26 September 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16563-4
Online ISBN: 978-3-031-16564-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Sarcasm Detection in Tunisian Social Media Comments: Case of COVID-19