skip to main content
research-article

Automatic Algerian Sarcasm Detection from Texts and Images

Published: 19 July 2024 Publication History

Abstract

In recent years, the number of Algerian Internet users has significantly increased, providing a valuable opportunity for collecting and utilizing opinions and sentiments expressed online. They now post not just texts but also images. However, to benefit from this wealth of information, it is crucial to address the challenge of sarcasm detection, which poses a limitation in sentiment analysis. Sarcasm often involves the use of nonliteral and ambiguous language, making its detection complex. To enhance the quality and relevance of sentiment analysis, it is essential to develop effective methods for sarcasm detection. By overcoming this limitation, we can fully harness the expressed online opinions and benefit from their valuable insights for a better understanding of trends and sentiments among the Algerian public. In this work, our aim is to develop a comprehensive system that addresses sarcasm detection in Algerian dialect, encompassing both text and image analysis. We propose a hybrid approach that combines linguistic characteristics and machine learning techniques for text analysis. Additionally, for image analysis, we utilized the deep learning model VGG-19 for image classification, and employed the EasyOCR technique for Arabic text extraction. By integrating these approaches, we strive to create a robust system capable of detecting sarcasm in both textual and visual content in the Algerian dialect. Our system achieved an accuracy of 92.79% for the textual models and 89.28% for the visual model.

References

[1]
P. P. Rokade and K. D. Aruna. 2019. Business intelligence analytics using sentiment analysis—A survey. International Journal of Electrical and Computer Engineering 9, 1 (2019), 613.
[2]
L. Moudjari, K. Akli-Astouati, and F. Benamara. 2020. An Algerian corpus and an annotation platform for opinion and emotion analysis. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, 1202–1210,.
[3]
R. Rahmoun. 2022. Etats Des Lieux Du Marketing électronique En Algérie. Algerian Scientific Journal Platform, Les Cahiers du MECAS 18 (2022), 163–177.
[4]
H. Saadane and N. Habash. 2015. A conventional orthography for Algerian Arabic. In Proceedings of the 2nd Workshop on Arabic Natural Language Processing. 69–79.
[5]
M. Benrabah. 2005. The language planning situation in Algeria. Current Issues in Language Planning 6, 4 (2005), 379–502.
[6]
N. Babanejad, H. Davoudi, A. An, and M. Papagelis. 2020. Affective and contextual embedding for sarcasm detection. In Proceedings of the 28th International Conference on Computational Linguistics. 225–243.
[7]
J. Devlin, M. Chang, K. Lee, and K. Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota. Association for Computational Linguistics, 4171--4186.
[8]
W. Antoun, F. Baly, and H. Hajj. 2020. AraBERT: Transformer-based model for Arabic language understanding. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, 9–15.
[9]
M. Abdul-Mageed, A. Elmadany, and E. M. B. Nagoudi. 2020. ARBERT & MARBERT: Deep bidirectional transformers for Arabic.
[10]
L. K. Ahire, Sachin D. Babar, Gitanjali R. Shinde, Parikshit N. Mahalle, Gitanjali R. Shinde, Nilanjan Dey, and Aboul Ella Hassanien. 2021. Sarcasm detection in online social network: Myths, realities, and issues. Security Issues and Privacy Threats in Smart Ubiquitous Computing. Springer.
[11]
A. D. Dave and N. P. Desai. 2016. A comprehensive study of classification techniques for sarcasm detection on textual data. In Proceedings of the 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT). IEEE. 1985–1991.
[12]
M. Bouazizi and T. O. Ohtsuki. 2016. A pattern-based approach for sarcasm detection on Twitter. IEEE Access 4 (2016), 5477–5488.
[13]
A. Joshi, P. Bhattacharyya, and M. J. Carman. 2017. Automatic sarcasm detection: A survey. ACM Computing Surveys (CSUR) 50, 5 (2017), 1–22.
[14]
S. K. Bharti, K. S. Babu, and S. K. Jena. 2015. Parsing-based sarcasm sentiment recognition in Twitter data. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 1373–1380.
[15]
M. S. M. Suhaimin, M. H. A. Hijazi, R. Alfred, and F. Coenen. 2017. Natural language processing based features for sarcasm detection: An investigation using bilingual social media texts. In Proceedings of the 2017 8th International Conference on Information technology (ICIT). IEEE. 703–709.
[16]
K. Sundararajan and A. Palanisamy. 2020. Multi-rule based ensemble feature selection model for sarcasm type detection in Twitter. Computational Intelligence and Neuroscience (2020), 2860479. DOI:
[17]
V. P. Jariwala. 2020. Optimal feature extraction based machine learning approach for sarcasm type detection in news headlines. International Journal of Computer Applications 975 (2020), 8887.
[18]
S. Hiai and K. Shimada. 2016. A sarcasm extraction method based on patterns of evaluation expressions. In Proceedings of the 2016 5th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI). IEEE. 31–36.
[19]
I. A. Farha and W. Magdy. 2020. From Arabic sentiment analysis to sarcasm detection: The ArSarcasm dataset. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. 32–39.
[20]
R. González-Ibánez, S. Muresan, and N. Wacholder. 2011. Identifying sarcasm in Twitter: A closer look. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 581–586.
[21]
E. Riloff, A. Qadir, P. Surve, L. De Silva, N. Gilbert, and R. Huang. 2013. Sarcasm as contrast between a positive sentiment and negative situation. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. 704–714.
[22]
H. K. Kumar and B. S. Harish. 2018. Sarcasm classification: A novel approach by using content based feature selection method. Procedia Computer Science 143 (2018), 378–386.
[23]
N. Pawar and S. Bhingarkar. 2020. Machine learning based sarcasm detection on Twitter data. In Proceedings of the 2020 5th International Conference on Communication and Electronics Systems (ICCES). IEEE. 957–961.
[24]
R. Gupta, J. Kumar, and H. Agrawal. 2020. A statistical approach for sarcasm detection using Twitter data. In Proceedings of the 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS). IEEE. 633–638.
[25]
C. I. Eke, A. A. Norman, and L. Shuib. 2021. Multi-feature fusion framework for sarcasm identification on Twitter data: A machine learning based approach. PLOS One 16, 6 (2021), e0252918.
[26]
A. Y. Abdullah Amer and T. Siddiqu. 2022. A novel algorithm for sarcasm detection using supervised machine learning approach. AIMS Electronics and Electrical Engineering 6, 4 (2022), 345–369.
[27]
V. Govindan and V. Balakrishnan. 2022. A machine learning approach in analysing the effect of hyperboles using negative sentiment tweets for sarcasm detection. Journal of King Saud University - Computer and Information Sciences 34, 8 (2022), 5110–5120.
[28]
J. Karoui, F. B. Zitoune, and V. Moriceau. 2017. Soukhria: Towards an irony detection system for Arabic in social media. Procedia Computer Science 117 (2017), 161–168.
[29]
D. Al-Ghadhban, E. Alnkhilan, L. Tatwany, and M. Alrazgan. 2017. Arabic sarcasm detection in Twitter. In Proceedings of the 2017 International Conference on Engineering & MIS (ICEMIS). IEEE. 1–7.
[30]
M. M. Abuteir and E. S. Elsamani. 2021. Automatic sarcasm detection in Arabic text: A supervised classification approach. International Journal of New Technology and Research 7, 8 (2021), 1–11.
[31]
M. A. Abdelaal, M. A. Fattah, and M. M. Arafa. 2022. Predicting sarcasm and polarity in Arabic text automatically: Supervised machine learning approach. Journal of Theoretical and Applied Information Technology 100, 8 (2022).
[32]
L. Ren, B. Xu, H. Lin, X. Liu, and L. Yang. 2020. Sarcasm detection with sentiment semantics enhanced multi-level memory network. Neurocomputing 401 (2020), 320–326.
[33]
J. Liu, S. Tian, L. Yu, X. Shi, and F. Wang. 2024. Image-text fusion transformer network for sarcasm detection. Multimedia Tools and Applications 83, 14 (2024), 41895--41909.
[34]
S. S. Salim, A. N. Ghanshyam, D. M. Ashok, D. B. Mazahir, and B. S. Thakare. 2020. Deep LSTM-RNN with word embedding for sarcasm detection on Twitter. In Proceedings of the 2020 International Conference for Emerging Technology (INCET). IEEE. 1–4.
[35]
I. A. Farha and W. Magdy. 2021. Benchmarking transformer-based language models for Arabic sentiment and sarcasm detection. In Proceedings of the 6th Arabic Natural Language Processing Workshop. 21–31.
[36]
D. K. Sharma, B. Singh, S. Agarwal, H. Kim, and R. Sharma. 2022. Sarcasm detection over social media platforms using hybrid auto-encoder-based model. Electronics 11, 18 (2022), 2844.
[37]
M. Khodak, N. Saunshi, and K. Vodrahalli. 2017. A large self-annotated corpus for sarcasm.
[38]
R. Misra and P. Arora. 2019. Sarcasm detection using hybrid neural network.
[39]
T. Ptáček, I. Habernal, and J. Hong. 2014. Sarcasm detection on Czech and English Twitter. In Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 213–223.
[40]
O. Vitman, Y. Kostiuk, G. Sidorov, and A. Gelbukh. 2023. Sarcasm detection framework using context, emotion and sentiment features. Expert Systems with Applications 234 (2023), 121068.
[41]
D. K. Sharma, B. Singh, S. Agarwal, N. Pachauri, A. A. Alhussan, and H. A. Abdallah. 2023. Sarcasm detection over social media platforms using hybrid ensemble model with fuzzy logic. Electronics 12, 4 (2023), 937.
[42]
A. Abuzayed and H. Al-Khalifa. 2021. Sarcasm and sentiment detection in Arabic tweets using BERT-based models and data augmentation. In Proceedings of the 6th Arabic Natural Language Processing Workshop. 312–317.
[43]
D. Faraj and M. Abdullah. 2021. SarcasmDet at sarcasm detection task 2021 in Arabic using AraBERT pretrained model. In Proceedings of the 6th Arabic Natural Language Processing Workshop. 345–350.
[44]
L. Bashmal and D. AlZeer. 2021. ArSarcasm shared task: An ensemble BERT model for SarcasmDetection in Arabic tweets. In Proceedings of the 6th Arabic Natural Language Processing Workshop. 323–328.
[45]
H. AlMazrua, N. AlHazzani, A. AlDawod, L. AlAwlaqi, N. AlReshoudi, H. Al-Khalifa, and L. AlDhubayi. 2022. Sa′7r: A Saudi dialect irony dataset. In Proceedings of the 5th Workshop on Open-Source Arabic Corpora and Processing Tools with Shared Tasks on Qur'an QA and Fine-Grained Hate Speech Detection. 60–70.
[46]
A. Mekki, I. Zribi, M. Ellouze, and L. H. Belguith. 2022. A Tunisian benchmark social media data set for COVID-19 sentiment analysis and sarcasm detection.
[47]
A. Kaseb and M. Farouk. 2023. SAIDS: A novel approach for sentiment analysis informed of dialect and sarcasm.
[48]
A. Ameur, S. Hamdi, and S. B. Yahia. 2023. Domain adaptation approach for Arabic sarcasm detection in hotel reviews based on hybrid learning. Procedia Computer Science 225 (2023), 3898–3908.
[49]
R. W. Smith. 2013. History of the Tesseract OCR engine: What worked and what didn't. In Document Recognition and Retrieval XX, Vol. 8658. SPIE, 865802.
[50]
J. Subramanian, V. Sridharan, K. Shu, and H. Liu. 2019. Exploiting emojis for sarcasm detection. Lecture Notes in Computer Science, Vol. 11549. Springer, 70–80.
[51]
K. Mott. 2019. State classification of cooking objects using a VGG CNN.
[52]
P. S. Thakur, T. Sheorey, and A. Ojha. 2023. VGG-ICNN: A lightweight CNN model for crop disease identification. Multimedia Tools and Applications 82, 1 (2023), 497–520.
[53]
P. Kralj Novak, J. Smailović, B. Sluban, and I. Mozetič. 2015. Sentiment of emojis. PLOS One 10, 12 (2015), e0144296.
[54]
I. A. Farha, S. Wilson, S. Oprea, and W. Magdy. 2022. Sarcasm detection is way too easy! An empirical comparison of human and machine sarcasm detection. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022. 5284–5295.
[55]
P. Chaudhari and C. Chandankhede. 2017. Literature survey of sarcasm detection. In Proceedings of the 2017 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET). IEEE, 2041–2046.
[56]
M. Sree Ram Kiran Nag, G. Srinivas, K. Venkata Rao, S. Vakkalanka, and S. Nagendram. 2022. An efficient procedure for identifying the similarity between French and English languages with sequence matcher technique. In Advances in Data Science and Management: Proceedings of ICDSM 2021. Springer Nature, Singapore. 29–40.
[57]
H. I. Lim. 2021. A study on dropout techniques to reduce overfitting in deep neural networks. In Advanced Multimedia and Ubiquitous Engineering: MUE-FutureTech 2020. Springer, Singapore. 133–139.
[58]
L. Anolli, R. Ciceri, and M. G. Infantino. 2000. Irony as a game of implicitness: Acoustic profiles of ironic communication. Journal of Psycholinguistic Research 29 (2000), 275–311.
[59]
W. Mohamed Ahmed. 2020. Semiotics of elections in political caricature of online newspaper: A case study of 2018 presidential Egyptian elections. المجلة العربية لبحوث الاعلام والاتصال 30 (2020), 2–40.
[60]
R. Sarwar, A. Mahmood, M. S. Riaz, and G. Mustafa. 2023. Political Represenation of Cartoons Published in Pakistani English Newspapers: A Semiotic Analysis. PalArch's Journal of Archaeology of Egypt/Egyptology 20, 2 (2023), 546–559.
[61]
K. Simonyan and A. Zisserman. 2014. Very deep convolutional networks for large-scale image recognition.
[62]
X. Pei, Y. hong Zhao, L. Chen, Q. Guo, Z. Duan, Y. Pan, and H. Hou. 2023. Robustness of machine learning to color, size change, normalization, and image enhancement on micrograph datasets with large sample differences. Materials & Design 232 (2023), 112086.
[63]
M. M. Abuteir and E. S. Elsamani. 2021. SVM based approach for detecting sarcasm in Arabic text. International Journal of Advanced Research in Computer and Communication Engineering 10, 8 (August 2021).
[64]
M. A. Galal, A. H. Yousef, H. H. Zayed, and W. Medhat. 2024. Arabic sarcasm detection: An enhanced fine-tuned language model approach. Ain Shams Engineering Journal 15, 6 (2024), 102736.

Cited By

View all
  • (2025)Supposititious Sarcasm Detection and Sentiment Analysis Coping Hindi Language in Social Networks Harnessing Zipf- Mandelbrot Probabilistic Optimisation and Perplexity Entropy LearningACM Transactions on Asian and Low-Resource Language Information Processing10.1145/371206124:2(1-28)Online publication date: 16-Jan-2025
  • (2024)Improving the Accuracy of Sarcasm Detection in Text Data Using a Smooth Support Vector Classification Model with Word-Emoji Embedding for News and Indian Indigenous Languages2024 International Conference on System, Computation, Automation and Networking (ICSCAN)10.1109/ICSCAN62807.2024.10894530(1-6)Online publication date: 27-Dec-2024

Index Terms

  1. Automatic Algerian Sarcasm Detection from Texts and Images

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 23, Issue 7
      July 2024
      254 pages
      EISSN:2375-4702
      DOI:10.1145/3613605
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 19 July 2024
      Online AM: 03 June 2024
      Accepted: 24 May 2024
      Revised: 06 February 2024
      Received: 14 September 2023
      Published in TALLIP Volume 23, Issue 7

      Check for updates

      Author Tags

      1. Sentiment analysis
      2. Sarcasm detection
      3. Algerian Dialect
      4. linguistic features
      5. BERT model

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)157
      • Downloads (Last 6 weeks)10
      Reflects downloads up to 02 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Supposititious Sarcasm Detection and Sentiment Analysis Coping Hindi Language in Social Networks Harnessing Zipf- Mandelbrot Probabilistic Optimisation and Perplexity Entropy LearningACM Transactions on Asian and Low-Resource Language Information Processing10.1145/371206124:2(1-28)Online publication date: 16-Jan-2025
      • (2024)Improving the Accuracy of Sarcasm Detection in Text Data Using a Smooth Support Vector Classification Model with Word-Emoji Embedding for News and Indian Indigenous Languages2024 International Conference on System, Computation, Automation and Networking (ICSCAN)10.1109/ICSCAN62807.2024.10894530(1-6)Online publication date: 27-Dec-2024

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media