ABSTRACT
Since sarcasm has inverse meaning from what is said or written, it is very hard to detect sarcasm. Therefore, detecting sarcasm is an important task in Natural Language Processing (NLP) field. In this study, we use interjection, intensifier, capital letters, elongated words, and punctuation marks as hyperbole features to detect sarcasm in Indonesian tweets. Particularly, these hyperbole features are utilized by Support Vector Machine (SVM), Random Forest (RF), and RF+Bagging to classify Indonesian tweets in our testing data as sarcasm or not-sarcasm. English tweets obtained from Kaggle and SemEval are employed as our training data, while Indonesian tweets obtained from Drone Emprit are used as the testing data. Our experimental results show that our model with hyperbole features classifies more the tweets in the testing data as sarcasm than that without hyperbole ones. Our observation indicates that using hyperbole features could contribute well to detecting sarcasm.
- Mondher Bouazizi and Tomoaki Otsuki Ohtsuki. 2016. A Pattern-Based Approach for Sarcasm Detection on Twitter. IEEE Access 4 (Sept. 2016), 5477–5488. https://doi.org/10.1109/ACCESS.2016.2594194Google ScholarCross Ref
- Jason Brownlee. 2020. Random Oversampling and Undersampling for Imbalanced Classification. Retrieved June 30, 2022 from https://machinelearningmastery.com/random-oversampling-and-undersampling-for-imbalanced-classification/Google Scholar
- Cambridge. 2022. Dictionary Cambridge - Hasil Penelusuran cyberbullying. Retrieved July 1, 2022 from https://dictionary.cambridge.org/dictionary/english/cyberbullyingGoogle Scholar
- Zheng Lin Chia, Michal Ptaszynski, Fumito Masui, Gniewosz Leliwa, and Michal Wroczynski. 2021. Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection. Information Processing and Management 58 (July 2021), 12 pages. https://doi.org/10.1016/j.ipm.2021.102600Google ScholarDigital Library
- Richard Chin. 2011. The Science of Sarcasm? Yeah, Right. Retrieved June 19, 2022 from https://www.smithsonianmag.com/science-nature/the-science-of-sarcasm-yeah-right-25038/Google Scholar
- Vanessa Van Edwards. 2022. Sarcasm: What It Is and Why It Hurts Us. Retrieved July 1, 2022 from https://www.scienceofpeople.com/sarcasm-why-it-hurts-us/Google Scholar
- Ismail Fahmi. 2018. Drone Emprit Academic: Software for social media monitoring and analytics. Retrieved June 8, 2022 from https://dea.uii.ac.id/Google Scholar
- Vithyatheri Govindan and Vimala Balakrishnan. 2022. A machine learning approach in analysing the effect of hyperboles using negative sentiment tweets for sarcasm detection. Journal of King Saud University - Computer and Information Sciences (Jan. 2022). https://doi.org/10.1016/j.jksuci.2022.01.008Google ScholarCross Ref
- Kamus Besar Bahasa Indonesia (KBBI). 2016. KBBI Daring - Hasil Penelusuran Sarkasme. Retrieved June 13, 2022 from https://kbbi.kemdikbud.go.id/entri/sarkasmeGoogle Scholar
- Jennifer Ling and Roman Klinger. 2016. An Empirical, Quantitative Analysis of the Differences Between Sarcasm and Irony. In European semantic web conference(ESWC 2016, Vol. 9989). Springer International Publishing, 203–216. https://doi.org/10.1007/978-3-319-47602-5Google ScholarCross Ref
- Yunqian Ma and Haibo He. 2013. Imbalanced learning: foundations, algorithms, and applications. John Wiley & Sons.Google Scholar
- P Mahesha and DS Vinod. 2015. Gaussian Mixture Model Based Classification of Stuttering Dysfluencies. Journal of Intelligent Systems 25, 3 (July 2015), 387–399. https://doi.org/10.1515/jisys-2014-0140Google Scholar
- Diana Maynard and Mark A Greenwood. 2014. Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. In Proceedings of the Ninth International Conference on Language Resources and Evaluation(LREC’ 14). ELRA, Reykjavik, Iceland, 4238––4243. http://www.lrec-conf.org/proceedings/lrec2014/pdf/67_Paper.pdfGoogle Scholar
- Neelam Mukhtar, Mohammad Abid Khan, Nadia Chiragh, and Shah Nazir. 2018. Identification and handling of intensifiers for enhancing accuracy of Urdu sentiment analysis. Expert Systems 35, 6 (Dec. 2018). https://doi.org/10.1111/exsy.12317Google ScholarCross Ref
- Masrah Azrifah Azmi Murad. 2018. Sarcasm: Are You being E-Bullied?Retrieved June 24, 2022 from https://www.youtube.com/watch?v=r0-B1fhHgeIGoogle Scholar
- Shubham Kumar Nigam and Mosab Shaheen. 2022. Plumeria at SemEval-2022 Task 6: Robust Approaches for Sarcasm Detection for English and Arabic Using Transformers and Data Augmentation. (2022). https://doi.org/10.48550/arXiv.2203.04111 arXiv:arXiv:2203.04111Google Scholar
- Dwi AP Rahayu, Soveatin Kuntur, and Nur Hayatin. 2018. Sarcasm Detection on Indonesian Twitter Feeds. In 2018 5th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI). IEEE, 137–141. https://doi.org/10.1109/EECSI.2018.8752913Google Scholar
- Yessi Yunitasari, Aina Musdholifah, and Anny Kartika Sari. 2019. Sarcasm Detection For Sentiment Analysis in Indonesian Tweets. IJCCS (Indonesian Journal of Computing and Cybernetics Systems) 13, 1 (Jan. 2019), 53–62. https://doi.org/10.22146/ijccs.41136Google ScholarCross Ref
Index Terms
- Sarcasm Detection in Indonesian Tweets Using Hyperbole Features
Recommendations
Automatic Sarcasm Detection: A Survey
Automatic sarcasm detection is the task of predicting sarcasm in text. This is a crucial step to sentiment analysis, considering prevalence and challenges of sarcasm in sentiment-bearing text. Beginning with an approach that used speech-based features, ...
Sentence-Level Sarcasm Detection in English and Filipino Tweets
ICIBE '18: Proceedings of the 4th International Conference on Industrial and Business EngineeringSarcasm is a special form of sentiment which defines as "a nuanced form of language in which individuals say the opposite of what is implied". In this study, the researchers collected 6,000 Tagalog tweets and 6,000 English tweets from the microblogging ...
Signaling sarcasm
The use of hashtags such as #sarcasm reduces the further use of linguistic markers of sarcasm in tweets.Hashtags such as #sarcasm appear to be the extralinguistic equivalent of non-verbal expressions in live interaction.Sarcastic hashtags are 90% ...
Comments