skip to main content
10.1145/3503823.3503829acmotherconferencesArticle/Chapter ViewAbstractPublication PagespciConference Proceedingsconference-collections
research-article

Emotion detection on Greek social media using Bidirectional Encoder Representations from Transformers

Published: 22 February 2022 Publication History

Abstract

The widespread proliferation of online social networks has resulted in the creation of huge amounts of data related to, among other things, the expression of opinion and sentiment about literally all aspects of everyday life. In this respect, various tools have been developed by interested parties (companies, individuals) that monitor the social media pulse with respect to various topics (products, persons, organizations, etc) in order to detect the stance, either positive or negative, and the overall emotion in the textual content of social media posts and comments. Despite the success of machine learning models for related tasks in popular languages (e.g. English), little progress has been made in under-represented languages, such as Greek. Based on this reality, in this work, we capitalize on the use of Bidirectional Encoder Representations from Transformers (BERT) architectures for emotion detection in social media textual content written in the Greek language. For this purpose, a relevant corpus is initially collected and annotated. Then, two pre-trained BERT models for the Greek language are employed, one of which has been proposed by the authors of the current work in previous publications. Both models are further trained on the collected corpus and are subsequently used in architectures that classify short social media texts in one or more emotion classes. The obtained results indicate that the BERT-based models outperform other approaches, especially those based on emotion lexicons.

Supplementary Material

Presentation slides (emotion_classifier.pptx)

References

[1]
Francisca Adoma Acheampong, Henry Nunoo-Mensah, and Wenyu Chen. 2021. Transformer models for text-based emotion detection: a review of BERT-based approaches. Artificial Intelligence Review (08 Feb 2021). https://doi.org/10.1007/s10462-021-09958-2
[2]
Ameeta Agrawal and Aijun An. 2012. Unsupervised Emotion Detection from Text Using Semantic and Syntactic Relations. In Proceedings of the The 2012 IEEE/WIC/ACM Conf. on Web Intelligence and Intelligent Agent Technology - Volume 01(WI-IAT ’12). IEEE Computer Society, USA, 346–353.
[3]
Georgios Alexandridis, Iraklis Varlamis, Konstantinos Korovesis, George Caridakis, and Panagiotis Tsantilas. 2021. A Survey on Sentiment Analysis and Opinion Mining in Greek Social Media. Information 12, 8 (2021). https://doi.org/10.3390/info12080331
[4]
Itziar Alonso-Arbiol, Phillip R. Shaver, R. Chris Fraley, Beatriz Oronoz, Erne Unzurrunzaga, and Ruben Urizar. 2006. Structure of the Basque emotion lexicon. Cognition and Emotion 20, 6 (2006), 836–865. https://doi.org/10.1080/02699930500405469 arXiv:https://doi.org/10.1080/02699930500405469
[5]
Oscar Araque, Lorenzo Gatti, Jacopo Staiano, and Marco Guerini. 2019. DepecheMood++: a Bilingual Emotion Lexicon Built Through Simple Yet Powerful Techniques. IEEE Transactions on Affective Computing(2019), 1–1. https://doi.org/10.1109/TAFFC.2019.2934444
[6]
Gilbert Badaro, Hussein Jundi, Hazem Hajj, and Wassim El-Hajj. 2018. EmoWordNet: Automatic Expansion of Emotion Lexicon Using English WordNet. In Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics, New Orleans, Louisiana, 86–93. https://doi.org/10.18653/v1/S18-2009
[7]
Gilbert Badaro, Hussein Jundi, Hazem Hajj, Wassim El-Hajj, and Nizar Habash. 2018. ArSEL: A Large Scale Arabic Sentiment and Emotion Lexicon. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)(Miyazaki, Japan, 7-12), Hend Al-Khalifa, King Saud University, KSA Walid Magdy, University of Edinburgh, UK Kareem Darwish, Qatar Computing Research Institute, Qatar Tamer Elsayed, Qatar University, and Qatar (Eds.). European Language Resources Association (ELRA), Paris, France.
[8]
R C Balabantaray, Mudasir Mohammad, and Nibha Sharma. 2012. Article: Multi-Class Twitter Emotion Classification: A New Approach. International Journal of Applied Information Systems 4, 1 (September 2012), 48–53. Published by Foundation of Computer Science, New York, USA.
[9]
Carla Bazzanella. 2004. Emotions, language and context. Emotion in dialogic interaction: Advances in the Complex (2004), 55–72.
[10]
Laura-Ana-Maria Bostan and Roman Klinger. 2018. An Analysis of Annotated Corpora for Emotion Classification in Text. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, 2104–2119.
[11]
Anamaria Briciu and Mihaiela Lupea. 2017. RoEmoLex - A Romanian Emotion Lexicon. Studia Universitatis Babeș-Bolyai Informatica 62, 2(2017), 45–56. https://doi.org/10.24193/subbi.2017.2.04
[12]
Andrea Chiorrini, C. Diamantini, Alex Mircoli, and D. Potena. 2021. Emotion and sentiment analysis of tweets using BERT. In EDBT/ICDT Workshops.
[13]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR abs/1810.04805(2018). arXiv:1810.04805http://arxiv.org/abs/1810.04805
[14]
Paul Ekman. 1957. A Methodological Discussion of Nonverbal Behavior. The Journal of Psychology 43, 1 (1957), 141–149. https://doi.org/10.1080/00223980.1957.9713059 arXiv:https://doi.org/10.1080/00223980.1957.9713059
[15]
Paul Ekman. 1999. Basic emotions. Handbook of cognition and emotion 98, 45-60 (1999), 16.
[16]
Nida Manzoor Hakak, Mohsin Mohd, Mahira Kirmani, and Mudasir Mohd. 2017. Emotion analysis: A survey. In 2017 International Conference on Computer, Communications and Electronics (Comptelix). 397–402. https://doi.org/10.1109/COMPTELIX.2017.8004002
[17]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1412.6980
[18]
Philipp Koehn. 2005. Europarl: A Parallel Corpus for Statistical Machine Translation. In Proceedings of Machine Translation Summit X: Papers. Phuket, Thailand, 79–86. https://aclanthology.org/2005.mtsummit-papers.11
[19]
John Koutsikakis, Ilias Chalkidis, Prodromos Malakasiotis, and Ion Androutsopoulos. 2020. GREEK-BERT: The Greeks Visiting Sesame Street. In 11th Hellenic Conference on Artificial Intelligence (Athens, Greece) (SETN 2020). Association for Computing Machinery, New York, NY, USA, 110–117.
[20]
Taku Kudo and John Richardson. 2018. SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing. CoRR abs/1808.06226(2018). arXiv:1808.06226http://arxiv.org/abs/1808.06226
[21]
Zuhe Li, Yangyu Fan, Bin Jiang, Tao Lei, and Weihua Liu. 2019. A survey on sentiment analysis and opinion mining for social multimedia. Multimedia Tools and Applications 78, 6 (01 Mar 2019), 6939–6967. https://doi.org/10.1007/s11042-018-6445-z
[22]
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. CoRR abs/1907.11692(2019). arXiv:1907.11692http://arxiv.org/abs/1907.11692
[23]
Saif Mohammad. 2018. Word Affect Intensities. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). European Language Resources Association (ELRA), Miyazaki, Japan.
[24]
Saif Mohammad, Felipe Bravo-Marquez, Mohammad Salameh, and Svetlana Kiritchenko. 2018. SemEval-2018 Task 1: Affect in Tweets. In Proceedings of The 12th International Workshop on Semantic Evaluation. Association for Computational Linguistics, New Orleans, Louisiana, 1–17.
[25]
Saif M. Mohammad and Felipe Bravo-Marquez. 2017. Emotion Intensities in Tweets. In Proceedings of the sixth joint conference on lexical and computational semantics (*Sem). Vancouver, Canada.
[26]
Saif M. Mohammad and Peter D. Turney. 2013. CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON. Computational Intelligence 29, 3 (2013), 436–465.
[27]
Elisavet Palogiannidi, Polychronis Koutsakis, Elias Iosif, and Alexandros Potamianos. 2016. Affective Lexicon Creation for the Greek Language. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). European Language Resources Association (ELRA), Portorož, Slovenia, 2867–2872. https://aclanthology.org/L16-1458
[28]
Daniel Preoţiuc-Pietro, H. Andrew Schwartz, Gregory Park, Johannes Eichstaedt, Margaret Kern, Lyle Ungar, and Elisabeth Shulman. 2016. Modelling Valence and Arousal in Facebook posts. In Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Association for Computational Linguistics, San Diego, California, 9–15. https://doi.org/10.18653/v1/W16-0404
[29]
XiPeng Qiu, TianXiang Sun, YiGe Xu, YunFan Shao, Ning Dai, and XuanJing Huang. 2020. Pre-trained models for natural language processing: A survey. Science China Technological Sciences 63, 10 (01 Oct 2020), 1872–1897. https://doi.org/10.1007/s11431-020-1647-3
[30]
Kumar Ravi and Vadlamani Ravi. 2015. A survey on opinion mining and sentiment analysis: Tasks, approaches and applications. Knowledge-Based Systems 89 (2015), 14–46. https://doi.org/10.1016/j.knosys.2015.06.015
[31]
Kashfia Sailunaz, Manmeet Dhaliwal, Jon Rokne, and Reda Alhajj. 2018. Emotion detection from text and speech: a survey. Social Network Analysis and Mining 8, 1 (07 Apr 2018), 28. https://doi.org/10.1007/s13278-018-0505-2
[32]
Carlo Strapparava and Rada Mihalcea. 2007. SemEval-2007 Task 14: Affective Text. In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). Association for Computational Linguistics, Prague, Czech Republic, 70–74. https://aclanthology.org/S07-1013
[33]
Carlo Strapparava and Alessandro Valitutti. 2004. WordNet Affect: an Affective Extension of WordNet. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04). European Language Resources Association (ELRA), Lisbon, Portugal.
[34]
Pedro Javier Ortiz Suárez, Benoît Sagot, and Laurent Romary. 2019. Asynchronous pipeline for processing huge corpora on medium to low resource infrastructures. In 7th Workshop on the Challenges in the Management of Large Corpora (CMLC-7). Leibniz-Institut für Deutsche Sprache.
[35]
Jessica L. Tracy and Daniel Randles. 2011. Four Models of Basic Emotions: A Review of Ekman and Cordaro, Izard, Levenson, and Panksepp and Watt. Emotion Review 3, 4 (2011), 397–405. https://doi.org/10.1177/1754073911410747
[36]
Ge Xu, Xinfan Meng, and Houfeng Wang. 2010. Build Chinese Emotion Lexicons Using A Graph-based Algorithm and Multiple Resources. In Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010). Coling 2010 Organizing Committee, Beijing, China, 1209–1217. https://aclanthology.org/C10-1136

Cited By

View all
  • (2023)PHNN: A Prompt and Hybrid Neural Network-Based Model for Aspect-Based Sentiment ClassificationElectronics10.3390/electronics1219412612:19(4126)Online publication date: 3-Oct-2023
  • (2023)PIMA: Parameter-Shared Intelligent Media Analytics Framework for Low Resource LanguagesApplied Sciences10.3390/app1305326513:5(3265)Online publication date: 3-Mar-2023
  • (2023)Cross-lingual Sentiment Analysis of Tamil Language Using a Multi-stage Deep Learning ArchitectureACM Transactions on Asian and Low-Resource Language Information Processing10.1145/363139122:12(1-28)Online publication date: 19-Dec-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
PCI '21: Proceedings of the 25th Pan-Hellenic Conference on Informatics
November 2021
499 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 February 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bidirection encoder representations from transformers
  2. emotion detection
  3. natural language processing
  4. online social media
  5. sentiment analysis

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Operational Program Competitiveness, Entrepreneurship and Innovation

Conference

PCI 2021

Acceptance Rates

Overall Acceptance Rate 190 of 390 submissions, 49%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)PHNN: A Prompt and Hybrid Neural Network-Based Model for Aspect-Based Sentiment ClassificationElectronics10.3390/electronics1219412612:19(4126)Online publication date: 3-Oct-2023
  • (2023)PIMA: Parameter-Shared Intelligent Media Analytics Framework for Low Resource LanguagesApplied Sciences10.3390/app1305326513:5(3265)Online publication date: 3-Mar-2023
  • (2023)Cross-lingual Sentiment Analysis of Tamil Language Using a Multi-stage Deep Learning ArchitectureACM Transactions on Asian and Low-Resource Language Information Processing10.1145/363139122:12(1-28)Online publication date: 19-Dec-2023
  • (2023)Greek Political Speech Classification Using BERT2023 14th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA59645.2023.10345868(1-7)Online publication date: 10-Jul-2023
  • (2023)From Pre-Training to Meta-Learning: A Journey in Low-Resource-Language Representation LearningIEEE Access10.1109/ACCESS.2023.332633711(115951-115967)Online publication date: 2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media