skip to main content
10.1145/3508230.3508232acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnlpirConference Proceedingsconference-collections
research-article

Aspect-Based Sentiment Analysis of Social Media Data With Pre-Trained Language Models

Published:08 March 2022Publication History

ABSTRACT

There is a great scope in utilizing the increasing content expressed by users on social media platforms such as Twitter. This study explores the application of Aspect-based Sentiment Analysis (ABSA) of tweets to retrieve fine-grained sentiment insights. The Plant-based food domain is chosen as an area of focus. To the best of our knowledge this is the first time ABSA task is done for this sector and it is distinct from standard food products because different and controversial aspects arise and opinions are polarized. The choice is relevant because these products can help in meeting the sustainable development goals and improve the welfare of millions of animals. Pre-trained BERT,”Bidirectional Encoder Representations with transformers”, is fine-tuned for this task and stands out because it was trained to learn from all the words in the sentence simultaneously using transformers. The aim was to develop methods to be applied on real life cases, therefore lowering the dependency on labeled data and improving performance were the key objectives. This research contributes to existing approaches of ABSA by proposing data processing techniques to adapt social media data for ABSA. The scope of this project presents a new method for the aspect category detection task (ACD) which does not rely on labeled data by using regular expressions (Regex). For aspect the sentiment classification task (ASC) a semi-supervised learning technique is explored. Additionally Part-of-Speech (POS) tags are incorporated into the predictions. The findings show that Regex is a solution to eliminate the dependency on labeled data for ACD. For ASC fine-tuning BERT on a small subset of data was the most accurate method to lower the dependency on aspect level sentiment data.

References

  1. Miklós Véha Márk Szakály Zoltán Szakály András Fehér, MichałVéha Gazdecki. 2020. A Comprehensive Review of the Benefits of and the Barriers to the Switch to a Plant-Based Diet. Sustainability (2020).Google ScholarGoogle Scholar
  2. Dogu Araci. 2019. Finbert: Financial sentiment analysis with pre-trained language models. arXiv preprint arXiv:1908.10063(2019).Google ScholarGoogle Scholar
  3. Udayan Khurana Elias B. Khalil Authors: Fatemeh Nargesian, Horst Samulowitz and Deepak Turaga. 2017. Learning Feature Engineering for Classification. ACM Digital Library (2017).Google ScholarGoogle Scholar
  4. Jamshid Bagherzadeh and Hasan Asil. 2019. A review of various semi-supervised learning models with a deep learning and memory approach. Iran Journal of Computer Science 2, 2 (2019), 65–80.Google ScholarGoogle ScholarCross RefCross Ref
  5. The World Bank. 2020. Population, total. https://data.worldbank.org/indicator/SP.POP.TOTL. Accessed: 04/12/2021.Google ScholarGoogle Scholar
  6. Samuel R. Bowman, Ellie Pavlick, Edouard Grave, Benjamin Van Durme, Alex Wang, Jan Hula, Patrick Xia, Raghavendra Pappagari, R. Thomas McCoy, Roma Patel, Najoung Kim, Ian Tenney, Yinghui Huang, Katherin Yu, Shuning Jin, and Berlin Chen. 2019. Looking for ELMo’s friends: Sentence-Level Pretraining Beyond Language Modeling. https://openreview.net/forum?id=Bkl87h09FXGoogle ScholarGoogle Scholar
  7. Paweł Budzianowski and Ivan Vulić. 2019. Hello, It’s GPT-2 – How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems. arxiv:1907.05774 [cs.CL]Google ScholarGoogle Scholar
  8. Bingli Clark Chai, Johannes Reidar van der Voort, Kristina Grofelnik, Helga Gudny Eliasdottir, Ines Klöss, and Federico J. A. Perez-Cueto. 2019. Which diet has the least environmental impact on our planet? A systematic review of vegan, vegetarian and omnivorous diets. Sustainability 11, 15 (2019), 4110.Google ScholarGoogle ScholarCross RefCross Ref
  9. Saranlita Chotirata and PhayungMeesad. 2021. Part-of-Speech tagging enhancement to natural language processing for Thai wh-question classification with deep learning. Science Direct (2021).Google ScholarGoogle Scholar
  10. Anirban Choudhury. 2019. A Deep Dive Analysis of Customer Sentiments in the Food Service Industry | Quantzig’s New Success Story. https://apnews.com/press-release/pr-businesswire/58df0387e8dc46479849a6cb3078eb29. Accessed: 2020-11-15.Google ScholarGoogle Scholar
  11. Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arxiv:1412.3555 [cs.NE]Google ScholarGoogle Scholar
  12. Jessica Clement. 2019. Twitter: monthly active users worldwide | Statista. https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/. Accessed: 2020-11-13.Google ScholarGoogle Scholar
  13. Brian Dean. 2021. Social Network Usage & Growth Statistics: How Many People Use Social Media in 2021?https://backlinko.com/social-media-users#social-media-usage-stats. Accessed: 04/12/2021.Google ScholarGoogle Scholar
  14. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).Google ScholarGoogle Scholar
  15. Hai Ha Do, P. W. C. Prasad, Angelika Maag, and Abeer Alsadoon. 2019. Deep learning for aspect-based sentiment analysis: a comparative review. Expert Systems with Applications 118 (2019), 272–299.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Mickel Hoang, Oskar Alija Bihorac, and Jacobo Rouces. 2019. Aspect-based sentiment analysis using bert. In NEAL Proceedings of the 22nd Nordic Conference on Computional Linguistics (NoDaLiDa), September 30-October 2, Turku, Finland. Linköping University Electronic Press, 187–196.Google ScholarGoogle Scholar
  17. Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. 168–177.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Cristela Maia Bairrada Isabel Miguel, Arnaldo Coelho. 2021. Modelling Attitude towards Consumption of Vegan Products. Sustainability (2021).Google ScholarGoogle Scholar
  19. Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, and Ondrej Chum. 2019. Label Propagation for Deep Semi-supervised Learning. arxiv:1904.04717 [cs.CV]Google ScholarGoogle Scholar
  20. Zhao Jianqiang and Gui Xiaolin. 2017. Comparison research on text pre-processing methods on twitter sentiment analysis. IEEE Access 5(2017), 2870–2879.Google ScholarGoogle ScholarCross RefCross Ref
  21. A.M. Kuchling. [n.d.]. Regular Expression HOWTO. Technical Report. Python.Google ScholarGoogle Scholar
  22. Mary l. McHugh. 2012. Interrater reliability: the kappa statistic. Biochemia Medica (2012).Google ScholarGoogle Scholar
  23. Ning Li, Chi-Yin Chow, and Jia-Dong Zhang. 2020. SEML: A Semi-Supervised Multi-Task Learning Framework for Aspect-Based Sentiment Analysis. IEEE Access 8(2020), 189287–189297.Google ScholarGoogle ScholarCross RefCross Ref
  24. I. Montan M. Honnibal. 2017. spaCy2: Natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing. Zenodo (2017).Google ScholarGoogle Scholar
  25. Esteban Ortiz-Ospina. 2019. The rise of social media. https://ourworldindata.org/rise-of-social-media. Accessed: 2020-11-12.Google ScholarGoogle Scholar
  26. Tal Perry. 2020. Context is King! Why Deep Learning matters for NLP. https://www.lighttag.io/blog/context-is-king/. Accessed: 2020-11-09.Google ScholarGoogle Scholar
  27. Maria Pontiki, Dimitrios Galanis, Haris Papageorgiou, Ion Androutsopoulos, Suresh Manandhar, Mohammad Al-Smadi, Mahmoud Al-Ayyoub, Yanyan Zhao, Bing Qin, Orphée De Clercq, 2016. Semeval-2016 task 5: Aspect based sentiment analysis. In 10th International Workshop on Semantic Evaluation (SemEval 2016).Google ScholarGoogle ScholarCross RefCross Ref
  28. Maria Pontiki, Dimitris Galanis, John Pavlopoulos, Harris Papageorgiou, Ion Androutsopoulos, and Suresh Manandhar. 2014. SemEval-2014 Task 4: Aspect Based Sentiment Analysis. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014). Association for Computational Linguistics, Dublin, Ireland, 27–35. https://doi.org/10.3115/v1/S14-2004Google ScholarGoogle ScholarCross RefCross Ref
  29. Paramita Ray and Amlan Chakrabarti. 2020. A mixed approach of deep learning method and rule-based method to improve aspect level sentiment analysis. Applied Computing and Informatics(2020).Google ScholarGoogle Scholar
  30. Sebastian Ruder, Matthew E Peters, Swabha Swayamdipta, and Thomas Wolf. 2019. Transfer learning in natural language processing. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Tutorials. 15–18.Google ScholarGoogle ScholarCross RefCross Ref
  31. Marzieh Saeidi, Guillaume Bouchard, Maria Liakata, and Sebastian Riedel. 2016. SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. The COLING 2016 Organizing Committee, Osaka, Japan, 1546–1556. https://www.aclweb.org/anthology/C16-1146Google ScholarGoogle Scholar
  32. Yasuhiro Fujiwara Sekitoshi Kana and Sotetsu Iwamura. 2017. Preventing Gradient Explosionsin Gated Recurrent Units. ACM Digital Library (2017).Google ScholarGoogle Scholar
  33. Peng Shi and Jimmy Lin. 2019. Simple bert models for relation extraction and semantic role labeling. arXiv preprint arXiv:1904.05255(2019).Google ScholarGoogle Scholar
  34. Sam Shleifer. 2019. Low Resource Text Classification with ULMFit and Backtranslation. arxiv:1903.09244 [cs.CL]Google ScholarGoogle Scholar
  35. Chi Sun, Luyao Huang, and Xipeng Qiu. 2019. Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentence. arXiv preprint arXiv:1903.09588(2019).Google ScholarGoogle Scholar
  36. Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to Sequence Learning with Neural Networks. arxiv:1409.3215 [cs.CL]Google ScholarGoogle Scholar
  37. Bo Wang and Min Liu. 2015. Deep learning for aspect-based sentiment analysis. Stanford University report(2015).Google ScholarGoogle Scholar
  38. Hu Xu, Bing Liu, Lei Shu, and Philip S. Yu. 2019. Bert post-training for review reading comprehension and aspect-based sentiment analysis. arXiv preprint arXiv:1904.02232(2019).Google ScholarGoogle Scholar
  39. Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. 2020. XLNet: Generalized Autoregressive Pretraining for Language Understanding. arxiv:1906.08237 [cs.CL]Google ScholarGoogle Scholar
  40. Paul R. Yarnold. 2016. ODA vs. ϕ and κ: Paradoxes of Kappa. Optimal Data Analysis(2016).Google ScholarGoogle Scholar
  41. Danny Zhu. 2020. Sentiment Analysis for Impossible Burger. https://rpubs.com/DannyZhu/Impossible_Burger. Accessed: 2020-11-15.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    NLPIR '21: Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval
    December 2021
    175 pages
    ISBN:9781450387354
    DOI:10.1145/3508230

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 8 March 2022

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format