research-article

Aspect-Based Sentiment Analysis of Social Media Data With Pre-Trained Language Models

Authors:
Anina Troya

Graduate School of Informatics, University of Amsterdam, Netherlands

Graduate School of Informatics, University of Amsterdam, Netherlands
View Profile

,
Reshmi Gopalakrishna Pillai

Graduate School of Informatics, University of Amsterdam, Netherlands

Graduate School of Informatics, University of Amsterdam, Netherlands
View Profile

,
Dr. Cristian Rodriguez Rivero

Graduate School of Informatics, University of Amsterdam, Netherlands

Graduate School of Informatics, University of Amsterdam, Netherlands
View Profile

,
Dr. Zulkuf Genc

Data Science, Prosus N.V., Netherlands

Data Science, Prosus N.V., Netherlands
View Profile

,
Dr. Subhradeep Kayal

Data Science, Prosus N.V., Netherlands

Data Science, Prosus N.V., Netherlands
View Profile

,
Dogu Araci

Data Science, Prosus N.V., Netherlands

Data Science, Prosus N.V., Netherlands
View Profile

NLPIR '21: Proceedings of the 2021 5th International Conference on Natural Language Processing and Information RetrievalDecember 2021Pages 8–17https://doi.org/10.1145/3508230.3508232

Published:08 March 2022Publication History

NLPIR '21: Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval

Pages 8–17

ABSTRACT

There is a great scope in utilizing the increasing content expressed by users on social media platforms such as Twitter. This study explores the application of Aspect-based Sentiment Analysis (ABSA) of tweets to retrieve fine-grained sentiment insights. The Plant-based food domain is chosen as an area of focus. To the best of our knowledge this is the first time ABSA task is done for this sector and it is distinct from standard food products because different and controversial aspects arise and opinions are polarized. The choice is relevant because these products can help in meeting the sustainable development goals and improve the welfare of millions of animals. Pre-trained BERT,”Bidirectional Encoder Representations with transformers”, is fine-tuned for this task and stands out because it was trained to learn from all the words in the sentence simultaneously using transformers. The aim was to develop methods to be applied on real life cases, therefore lowering the dependency on labeled data and improving performance were the key objectives. This research contributes to existing approaches of ABSA by proposing data processing techniques to adapt social media data for ABSA. The scope of this project presents a new method for the aspect category detection task (ACD) which does not rely on labeled data by using regular expressions (Regex). For aspect the sentiment classification task (ASC) a semi-supervised learning technique is explored. Additionally Part-of-Speech (POS) tags are incorporated into the predictions. The findings show that Regex is a solution to eliminate the dependency on labeled data for ACD. For ASC fine-tuning BERT on a small subset of data was the most accurate method to lower the dependency on aspect level sentiment data.

References

Miklós Véha Márk Szakály Zoltán Szakály András Fehér, MichałVéha Gazdecki. 2020. A Comprehensive Review of the Benefits of and the Barriers to the Switch to a Plant-Based Diet. Sustainability (2020).Google Scholar
Dogu Araci. 2019. Finbert: Financial sentiment analysis with pre-trained language models. arXiv preprint arXiv:1908.10063(2019).Google Scholar
Udayan Khurana Elias B. Khalil Authors: Fatemeh Nargesian, Horst Samulowitz and Deepak Turaga. 2017. Learning Feature Engineering for Classification. ACM Digital Library (2017).Google Scholar
Jamshid Bagherzadeh and Hasan Asil. 2019. A review of various semi-supervised learning models with a deep learning and memory approach. Iran Journal of Computer Science 2, 2 (2019), 65–80.Google ScholarCross Ref
The World Bank. 2020. Population, total. https://data.worldbank.org/indicator/SP.POP.TOTL. Accessed: 04/12/2021.Google Scholar
Samuel R. Bowman, Ellie Pavlick, Edouard Grave, Benjamin Van Durme, Alex Wang, Jan Hula, Patrick Xia, Raghavendra Pappagari, R. Thomas McCoy, Roma Patel, Najoung Kim, Ian Tenney, Yinghui Huang, Katherin Yu, Shuning Jin, and Berlin Chen. 2019. Looking for ELMo’s friends: Sentence-Level Pretraining Beyond Language Modeling. https://openreview.net/forum?id=Bkl87h09FXGoogle Scholar
Paweł Budzianowski and Ivan Vulić. 2019. Hello, It’s GPT-2 – How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems. arxiv:1907.05774 [cs.CL]Google Scholar
Bingli Clark Chai, Johannes Reidar van der Voort, Kristina Grofelnik, Helga Gudny Eliasdottir, Ines Klöss, and Federico J. A. Perez-Cueto. 2019. Which diet has the least environmental impact on our planet? A systematic review of vegan, vegetarian and omnivorous diets. Sustainability 11, 15 (2019), 4110.Google ScholarCross Ref
Saranlita Chotirata and PhayungMeesad. 2021. Part-of-Speech tagging enhancement to natural language processing for Thai wh-question classification with deep learning. Science Direct (2021).Google Scholar
Anirban Choudhury. 2019. A Deep Dive Analysis of Customer Sentiments in the Food Service Industry | Quantzig’s New Success Story. https://apnews.com/press-release/pr-businesswire/58df0387e8dc46479849a6cb3078eb29. Accessed: 2020-11-15.Google Scholar
Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arxiv:1412.3555 [cs.NE]Google Scholar
Jessica Clement. 2019. Twitter: monthly active users worldwide | Statista. https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/. Accessed: 2020-11-13.Google Scholar
Brian Dean. 2021. Social Network Usage & Growth Statistics: How Many People Use Social Media in 2021?https://backlinko.com/social-media-users#social-media-usage-stats. Accessed: 04/12/2021.Google Scholar
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).Google Scholar
Hai Ha Do, P. W. C. Prasad, Angelika Maag, and Abeer Alsadoon. 2019. Deep learning for aspect-based sentiment analysis: a comparative review. Expert Systems with Applications 118 (2019), 272–299.Google ScholarDigital Library
Mickel Hoang, Oskar Alija Bihorac, and Jacobo Rouces. 2019. Aspect-based sentiment analysis using bert. In NEAL Proceedings of the 22nd Nordic Conference on Computional Linguistics (NoDaLiDa), September 30-October 2, Turku, Finland. Linköping University Electronic Press, 187–196.Google Scholar
Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. 168–177.Google ScholarDigital Library
Cristela Maia Bairrada Isabel Miguel, Arnaldo Coelho. 2021. Modelling Attitude towards Consumption of Vegan Products. Sustainability (2021).Google Scholar
Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, and Ondrej Chum. 2019. Label Propagation for Deep Semi-supervised Learning. arxiv:1904.04717 [cs.CV]Google Scholar
Zhao Jianqiang and Gui Xiaolin. 2017. Comparison research on text pre-processing methods on twitter sentiment analysis. IEEE Access 5(2017), 2870–2879.Google ScholarCross Ref
A.M. Kuchling. [n.d.]. Regular Expression HOWTO. Technical Report. Python.Google Scholar
Mary l. McHugh. 2012. Interrater reliability: the kappa statistic. Biochemia Medica (2012).Google Scholar
Ning Li, Chi-Yin Chow, and Jia-Dong Zhang. 2020. SEML: A Semi-Supervised Multi-Task Learning Framework for Aspect-Based Sentiment Analysis. IEEE Access 8(2020), 189287–189297.Google ScholarCross Ref
I. Montan M. Honnibal. 2017. spaCy2: Natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing. Zenodo (2017).Google Scholar
Esteban Ortiz-Ospina. 2019. The rise of social media. https://ourworldindata.org/rise-of-social-media. Accessed: 2020-11-12.Google Scholar
Tal Perry. 2020. Context is King! Why Deep Learning matters for NLP. https://www.lighttag.io/blog/context-is-king/. Accessed: 2020-11-09.Google Scholar
Maria Pontiki, Dimitrios Galanis, Haris Papageorgiou, Ion Androutsopoulos, Suresh Manandhar, Mohammad Al-Smadi, Mahmoud Al-Ayyoub, Yanyan Zhao, Bing Qin, Orphée De Clercq, 2016. Semeval-2016 task 5: Aspect based sentiment analysis. In 10th International Workshop on Semantic Evaluation (SemEval 2016).Google ScholarCross Ref
Maria Pontiki, Dimitris Galanis, John Pavlopoulos, Harris Papageorgiou, Ion Androutsopoulos, and Suresh Manandhar. 2014. SemEval-2014 Task 4: Aspect Based Sentiment Analysis. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014). Association for Computational Linguistics, Dublin, Ireland, 27–35. https://doi.org/10.3115/v1/S14-2004Google ScholarCross Ref
Paramita Ray and Amlan Chakrabarti. 2020. A mixed approach of deep learning method and rule-based method to improve aspect level sentiment analysis. Applied Computing and Informatics(2020).Google Scholar
Sebastian Ruder, Matthew E Peters, Swabha Swayamdipta, and Thomas Wolf. 2019. Transfer learning in natural language processing. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Tutorials. 15–18.Google ScholarCross Ref
Marzieh Saeidi, Guillaume Bouchard, Maria Liakata, and Sebastian Riedel. 2016. SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. The COLING 2016 Organizing Committee, Osaka, Japan, 1546–1556. https://www.aclweb.org/anthology/C16-1146Google Scholar
Yasuhiro Fujiwara Sekitoshi Kana and Sotetsu Iwamura. 2017. Preventing Gradient Explosionsin Gated Recurrent Units. ACM Digital Library (2017).Google Scholar
Peng Shi and Jimmy Lin. 2019. Simple bert models for relation extraction and semantic role labeling. arXiv preprint arXiv:1904.05255(2019).Google Scholar
Sam Shleifer. 2019. Low Resource Text Classification with ULMFit and Backtranslation. arxiv:1903.09244 [cs.CL]Google Scholar
Chi Sun, Luyao Huang, and Xipeng Qiu. 2019. Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentence. arXiv preprint arXiv:1903.09588(2019).Google Scholar
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to Sequence Learning with Neural Networks. arxiv:1409.3215 [cs.CL]Google Scholar
Bo Wang and Min Liu. 2015. Deep learning for aspect-based sentiment analysis. Stanford University report(2015).Google Scholar
Hu Xu, Bing Liu, Lei Shu, and Philip S. Yu. 2019. Bert post-training for review reading comprehension and aspect-based sentiment analysis. arXiv preprint arXiv:1904.02232(2019).Google Scholar
Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. 2020. XLNet: Generalized Autoregressive Pretraining for Language Understanding. arxiv:1906.08237 [cs.CL]Google Scholar
Paul R. Yarnold. 2016. ODA vs. ϕ and κ: Paradoxes of Kappa. Optimal Data Analysis(2016).Google Scholar
Danny Zhu. 2020. Sentiment Analysis for Impossible Burger. https://rpubs.com/DannyZhu/Impossible_Burger. Accessed: 2020-11-15.Google Scholar

Recommendations

Sentence compression for aspect-based sentiment analysis

Sentiment analysis, which addresses the computational treatment of opinion, sentiment, and subjectivity in text, has received considerable attention in recent years. In contrast to the traditional coarse-grained sentiment analysis tasks, such as ...
Read More
Aspect-Based Sentiment Analysis with Pre-trained Language Models
Read More
IAN-BERT: Combining Post-trained BERT with Interactive Attention Network for Aspect-Based Sentiment Analysis
Abstract
Aspect-based sentiment analysis (ABSA), a task in sentiment analysis, predicts the sentiment polarity of specific aspects mentioned in the input sentence. Recent research has demonstrated the effectiveness of Bidirectional Encoder Representation ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

NLPIR '21: Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval
December 2021
175 pages
ISBN:9781450387354
DOI:10.1145/3508230

Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 March 2022
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
ABSA
Aspect-Based Sentiment Analysis
BERT
POS tags
plant based Domain
semi-supervised
social media data.
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 217
  Total Downloads
- Downloads (Last 12 months)108
- Downloads (Last 6 weeks)18
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Aspect-Based Sentiment Analysis of Social Media Data With Pre-Trained Language Models

NLPIR '21: Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval

ABSTRACT

References

Cited By

Recommendations

Sentence compression for aspect-based sentiment analysis

Aspect-Based Sentiment Analysis with Pre-trained Language Models

IAN-BERT: Combining Post-trained BERT with Interactive Attention Network for Aspect-Based Sentiment Analysis

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Aspect-Based Sentiment Analysis of Social Media Data With Pre-Trained Language Models

NLPIR '21: Proceedings of the 2021 5th International Conference on Natural Language Processing and Information Retrieval

ABSTRACT

References

Cited By

Recommendations

Sentence compression for aspect-based sentiment analysis

Aspect-Based Sentiment Analysis with Pre-trained Language Models

IAN-BERT: Combining Post-trained BERT with Interactive Attention Network for Aspect-Based Sentiment Analysis

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media