Aspect-based sentiment analysis via multitask learning for online reviews
Introduction
Aspect-based sentiment analysis (ABSA) is a fine-grained form of sentiment analysis: rather than assigning a single polarity to a whole sentence, it identifies the aspect terms mentioned in a comment and predicts the sentiment polarity of each. In the example sentence "I like the service in the restaurant, but the environment is not very good", the aspect terms are "service" and "environment", with polarities positive and negative, respectively. Because the two aspects carry opposite sentiments, analyzing the sentence as a whole is inappropriate; a more fine-grained analysis is needed. The main research line of ABSA focuses on two subtasks, namely, aspect term extraction (ATE) and aspect polarity classification (APC).
The APC task is usually treated as a classification task, i.e., classifying the sentiment of a given aspect in a sentence. Approaches to APC have evolved from feature engineering to deep learning-based methods. The most common deep neural network architectures used in APC are convolutional neural networks (CNNs) and recurrent neural networks (RNNs) [1], [2]. Moreover, attention mechanisms [3], [4] have become increasingly widespread in neural networks and are also well suited to ABSA. In recent years, pretrained models have obtained state-of-the-art results on numerous NLP tasks, and many studies build on them, such as the AEN model [5] and BERT-PT [6]. In addition, a sentence contains not only semantic information but also syntactic structure information, such as its dependency tree. Intuitively, integrating syntactic structure into the APC task is helpful because it can better capture the sentiment words related to an aspect. Recently, many methods have regarded the dependency tree as an adjacency matrix and encoded it with graph neural networks (GNNs), such as graph attention networks (GATs) [7] and graph convolutional networks (GCNs) [8], [9].
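To make the adjacency-matrix view concrete, the following minimal NumPy sketch encodes a toy dependency tree as an adjacency matrix and passes word representations through a single GCN layer; the sentence, edges, and dimensions are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Toy sentence: "service was great" — dependency edges: was->service, was->great
words = ["service", "was", "great"]
n, d = len(words), 4

# Adjacency matrix from the dependency tree (undirected, plus self-loops)
A = np.eye(n)
for head, dep in [(1, 0), (1, 2)]:
    A[head, dep] = A[dep, head] = 1.0

# Symmetric normalization: D^{-1/2} A D^{-1/2}
deg = A.sum(axis=1)
D_inv_sqrt = np.diag(deg ** -0.5)
A_hat = D_inv_sqrt @ A @ D_inv_sqrt

# One GCN layer: H' = ReLU(A_hat @ H @ W)
rng = np.random.default_rng(0)
H = rng.normal(size=(n, d))   # word representations (e.g., from an encoder)
W = rng.normal(size=(d, d))   # learnable weight matrix
H_out = np.maximum(A_hat @ H @ W, 0.0)
print(H_out.shape)  # (3, 4)
```

Each word's new representation mixes its own features with those of its dependency-tree neighbors, which is why sentiment words syntactically close to an aspect can influence that aspect's representation.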
In most of these studies, the ATE task was studied independently. ATE is usually regarded as a named entity recognition (NER) problem and treated as a sequence labeling task that extracts aspect terms from sentences [10], [11], [12]. Advances in deep learning have proven useful here as well: recent methods use deep neural networks to assist aspect extraction [13], [14], [15]. Furthermore, owing to BERT's success, many models build on it to perform sequence labeling.
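As a concrete illustration of ATE as sequence labeling, the following sketch uses the common BIO tagging scheme; the sentence and the decoding helper are illustrative, and the paper's exact labeling scheme may differ.

```python
# ATE as sequence labeling: each token gets a BIO tag
# (B = begins an aspect term, I = inside one, O = outside)
tokens = ["I", "like", "the", "battery", "life", "but", "the", "screen", "flickers"]
tags   = ["O", "O",   "O",   "B",       "I",    "O",   "O",   "B",      "O"]

def decode_aspects(tokens, tags):
    """Collect aspect terms from a BIO tag sequence."""
    aspects, current = [], []
    for tok, tag in zip(tokens, tags):
        if tag == "B":
            if current:
                aspects.append(" ".join(current))
            current = [tok]
        elif tag == "I" and current:
            current.append(tok)
        else:
            if current:
                aspects.append(" ".join(current))
            current = []
    if current:
        aspects.append(" ".join(current))
    return aspects

print(decode_aspects(tokens, tags))  # ['battery life', 'screen']
```

A sequence labeling model (e.g., a BERT encoder with a per-token classifier) predicts the tag row, and decoding recovers the aspect terms.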
Other approaches to sentiment analysis [16], [17] have also emerged in recent years: meta-based self-training [18] and prompt-based sentiment analysis [19], [20] have been proposed to perform sentiment analysis more efficiently, and neurosymbolic AI for explainable sentiment analysis [21], [22], [23], [24] is a new trend.
In addition, some works apply multitask learning to ABSA [25], [26], [27] to achieve better performance through interactions between tasks. Inspired by multitask learning, we likewise propose a multitask learning model that combines the APC and ATE tasks; in our model, features from the ATE task further boost APC performance. Inspired by the relational graph attention network (RGAT) presented by Wang et al. [28], we also use a series of RGAT operations to encode the reshaped and pruned dependency tree. Although previous studies have shown that graph neural networks benefit APC, most of them feed sentence dependencies directly into the graph neural network, so the dependencies do not account for the influence of aspects. To address this, we apply multihead attention (MHA) to associate dependency sequences with aspect extraction, enabling our model to focus on the dependency sequences most closely related to the aspects. To validate the proposed model, we conduct extensive experiments on three public datasets; the results show clear improvements and superior performance.
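The MHA step described above can be sketched as standard multihead scaled dot-product attention between aspect features (queries) and dependency-sequence features (keys/values); the shapes and the per-head split below are generic assumptions, not the paper's exact formulation.

```python
import numpy as np

def multi_head_attention(Q, K, V, n_heads):
    """Scaled dot-product attention computed independently per head.

    Q: (m, d) query features (e.g., aspect representations);
    K, V: (n, d) key/value features (e.g., dependency-sequence encodings).
    d must be divisible by n_heads.
    """
    m, d = Q.shape
    d_h = d // n_heads
    outputs = []
    for h in range(n_heads):
        s = slice(h * d_h, (h + 1) * d_h)
        scores = Q[:, s] @ K[:, s].T / np.sqrt(d_h)
        # Numerically stable softmax over the keys
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        outputs.append(weights @ V[:, s])
    return np.concatenate(outputs, axis=-1)  # (m, d)

rng = np.random.default_rng(0)
aspect_feats = rng.normal(size=(2, 8))  # 2 aspects, hidden size 8
dep_feats = rng.normal(size=(5, 8))     # 5 dependency-sequence positions
out = multi_head_attention(aspect_feats, dep_feats, dep_feats, n_heads=4)
print(out.shape)  # (2, 8)
```

Each head attends over the dependency sequence separately, so different heads can emphasize different dependency relations for the same aspect.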
Our main contributions are as follows:
- We propose a multitask learning model that integrates BERT and RGAT for the APC and ATE tasks. The two tasks are conducted simultaneously in a joint training manner.
- We propose to associate dependency sequences with aspect extraction via MHA, which strengthens the connection between aspects and their associated dependency sequences.
- We validate the model on three public datasets; the experimental results show that it outperforms recent state-of-the-art models. We further conduct domain-adaptation experiments and achieve appealing results.
Aspect term extraction
As a subtask of ABSA, aspect term extraction identifies the different aspects mentioned in a given sentence. Aspect terms refer to specific characteristics or attributes of the products or services discussed in a review, so aspect term extraction can be regarded as a textual entity extraction task. Its methods have developed from traditional approaches to deep learning.
Our model
We introduce our multitask learning model in detail in this section. It consists of four main parts: BERT-APC, RGAT, BERT-ATE, and MHA. Fig. 1 shows the overall architecture. The model takes three inputs: the sentence and its aspects are fed into the BERT-APC module, the dependency relations into the RGAT module, and the sentence into the BERT-ATE module. It produces two outputs: the extracted aspects and their polarities. BERT-APC is used to …
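A common way to realize this kind of joint training is to optimize a weighted sum of the two task losses. The following sketch is a generic illustration of such an objective; the weighting factor `alpha` and the exact loss forms are assumptions, not the paper's stated objective.

```python
import numpy as np

def cross_entropy(logits, label):
    """Cross-entropy for one example (logits: (C,), label: int)."""
    z = logits - logits.max()               # stabilize before log-sum-exp
    log_probs = z - np.log(np.exp(z).sum())
    return -log_probs[label]

def joint_loss(apc_logits, polarity, ate_logits, tags, alpha=1.0):
    """Hypothetical joint objective: APC loss (one polarity label per
    aspect) plus the token-averaged ATE loss, weighted by alpha."""
    l_apc = cross_entropy(apc_logits, polarity)
    l_ate = np.mean([cross_entropy(ate_logits[i], t) for i, t in enumerate(tags)])
    return l_apc + alpha * l_ate

rng = np.random.default_rng(0)
loss = joint_loss(rng.normal(size=3), 1,          # 3 polarity classes
                  rng.normal(size=(4, 3)), [0, 1, 2, 0])  # 4 tokens, 3 tags
print(float(loss))
```

Minimizing the sum lets gradients from the auxiliary ATE task shape the shared encoder, which is how ATE features can boost APC performance.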
Experiments
This section first presents the three datasets used in our experiments, then introduces the evaluation metrics, parameter settings, and the baseline approaches used for comparison. Finally, the experimental results are given and analyzed.
Discussion
To further understand the effects of some important parameters and modules on the experimental results, this section discusses them in detail: the impact of multitask learning and MHA, the effect of the number of heads in MHA and RGAT, the effect of the hyperparameter, the effect of different syntactic parsers, and the effect of multitask learning on ATE. In the discussion of multitask learning and MHA, we go further by changing the calculation of MHA to discuss how …
Conclusion
We propose a new multitask learning model for ABSA that combines aspect term extraction and aspect polarity classification. Our model consists of four main modules: BERT-APC, BERT-ATE, RGAT, and MHA. It can both extract aspects and classify their polarity; however, we mainly focus on the APC task, with ATE serving as an auxiliary task to improve APC performance. To correlate the two tasks and highlight important dependencies, we leverage a multihead attention mechanism …
CRediT authorship contribution statement
Guoshuai Zhao: Conceptualization, Methodology, Formal analysis, Writing – original draft, Writing – review & editing, Supervision. Yiling Luo: Methodology, Software, Data curation, Writing – original draft, Writing – review & editing, Formal analysis, Validation. Qiang Chen: Methodology, Software, Data curation, Writing – original draft, Formal analysis. Xueming Qian: Resources, Supervision.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China under Grant 61902309; in part by the China Postdoctoral Science Foundation under Grants 2020M683496 and BX20190273; in part by the Fundamental Research Funds for the Central Universities, China (xxj022019003, xzd012022006); in part by the Humanities and Social Sciences Foundation of the Ministry of Education, China under Grant 16XJAZH003; and in part by the Science and Technology Program of Xi'an, China under Grant …
References (61)
- Attention-based aspect sentiment classification using enhanced learning through CNN-BiLSTM networks, Knowl.-Based Syst. (2022)
- Positionless aspect based sentiment analysis using attention mechanism, Knowl.-Based Syst. (2021)
- Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks, Knowl.-Based Syst. (2022)
- Dynamic evolution of multi-graph based collaborative filtering for recommendation systems, Knowl.-Based Syst. (2021)
- An association-constrained LDA model for joint extraction of product aspects and opinions, Inform. Sci. (2020)
- An enhanced guided LDA model augmented with BERT based semantic strength for aspect term extraction in sentiment analysis, Knowl.-Based Syst. (2022)
- Personalized location recommendation by fusing sentimental and spatial context, Knowl.-Based Syst. (2020)
- Joint reason generation and rating prediction for explainable recommendation, IEEE Trans. Knowl. Data Eng. (2022)
- Enhanced aspect-based sentiment analysis models with progressive self-supervised attention learning, Artificial Intelligence (2021)
- Knowledge-enabled BERT for aspect-based sentiment analysis, Knowl.-Based Syst. (2021)
- Relation construction for aspect-level sentiment classification, Inform. Sci.
- Convolutional attention neural network over graph structures for improving the performance of aspect-level sentiment analysis, Inform. Sci.
- Aggregated graph convolutional networks for aspect-based sentiment classification, Inform. Sci.
- Aspect-based sentiment analysis with attention-assisted graph and variational sentence representation, Knowl.-Based Syst.
- A dependency syntactic knowledge augmented interactive architecture for end-to-end aspect-based sentiment analysis, Neurocomputing
- Exploring users' internal influence from reviews for social recommendation, IEEE Trans. Multimed.
- Targeted sentiment classification with attentional encoder network
- Investigating typed syntactic dependencies for targeted sentiment classification using graph attention neural network, IEEE/ACM Trans. Audio Speech Lang. Process.
- Aspect extraction on user textual reviews using multi-channel convolutional neural network, PeerJ Comput. Sci.
- TADC: A topic-aware dynamic convolutional neural network for aspect extraction, IEEE Trans. Neural Netw. Learn. Syst.
- Affective computing and sentiment analysis, IEEE Intell. Syst.
- CAPER: context-aware personalized emoji recommendation, IEEE Trans. Knowl. Data Eng.
- Meta-based self-training and re-weighting for aspect-based sentiment analysis, IEEE Trans. Affect. Comput.
- The biases of pre-trained language models: An empirical study on prompt-based sentiment analysis and emotion detection, IEEE Trans. Affect. Comput.
Cited by (17)
- Product ranking through fusing the wisdom of consumers extracted from online reviews on multiple platforms, Knowledge-Based Systems (2024)
- MIFINN: A novel multi-information fusion and interaction neural network for aspect-based sentiment analysis, Knowledge-Based Systems (2023)
- Reconstructing graph networks by using new target representation for aspect-based sentiment analysis, Knowledge-Based Systems (2023)
- Breaking down linguistic complexities: A structured approach to aspect-based sentiment analysis, Journal of King Saud University - Computer and Information Sciences (2023)
- Aspect based sentiment analysis using deep learning approaches: A survey, Computer Science Review (2023)
- 1 These authors contributed equally to this work.