S3
 map: Semisupervised aspect-based sentiment analysis with masked aspect prediction

doi:10.1016/j.knosys.2023.110513

Knowledge-Based Systems

Volume 269, 7 June 2023, 110513

https://doi.org/10.1016/j.knosys.2023.110513 Get rights and content

Abstract

Aspect-based sentiment analysis (ABSA) refers to a fine-grained task of detecting the sentiment polarities of sentences at the aspect level. To resolve this task, ABSA training samples must be annotated with aspect words and the corresponding sentiment polarities. However, collecting such fine-grained training samples is expensive and time-consuming. Therefore the available ABSA training samples are often scarce. To break the data scarcity challenge of ABSA, we investigate semi-supervised aspect-based sentiment analysis (SemiABSA), which trains ABSA models using a limited amount of expensive labeled sentences and more unlabeled-yet-cheaper sentences. We propose a novel SemiABSA framework, namely semi-supervised aspect-based sentiment analysis with masked aspect prediction (S $^{3}$ map), built on the self-training paradigm. We form pseudo-aspect words and pseudo-sentiment polarities for unlabeled sentences and improve model training. Specifically, a BERT-encoder-based masked aspect prediction (MAP) task achieves the pseudo-aspect words generation. Based on S $^{3}$ map, we thoroughly investigate the potential of SemiABSA from various perspectives. The empirical results show that S $^{3}$ map can consistently improve performance by leveraging unlabeled sentences, even those from different domains.

Introduction

Sentiment analysis (SA) is a fundamental research topic in the information retrieval, and natural language processing communities [1], [2], [3]. Generally, SA aims to automatically detect the sentiment polarity, {e.g.,Positive, Neutral, Negative} from sentences such as those in reviews on restaurants, movies, and products. For example, given a restaurant review,“ Nice restaurant whose service needs to be improved”, SA aims to accurately detect its actual sentiment polarity Positive. Naturally, SA is potentially in demand in many real-world applications.

Unfortunately, the traditional SA task only concentrates on the sentiment polarity of the full text. We expect to analyze fine-grained aspect-based sentiment in many real scenarios to explore more valuable information. Retaking the example above, we may be more concerned with the sentiment target “service” whose polarity is negative rather than the polarity of the full text. In response to this demand, more attention has been recently paid to the emerging topic of Aspect-Based Sentiment Analysis (ABSA), whose aim is to automatically detect the sentiment polarity of certain aspects [4], [5], [6], [7], [8], [9], [10], [11].

Generally speaking, the first step is to collect fine-grained training datasets to resolve ABSA with machine learning techniques. Several human annotators should tag aspect words and their sentiment polarities for training sentences, as illustrated in Table 1. Collecting such training datasets is costly and much more expensive than traditional SA training data. Therefore the available ABSA training datasets are scarce and often contain very few training sentences, e.g., the volumes of the prevalent ABSA datasets Restaurant and Laptop from SemiEval 14 [12] are only 2282 and 3608, respectively. Such scarce ABSA training sentences contain limited supervised signals, limiting the performance upper bound of ABSA models.

To meet this demand, we take inspiration from the spirit of semi-Supervised learning (SSL) and accordingly attempt to train ABSA models by simultaneously leveraging a limited amount of expensive labeled sentences and more unlabeled-yet-cheaper sentences, yielding a topic of semi-supervised aspect-based sentiment analysis (SemiABSA). Formally, we are given a training dataset, which consists of a subset of $N_{l}$ labeled triplets $Ω_{l} = {(s_{i}, a_{i}, y_{i})}_{i = 1}^{N_{l}}$ and a subset of $N_{u}$ unlabeled sentences $Ω_{u} = {s_{i}}_{i = N_{l} + 1}^{N_{l} + N_{u}}$ . Specifically, $s_{i} = {s_{i 1}, s_{i 2}, \dots, s_{i N_{i}}}$ , $a_{i} \in {0, 1}^{N_{i}}$ , and $y_{i} \in {0, 1}^{M}$ denote the raw sentence, aspect word indicator, and one-hot sentiment polarity indicator, respectively, where $N_{i}$ represents its number of word tokens and $M$ is the number of sentiment polarities. We consider the inductive learning paradigm, where the aim is to train a predictive model from $Ω_{l} \cup Ω_{u}$ and apply it to predict any unseen sentence-aspect tuple $(s, a)$ . To our knowledge, very few studies have addressed this topic.

In this paper, we propose a novel self-training SemiABSA framework, namely Semi-Supervised aspect-based Sentiment analysis with Masked Aspect Prediction (S³map). To fully use unlabeled sentences, our basic idea is to form pseudo-aspect-specific sentence embeddings and pseudo-sentiment polarities and train the sentiment classifier with them in a self-training manner. Specifically, S $^{3}$ map consists of 4 key modules: BERT-Encoder, Aspect-Discriminator, Aspect-Sentence-Encoder(AS-Encoder), and Sentiment-Predicter. First, the BERT-Encoder can be treated as the basic feature encoder for both sentences and tokens. Second, the Aspect-Discriminator is a BERT-Encoder-based masked aspect prediction task trained over labeled sentences, and it is used to identify the aspect words for unlabeled sentences. Third, with the pseudo-aspect in hand, we utilize the AS-Encoder to update the pseudo-aspect embedding and sentence embedding. Various structures such as GCN or attention networks can instantiate AS-Encoder. Then, we combine pseudo-aspect embedding and sentence embedding to form the pseudo-aspect-specific sentence embeddings. Finally, S $^{3}$ map is jointly trained with labeled and pseudo-labeled sentences.

Empirically, we thoroughly investigate the potential of SemiABSA from various perspectives based on S $^{3}$ map. First, we employ two prevalent ABSA collections of reviews on restaurants and laptops and four collections of unlabeled sentences from multiple domains, including reviews, daily social media posts, and encyclopedias. We generate synthetic SemiABSA datasets using pairwise combinations of labeled and unlabeled datasets. Accordingly, a total of 8 synthetic SemiABSA datasets are generated. We evaluate S $^{3}$ map on these datasets. The empirical results demonstrate that S $^{3}$ map can consistently improve performance by leveraging unlabeled sentences in various scenarios, even when labeled and unlabeled sentences are from different domains, and unlabeled sentences, i.e., encyclopedias, tend to be without any sentiment polarities. In addition, S $^{3}$ map significantly outperforms the existing SemiABSA methods.

In summary, the major contributions of this paper are outlined below:

•
We investigate the problem of SemiABSA and propose a novel framework named S³map.
•
We propose a novel BERT-based MAP task to infer the aspect words of sentences.
•
We conduct extensive experiments to indicate the effectiveness of S³map.

Section snippets

Sentiment analysis

Traditional SA methods mainly aim to automatically predict sentiment polarity, e.g., attitudes, and opinions, for full texts [13], [14]. From the development timeline of SA, the prior arts include rule-based systems [15], [16], shallow learning-based methods [17], [18] and deep learning-based methods [19], [20]. Generally, deep learning-based SA methods use neural networks to form discriminative text embeddings and can achieve promising performance. From the perspective of the network

Overall framework of S $^{3}$ map

Overall, S $^{3}$ map is built on the idea of self-training and it consists of 4 basic modules. (1) BERT-Encoder: We apply the pre-trained BERT Model $g (; W_{b})$ as the basic encoder, which inputs a sentence $s_{i}$ and outputs the contextualized embeddings of all tokens $h_{i} = g (s_{i}; W_{b}) \in R^{C \times N}$ . Each column $h_{i j} \in R^{C}$ denotes the embedding of the $j$ th word token. (2) Aspect Discriminator: This is used to identify the aspect words for unlabeled sentences (3) Aspect-Sentence-Encoder (AS-Encoder): This extracts the aspect

Experimental settings

In this section, we introduce the experimental settings, including datasets, parameter configuration of S $^{3}$ map, and evaluation metrics.

Datasets. We employ two prevalent ABSA datasets Restaurant (Rest.) and Laptop (Lap.) from SemEval 2014 Task 4 [12], and four datasets of unlabeled sentences from various domains, including two review collections - (Yelp⁴ and Amazon,⁵) and two generic sentence

Results and analysis

In this section, we empirically evaluate the proposed S $^{3}$ map method, and mainly attempt to answer the following questions:

•
Q1: Can S $^{3}$ map compete with the existing arts of semi-supervised learning and supervised learning?
•
Q2: Can S $^{3}$ map effectively improve the performance with auxiliary unlabeled sentences?
•
Q3: Can S $^{3}$ map be sensitive to the configurations of labeled and unlabeled sentences from various domains?
•
Q4: Is S $^{3}$ map is sensitive to different numbers of unlabeled sentences?

Conclusion

This work addresses the scarcity problem of fine-grained ABSA training sentences. To do this, we use the SemiABSA framework to simultaneously leverage labeled and unlabeled sentences for ABSA model training, and propose a novel method named S $^{3}$ map. The proposed S $^{3}$ map framework is built on the self-training paradigm, whose key idea is to generate pseudo-aspect words and pseudo-sentiment polarities for unlabeled sentences. Specifically, we propose a BERT-based MAP task to predict aspect words

CRediT authorship contribution statement

Zhiyao Yang: Investigation, Conceptualization, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. Bing Wang: Conceptualization, Validation, Investigation, Writing – original draft, Writing – review & editing. Ximing Li: Writing – review & editing. Wenting Wang: Project administration. Jihong Ouyang: Project administration, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

We want to acknowledge support for this project from the National Natural Science Foundation of China (NSFC) (No. 62276113, No. 62006094), Scientific and Technological Developing Scheme of Jilin Province (No. 20180201003SF, No. 20190701031GH) and Energy Administration of Jilin Province (No. 3D516L921421).

References (68)

MaoR. et al.
MetaPro: A computational metaphor processing model for text pre-processing
Inf. Fusion
(2022)
FengS. et al.
Aspect-based sentiment analysis with attention-assisted graph and variational sentence representation
Knowl.-Based Syst.
(2022)
XuW. et al.
Semi-supervised target-oriented sentiment classification
Neurocomputing
(2019)
Y. Zhang, Y. Zhang, Tree Communication Models for Sentiment Analysis, in: Conference of the Association for...
H. Tian, C. Gao, X. Xiao, H. Liu, B. He, H. Wu, H. Wang, F. Wu, SKEP: Sentiment Knowledge Enhanced Pre-training for...
J. Barnes, R. Kurtz, S. Oepen, L. Øvrelid, E. Velldal, Structured Sentiment Analysis as Dependency Graph Parsing, in:...
P. Chen, Z. Sun, L. Bing, W. Yang, Recurrent Attention Network on Memory for Aspect Sentiment Analysis, in: Conference...
R. He, W.S. Lee, H.T. Ng, D. Dahlmeier, Effective Attention Modeling for Aspect-Level Sentiment Classification, in:...
W. Xue, T. Li, Aspect Based Sentiment Analysis with Gated Convolutional Networks, in: Annual Meeting of the Association...
X. Li, L. Bing, W. Lam, B. Shi, Transformation Networks for Target-Oriented Sentiment Classification, in: Annual...

K. Sun, R. Zhang, S. Mensah, Y. Mao, X. Liu, Aspect-Level Sentiment Analysis Via Convolution over Dependency Tree, in:...

C. Du, H. Sun, J. Wang, Q. Qi, J. Liao, T. Xu, M. Liu, Capsule Network with Interactive Attention for Aspect-Level...

Z. Chen, T. Qian, Transfer Capsule Network for Aspect Level Sentiment Classification, in: Annual Meeting of the...

R. Li, H. Chen, F. Feng, Z. Ma, X. Wang, E.H. Hovy, Dual Graph Convolutional Networks for Aspect-based Sentiment...

M. Pontiki, D. Galanis, J. Pavlopoulos, H. Papageorgiou, I. Androutsopoulos, S. Manandhar, SemEval-2014 Task 4: Aspect...

HemmatianF. et al.

A survey on classification techniques for opinion mining and sentiment analysis

Artif. Intell. Rev.

(2019)

YueL. et al.

A survey of sentiment analysis in social media

Knowl. Inf. Syst.

(2019)

C. Hutto, E. Gilbert, Vader: A parsimonious rule-based model for sentiment analysis of social media text, in:...

VashishthaS. et al.

Fuzzy rule based unsupervised sentiment analysis from social media posts

Expert Syst. Appl.

(2019)

B. Pang, L. Lee, S. Vaithyanathan, Thumbs up? Sentiment Classification using Machine Learning Techniques, in:...

TripathyA. et al.

Document-level sentiment classification using hybrid machine learning approach

Knowl. Inf. Syst.

(2017)

HabimanaO. et al.

Sentiment analysis using deep learning approaches: An overview

Sci. China Inf. Sci.

(2020)

YadavA. et al.

Sentiment analysis using deep learning architectures: A review

Artif. Intell. Rev.

(2020)

S. Poria, E. Cambria, A. Gelbukh, Deep convolutional neural network textual features and multiple kernel learning for...

Z. Teng, D.T. Vo, Y. Zhang, Context-sensitive lexicon features for neural sentiment analysis, in: Conference on...

Y. Wang, M. Huang, X. Zhu, L. Zhao, Attention-based LSTM for Aspect-level Sentiment Classification, in: Conference on...

ZhaoP. et al.

Modeling sentiment dependencies with graph convolutional networks for aspect-level sentiment classification

Knowl.-Based Syst.

(2020)

E. Cambria, R. Mao, S. Han, Q. Liu, Sentic parser: A graph-based approach to concept extraction for sentiment analysis,...

X. Wang, W. Jiang, Z. Luo, Combination of convolutional and recurrent neural network for sentiment analysis of short...

MaoR. et al.

The biases of pre-trained language models: An empirical study on prompt-based sentiment analysis and emotion detection

IEEE Trans. Affect. Comput.

(2022)

E. Cambria, Q. Liu, S. Decherchi, F. Xing, K. Kwok, SenticNet 7: A Commonsense-based Neurosymbolic AI Framework for...

D. Tang, B. Qin, T. Liu, Aspect Level Sentiment Classification with Deep Memory Network, in: Conference on Empirical...

D. Ma, S. Li, X. Zhang, H. Wang, Interactive Attention Networks for Aspect-Level Sentiment Classification, in:...

C. Zhang, Q. Li, D. Song, Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks, in:...

Cited by (8)

Aspect-based sentiment classification with aspect-specific hypergraph attention networks
2024, Expert Systems with Applications
Aspect-based sentiment classification aims to infer the sentiment expression towards a specific aspect in a sentence. The key to this task is to utilize the relationship between sentiment words and aspect words. The mainstream methods use Recurrent Neural Networks (RNN), Attention mechanisms, or Graph Neural Networks (GNN) to explore the syntactic information. Though these methods are undoubtedly effective, they still encounter several challenges: (1) Since most of the studies used only syntactic dependency graphs, they lacked a more optimal representation of inter-word relationships. (2) Some studies have explored multiple relationship graphs, but they fail to effectively integrate syntactic dependencies with semantic or other information, thereby impeding the exchange of multiple information elements. Moreover, the inclusion of more information graphs increases the computational burden on the model. In this paper, we construct a word-level relational hypergraph containing various syntactic and semantic relationships between aspect words and other context words. We propose an aspect-specific hypergraph attention network (ASHGAT) to thoroughly investigate the hypergraph’s information. Furthermore, we design an aspect-oriented syntactic distance-based weight distribution mechanism to optimize hypergraph attention. We conducted extensive experiments on four benchmark datasets from SemEval 14, 15, and 16. The results show that ASHGAT demonstrates the other SOTA baselines.
Pseudo dense counterfactual augmentation for aspect-based sentiment analysis
2023, Neurocomputing
Aspect-based sentiment analysis (ABSA) is a fine-grained text classification task, and the cutting-edge ABSA models have achieved outstanding performance. Unfortunately, the robustness of these ABSA models is neglected. ABSA models must face numerous challenges to be robust, and we concentrate on one of these challenges caused by negation words, such as “not”, “un-”. In the actual context, these negation words intuitively result in two problems: negative sensitivity and spurious correlation. First, a negation word tends to reverse the sentiment polarity of a sentence. Meanwhile, in the ABSA datasets, most sentences containing negation words express Negative polarities, which will lead the predictive model to learn the spurious correlation between negation words and polarities. To resolve these ambiguous issues, we are inspired by causal inference and propose a novel data augmentation framework, namely Pseudo Dense Counterfactual Augmentation (PDCaug) for ABSA. Specifically, we initialize a pseudo sequence and employ a multi-head multi-layer attention network to achieve counterfactual augmentation for a vanilla sentence in the hidden space. This pseudo sequence will be adversarially trained. PDCaug is a plug-and-play method for various ABSA models, so we evaluate it on discriminative models and generative prompt-based models. Our extensive experiments show that our PDCaug can significantly and consistently outperform several data augmentation methods and ABSA models.
Reconstructing graph networks by using new target representation for aspect-based sentiment analysis
2023, Knowledge-Based Systems
The purpose of aspect-based sentiment analysis (ABSA) is to identify the sentiment polarity of a given aspect of a sentence. Recent investigations have revealed that incorporating syntactic structures derived from dependency-parsing trees into graph convolutional networks (GCNs) can yield excellent performance. However, these GCN-based methods excessively rely on the quality of the dependency-parsing tree, resulting possibly in suboptimal dependencies between words. Moreover, these GCN-based models fail to adapt properly to informal and complex comments without syntactic dependencies. To alleviate these deficiencies, we proposed a target-based GCN with semantic and syntactic information (TSGCN). In a TSGCN, a new target generation (NTG) module with a dependency attention mechanism is designed to generate a new target representation using explicit semantic information to replace a given aspect. Then, the syntactic structure is reconstructed based on the new target representation to capture the shortest distance between the given aspect and viewpoint words. Finally, the semantic structure generated by the self-attention mechanism was injected into the syntactic structure to complement the semantic dependencies between words. The experimental findings on five benchmark datasets indicated that the TSGCN outperformed the other baseline models.
TraceNet: Tracing and locating the key elements in sentiment analysis
2023, Knowledge-Based Systems
We study sentiment analysis task where the outcomes are mainly contributed by a few key elements of the inputs. Motivated by the two-streams hypothesis, we explore processing input items and their weights separately by developing a neural architecture, named $TraceNet$ , to address this type of task. It not only learns discriminative representations for the target task via its encoders, but also traces key elements at the same time via its locators. In $TraceNet$ , both encoders and locators are organized in a layer-wise manner, and a smoothness regularization is employed between adjacent encoder-locator combinations. Moreover, a sparsity constraint is enforced on locators for tracing purposes and items are proactively masked according to the item weights output by locators. A major advantage of $TraceNet$ is that the outcomes are easier to understand, as it identifies the key components responsible for the outcomes, making them easier to understand. The experimental results demonstrate its effectiveness in sentiment classification. Furthermore, we present case studies to showcase the interpretability of the model and conduct comprehensive analyses to highlight the impacts of each component.
Aspect-Based Sentiment Analysis with Explicit Sentiment Augmentations
2024, Proceedings of the AAAI Conference on Artificial Intelligence
Aspect-Based Sentiment Analysis with Explicit Sentiment Augmentations
2023, arXiv

View all citing articles on Scopus

¹: Contributing equally with the first author.

View full text

S3 map: Semisupervised aspect-based sentiment analysis with masked aspect prediction

Abstract

Introduction

Section snippets

Sentiment analysis

Overall framework of S3 map

Experimental settings

Results and analysis

Conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Inf. Fusion

Knowl.-Based Syst.

Neurocomputing

A survey on classification techniques for opinion mining and sentiment analysis

Artif. Intell. Rev.

A survey of sentiment analysis in social media

Knowl. Inf. Syst.

Fuzzy rule based unsupervised sentiment analysis from social media posts

Expert Syst. Appl.

Document-level sentiment classification using hybrid machine learning approach

Knowl. Inf. Syst.

Sentiment analysis using deep learning approaches: An overview

Sci. China Inf. Sci.

Sentiment analysis using deep learning architectures: A review

Artif. Intell. Rev.

Modeling sentiment dependencies with graph convolutional networks for aspect-level sentiment classification

Knowl.-Based Syst.

The biases of pre-trained language models: An empirical study on prompt-based sentiment analysis and emotion detection

IEEE Trans. Affect. Comput.

S $^{3}$ map: Semisupervised aspect-based sentiment analysis with masked aspect prediction

Overall framework of S $^{3}$ map