Elsevier

Neurocomputing

Volume 455, 30 September 2021, Pages 47-58
Neurocomputing

MASAD: A large-scale dataset for multimodal aspect-based sentiment analysis

https://doi.org/10.1016/j.neucom.2021.05.040Get rights and content

Abstract

Aspect-based sentiment analysis has obtained great success in recent years. Most of the existing work focuses on determining the sentiment polarity of the given aspect according to the given text, while little attention has been paid to the visual information as well as multimodality content for aspect-based sentiment analysis. Multimodal content is becoming increasingly popular in mainstream online social platforms and can help better extract user sentiments toward a given aspect. There are only few studies focusing on this new task: Multimodal Aspect-based Sentiment Analysis (MASA), which performs aspect-based sentiment analysis by integrating both texts and images. In this paper, we propose a mutimodal interaction model for MASA to learn the relationship among the text, image and aspect via interaction layers and adversarial training. Additionally, we build a new large-scale dataset for this task, named MASAD, which involves seven domains and 57 aspect categories with 38 k image-text pairs. Extensive experiments have been conducted on the proposed dataset to provide several baselines for this task. Though our models obtain significant improvement for this task, empirical results show that MASA is more challenging than textual aspect-based sentiment analysis, which indicates that MASA remains a challenging open problem and requires further efforts.

Section snippets

Introduction and motivation

Nowadays, as a major social media platform for expressing experiences and sharing the opinion about service, products, and travel, the Internet provides extensive contents of users’ opinion and sentiment about rich topics [1]. This information is expressed in multiple formats, such as reviews, tags, browser behavior, and shared media objects. The analysis of such information plays an essential role in the area of opinion mining, affective computing, and sentiment analysis. It can predict human

Related work

We divide the relevant work into two groups: aspect-based sentiment analysis and multimodal sentiment analysis. There are a large number of studies on the topic of aspect-based sentiment classification [4], [5], [7], [8], [12] and multimodal sentiment analysis [2], [13]. Here we mainly review the work that is most related to our research.

Task description

In this paper, we focus on the new task, namely multimodal aspect-level sentiment analysis (MASA), which aims to perform the aspect-based sentiment analysis [3], [5] based the textual and visual information. This task consists of two subtasks: Aspect Extraction (AE) and Aspect Polarity Prediction (AP). The goal of AE is to extract the aspect (e.g., dog, car) in the text-image pair. Then AP aims to judge the sentiment polarity for the given aspect via the text-image pair. Formally, the

Data collection

We collect and label our dataset based on the publicly available Visual Sentiment Ontology (VSO) dataset2 [44] and Multilingual Visual Sentiment Ontology (MVSO) dataset3 [2], which are the largest available datasets for visual sentiment analysis. VSO dataset was collected from Flickr.4 We select Flickr since there is some existing work of multimedia research using it [44], [2], and to be specific,

Methods

In this section, we aim to find a straightforward architecture that provides good performance for the MASA task. In particular, we propose a multimodal aspect extraction (MMAE) method and a multimodal aspect polarity prediction (MMAP) method for aspect extraction (AE) and aspect polarity prediction (AP) respectively. The frameworks of these models are shown in Fig. 3. We first use text encoder and image encoder to extract the text and image representation via the state-of-the-art textual and

Experimental results and analysis

In this section, we conduct extensive experimental results on two subtasks, AE and AP on our MASAD dataset to evaluate our proposed methods and provide several baselines for this task. We first present the implementation details of our models. Then we discuss the experimental results of AE and AP. To be specific, we compare the results using text only, visual only, and their combination over seven domains of MASAD with various metrics. In addition, to verify the effectiveness of our model, we

Conclusion and future work

In this paper, we focus on the new task named multimodal aspect-based sentiment analysis (MASA), which performs aspect-based sentiment analysis based on the multimodal content on social media platforms. We propose a multimodal interaction model for this task, which learns the interaction between the aspect, text, and image effectively. Then we present and release a large-scale multimodal aspect-based sentiment analysis dataset named MASAD. The MASAD involves 57 aspects with seven domains and

CRediT authorship contribution statement

Jie Zhou: Writing - original draft, Conceptualization, Methodology, Software. Jiabao Zhao: Data curation, Methodology, Software. Jimmy Xiangji Huang: Visualization, Investigation, Supervision. Qinmin Vivian Hu: Methodology, Investigation. Liang He: Methodology, Validation, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

We greatly appreciate anonymous reviewers and the associate editor for their valuable and high quality comments that greatly helped to improve the quality of this article. This research is funded by the Science and Technology Commission of Shanghai Municipality (19511120200). This research is also supported by the Natural Sciences and Engineering Research Council (NSERC) of Canada and the York Research Chairs (YRC) program. Jie Zhou and Jiabao Zhao are first co-authors with equal contribution.

Jie Zhou is currently working toward the PhD degree at the Department of Computer Science and Technology, East China Normal University, China. His research interest includes sentiment analysis, aspect-level sentiment classification, retrieval model, and neural networks. He was awarded a scholarship from the China Scholarship Council, and received Top-3 in the KDD CUP Competition several times. Since 2016, he has published more than 10 referred papers in international conferences or journals,

References (54)

  • M. Pontiki et al.

    Semeval-2015 task 12: Aspect based sentiment analysis

    Proceedings of SemEval

    (2015)
  • M. Pontiki et al.

    Semeval-2016 task 5: Aspect based sentiment analysis

    Proceedings of SemEval

    (2016)
  • M. Saeidi et al.

    Targeted aspect based sentiment analysis dataset for urban neighbourhoods

    Proceedings of COLING

    (2016)
  • N. Xu et al.

    Multi-interactive memory network for aspect based multimodal sentiment analysis

    (2019)
  • J. Yu et al.

    Adapting BERT for target-oriented multimodal sentiment classification

    (2019)
  • J. Zhou et al.

    Modeling multi-aspect relationship with joint learning for aspect-level sentiment classification

  • S. Poria et al.

    Context-dependent sentiment analysis in user-generated videos

    Proceedings of ACL

    (2017)
  • J. Zhou et al.

    A sentiment-aware pre-trained model for cross-domain sentiment analysis

  • S. Kiritchenko et al.

    Nrc-canada-2014 Detecting aspects and sentiment in customer reviews

    Proceedings of SemEval

    (2014)
  • D.-T. Vo et al.

    Target-dependent twitter sentiment classification with rich automatic features

    Proceedings of IJCAI

    (2015)
  • D. Tang et al.

    Effective lstms for target-dependent sentiment classification

    Proceedings of COLING

    (2016)
  • R. He et al.

    Exploiting document knowledge for aspect-level sentiment classification

    Proceedings of ACL

    (2018)
  • X. Li et al.

    Deep multi-task learning for aspect term extraction with memory interaction

    Proceedings of EMNLP

    (2017)
  • C. Fan et al.

    Convolution-based memory network for aspect-based sentiment analysis

  • J. Liu, Y. Zhang, Attention modeling for targeted sentiment, in: Proceedings of EACL, volume 2, 2017, pp....
  • S. Wang et al.

    Target-sensitive memory networks for aspect sentiment classification

    Proceedings of ACL

    (2018)
  • W. Xue, T. Li, Aspect based sentiment analysis with gated convolutional networks, in: Proceedings of ACL, Association...
  • Cited by (36)

    View all citing articles on Scopus

    Jie Zhou is currently working toward the PhD degree at the Department of Computer Science and Technology, East China Normal University, China. His research interest includes sentiment analysis, aspect-level sentiment classification, retrieval model, and neural networks. He was awarded a scholarship from the China Scholarship Council, and received Top-3 in the KDD CUP Competition several times. Since 2016, he has published more than 10 referred papers in international conferences or journals, such as AAAI, Information Sciences, DASFAA, ICME.

    Jiabao Zhao is currently working toward the PhD degree at the Department of Computer Science and Technology, East China Normal University, China. Her research interest includes few-shot learning, meta-learning, and neural networks. Since 2017, she has published several referred papers in international conferences or journals, such as ICME.

    Jimmy Xiangji Huang received the Ph.D degree in information science from the City, University of London and was then a post doctoral fellow in the School of Computer Science, University of Waterloo, Canada. He is now a York Research Chair professor and the director of Information Retrieval & Knowledge Management Research Lab (IRLab), York University. He joined York University as an assistant professor, in 2003. He was awarded tenure and promoted to full professor in 2006 and 2011 respectively. He received the Deans Award for Outstanding Research in 2006, an Early Researcher Award, formerly the Premiers Research Excellence Award in 2007, the Petro Canada Young Innovators Award in 2008, the SHARCNET Research Fellowship Award in 2009, the Best Paper Award at the 32nd European Conference on Information Retrieval (ECIR 2010), United Kingdom, and LA&PS Award for Distinction in Research, Creativity, and Scholarship (established researcher) in 2015. Since 2003, he has published more than 230 refereed papers in journals (such as the ACM Transactions on Information Systems, the Journal of American Society for Information Science and Technology, Information Processing & Management, the IEEE Transactions on Knowledge and Data Engineering, Information Sciences, Information Retrieval, BMC Bioinformatics, BMC Genomics and BMC Medical Genomics), book chapters, and international conference proceedings (such as ACM SIGIR, ACM CIKM, KDD, ACL, COLING, IEEE ICDM, IJCAI and AAAI). He is a senior member of the IEEE & ACM Distinguished Scientist.

    Qinmin Vivian Hu received her Ph.D. degree in Computer Science from York University, Toronto, Canada. She is now an Assistant Professor in the Department of Computer Science, Ryerson University, Toronto, Canada. Before that, she was an Associate Professor at East China Normal University, Shanghai, China, and a Postdoctoral Fellow at the MRI research facility, the Wayne State University, USA. She won the National NSERC Postdoctoral Fellowship as one of best Ph.D fellows in Canada in 2013. She has published more than 30 referred papers in top tier journals (i.e. IEEE Transactions on Knowledge and Data Engineering, ACM Transactions on Intelligent Systems and Technology) and conferences (i.e. AAAI, ACM SIGIR).

    Liang He received his bachelor’s degree and PhD degree from the Department of Computer Science and Technology, East China Normal University, China. He is now a professor and the director of the Department of Computer Science and Technology, East China Normal University. His current research interest includes knowledge processing, user behavior analysis, and context-aware computing. He has been awarded the Star of the Talent in Shanghai. He is also a council member of the Shanghai Computer Society, a member of the Academic Committee, the director of the technical committee of Shanghai Engineering Research Center of Intelligent Service Robot, and a technology foresight expert of the Shanghai Science and Technology in focus areas. He has received the Shanghai Science and Technology Progress Award for 5 times, and won the first prize in 2013 and the second prize in 2015. He has obtained more than 10 patents, and published 2 monographs and more than 70 refereed papers in national and international journals and conference proceedings.

    View full text