Abstract
Multi-modal sentiment analysis of images and texts in social media has surpassed traditional text-based analysis and attracted more and more attention from researchers. Existing studies on multi-modal sentiment analysis of texts and images focus on learning each modal feature independently, which ignores the correlation between images and texts. In the field of social media, such correlation is often multi-granularity, that is, image areas are often associated with text (words, phrases, sentences) with multiple granularity. In this paper, a multi-granularity feature attention fusion network is proposed to model the correlation between image content and multi-granularity text content for multi-modal sentiment analysis. Specifically, the model proposed in this paper includes feature learning layer, interactive information fusion layer and classification layer. Image features and text features of multi-granularity can be learned in feature learning layer. In the interactive information fusion layer, multi-granularity text features and image features are interacted and fused, and the last classification layer uses the features learned last time to complete classification. The proposed model is validated on two public multimodal data sets of graphs and texts, and the experimental results show that the model is effective.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Pan, Y., Desheng, W.: Personalized Online-toOffline (O2O) service recommendation based on a novel frequent service-set network. IEEE Syst. J. 13(2), 1599–1607 (2019)
Xu, S., et al.: Venue2Vec: an efficient embedding model for fine-grained user location prediction in geo-social networks. IEEE Syst. J. 14(2), 1740–1751 (2019)
Yadav, S., et al.: Medical sentiment analysis using social media: towards building a patient assisted system. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (2018)
Dashtipour, K., et al.: Multilingual sentiment analysis: state of the art and independent comparison of techniques. Cogn. Comput. 8(4), 757–771 (2016)
Preoiuc-Pietro, D., et al.: Beyond binary labels: political ideology prediction of Twitter users. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (2017)
Nasukawa, T., Yi, J.: Sentiment analysis: capturing favorability using natural language processing. In: International Conference on Knowledge Capture DBLP (2003)
Feng, X.A., Rui, X.B.: E-commerce product review sentiment classification based on a nave Bayes continuous learning framework. Inf. Process. Manag. 57, 5 (2020)
Dragoni, M., Petrucci, G.: A neural word embeddings approach for multi-domain sentiment analysis. IEEE Trans. Affect. Comput. 1 (2017)
Siersdorfer, S., et al.: Analyzing and predicting sentiment of images on the social web. In: ACM Multimedia 2010. ACM (2010)
Borth, D., et al.: Large-scale visual sentiment ontology and detectors using adjective noun pairs. In: ACM International Conference on Multimedia ACM (2013)
Misra, A.: Image sentiment analysis using deep learning. In: 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI) IEEE (2018)
Poria, S., et al.: Convolutional MKL based multimodal emotion recognition and sentiment analysis. In: 2016 IEEE 16th International Conference on Data Mining (ICDM). IEEE (2017)
Nan, X., Mao, W., Chen, G.: A co-memory network for multimodal sentiment analysis. In: The 41st International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM (2018)
Cao, D., et al.: A cross-media public sentiment analysis system for microblog. Multimedia Syst. 22(4), 479–486 (2016)
Kumar, A., et al.: Hybrid context enriched deep learning model for fine-grained sentiment analysis in textual and visual semiotic modality social data. Inf. Process. Manag. 57(1), 102141.1–102141.25 (2020)
Xu, N., Mao, W., Chen, G.: Multi-interactive memory network for aspect based multimodal sentiment analysis. In: The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019) (2019)
Zhao, Z., et al.: An image-text consistency driven multimodal sentiment analysis approach for social media. Inf. Process. Manag. 56(6) (2019)
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Conference on Empirical Methods in Natural Language Processing (2014)
He, K., et al.: Deep Residual Learning for Image Recognition. IEEE (2016)
Teng, N., et al.: Sentiment analysis on multi-view social data. In: International Conference on Multimedia Modeling Springer International Publishing (2016)
Cai, G., Xia, B.: Convolutional Neural Networks for Multimedia Sentiment Analysis. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2015 2015. Lecture Notes in Computer Science, vol 9362. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25207-0_14
Yu, Y., et al.: Visual and textual sentiment analysis of a microblog using deep convolutional neural networks. Algorithms 9(2), 41(2016)
Xu, N.: Analyzing multimodal public sentiment based on hierarchical semantic attentional network. In: IEEE International Conference on Intelligence Security Informatics. IEEE, 152–154 (2017)
Nan, X., Mao, W.: MultiSentiNet: a deep semantic network for multimodal sentiment analysis. In: The 26th ACM International Conference on Information and Knowledge Management (CIKM) ACM (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sun, T., Wang, S., Zhong, S. (2022). Multi-granularity Feature Attention Fusion Network for Image-Text Sentiment Analysis. In: Magnenat-Thalmann, N., et al. Advances in Computer Graphics. CGI 2022. Lecture Notes in Computer Science, vol 13443. Springer, Cham. https://doi.org/10.1007/978-3-031-23473-6_1
Download citation
DOI: https://doi.org/10.1007/978-3-031-23473-6_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-23472-9
Online ISBN: 978-3-031-23473-6
eBook Packages: Computer ScienceComputer Science (R0)