
Multi-granularity Feature Attention Fusion Network for Image-Text Sentiment Analysis

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13443))

Abstract

Multi-modal sentiment analysis of images and text in social media has moved beyond traditional text-only analysis and is attracting increasing attention from researchers. Existing studies on image-text sentiment analysis focus on learning the features of each modality independently, which ignores the correlation between images and text. In social media, this correlation is often multi-granular: an image region may be associated with text at several granularities (words, phrases, sentences). This paper proposes a multi-granularity feature attention fusion network that models the correlation between image content and multi-granularity text content for multi-modal sentiment analysis. The model consists of a feature learning layer, an interactive information fusion layer, and a classification layer. The feature learning layer learns image features and text features at multiple granularities. The interactive information fusion layer lets the multi-granularity text features and the image features interact and fuses them, and the classification layer uses the resulting fused features to perform classification. The model is validated on two public multimodal image-text datasets, and the experimental results show that it is effective.
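The three-layer structure described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's equations, so everything below is an assumption made for illustration: the scaled dot-product attention, the concatenate-then-mean-pool fusion, the linear softmax classifier, the random stand-in features, and all names (`attend`, `fuse`, `classify`, `granular_text`, `regions`) are hypothetical and are not the authors' implementation.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def attend(text_vec, regions):
    """One text unit attends over all image regions (assumed scaled dot-product)."""
    scale = math.sqrt(len(text_vec))
    weights = softmax([dot(text_vec, r) / scale for r in regions])
    dim = len(regions[0])
    attended = [sum(w * r[i] for w, r in zip(weights, regions)) for i in range(dim)]
    return attended, weights


def mean_pool(vectors):
    """Average a list of equal-length vectors element-wise."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def fuse(granular_text, regions):
    """Interactive fusion (assumed form): for each granularity, pair every text
    unit with its attended image summary, pool, then concatenate granularities."""
    fused = []
    for name in ("word", "phrase", "sentence"):
        joined = []
        for t in granular_text[name]:
            attended, _ = attend(t, regions)
            joined.append(t + attended)  # list concatenation: [text ; image]
        fused.extend(mean_pool(joined))
    return fused


def classify(fused, class_weights, class_bias):
    """Classification layer (assumed linear + softmax)."""
    logits = [dot(fused, w) + b for w, b in zip(class_weights, class_bias)]
    return softmax(logits)


def rand_vecs(n, d):
    """Random stand-ins for features a real feature learning layer would produce."""
    return [[random.gauss(0, 1) for _ in range(d)] for _ in range(n)]


random.seed(0)
D, N_CLASSES = 8, 3  # hypothetical feature size and number of sentiment classes
regions = rand_vecs(4, D)  # 4 image regions
granular_text = {
    "word": rand_vecs(6, D),
    "phrase": rand_vecs(3, D),
    "sentence": rand_vecs(1, D),
}
class_weights = [[0.1 * x for x in row] for row in rand_vecs(N_CLASSES, 6 * D)]
class_bias = [0.0] * N_CLASSES

fused = fuse(granular_text, regions)
probs = classify(fused, class_weights, class_bias)
print(len(fused), probs)
```

In this sketch each granularity contributes a vector of size 2·D (text features concatenated with the attended image summary), so the fused representation has size 6·D. A trained model would replace the random vectors with learned text and image features and learn the classifier weights.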

Corresponding author

Correspondence to Shuang Wang.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Sun, T., Wang, S., Zhong, S. (2022). Multi-granularity Feature Attention Fusion Network for Image-Text Sentiment Analysis. In: Magnenat-Thalmann, N., et al. Advances in Computer Graphics. CGI 2022. Lecture Notes in Computer Science, vol 13443. Springer, Cham. https://doi.org/10.1007/978-3-031-23473-6_1

  • DOI: https://doi.org/10.1007/978-3-031-23473-6_1

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-23472-9

  • Online ISBN: 978-3-031-23473-6

  • eBook Packages: Computer Science (R0)
