skip to main content
research-article

Image–Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion

Authors Info & Claims
Published:17 June 2023Publication History
Skip Abstract Section

Abstract

Before the arrival of the web as a corpus, people detected positive and negative news based on the understanding of the textual content from physical newspaper rather than an automatic identification approach from readily available e-newspapers. Thus, the earlier sentiment analysis approach is based on unimodal data, and less effort is paid to the multimodal data. However, the presence of multimodal information helps us to get a clearer understanding of the sentiment. To the best of our knowledge, less work has been introduced on the image–text multimodal sentiment analysis framework of Assamese, a low-resource Indian language mostly spoken in the northeast part of India. We built an Assamese news articles dataset consisting of news text and associated images and one image caption to conduct an experimental study. Focusing on important words and discriminative regions of the images mostly related to sentiment, two individual unimodal such as textual and visual models are proposed. The visual model is developed using an encoder-decoder–based image caption generation system. An image–text multimodal approach is proposed to explore the internal correlation between textual and visual features for joint sentiment classification. Finally, we propose the multimodal sentiment analysis framework, i.e., Textual Visual Multimodal Fusion, by employing a late fusion scheme to merge the three different modalities for the final sentiment prediction. Experimental results conducted on the Assamese dataset built in-house demonstrate that the contextual integration of multimodal features delivers better performance than unimodal features.

REFERENCES

  1. [1] Al-Kabi Mohammed, Al-Qudah Noor M., Alsmadi Izzat, Dabour Muhammad, and Wahsheh Heider. 2013. Arabic/English sentiment analysis: An empirical study. In Proceedings of the 4th International Conference on Information and Communication Systems (ICICS’13). 2325.Google ScholarGoogle Scholar
  2. [2] Borth Damian, Ji Rongrong, Chen Tao, Breuel Thomas, and Chang Shih-Fu. 2013. Large-scale visual sentiment ontology and detectors using adjective noun pairs. In Proceedings of the 21st ACM International Conference on Multimedia. 223232.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] Campos Victor, Jou Brendan, and Nieto Xavier Giro-i. 2017. From pixels to sentiment: Fine-tuning CNNs for visual sentiment prediction. Image Vis. Comput. 65 (2017), 1522.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Cao Donglin, Ji Rongrong, Lin Dazhen, and Li Shaozi. 2016. Visual sentiment topic model based microblog image sentiment analysis. Multimedia Tools Appl. 75, 15 (2016), 89558968.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Chen Xingyue, Wang Yunhong, and Liu Qingjie. 2017. Visual and textual sentiment analysis using deep fusion convolutional neural networks. In Proceedings of the IEEE International Conference on Image Processing (ICIP’17). IEEE, 15571561.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Das Amitava and Bandyopadhyay Sivaji. 2010. Opinion-polarity identification in bengali. In Proceedings of the International Conference on Computer Processing of Oriental Languages. 169182.Google ScholarGoogle Scholar
  7. [7] Das Ringki and Singh Thoudam Doren. 2021. Image caption generation framework for assamese news using attention mechanism. In Proceedings of the 18th International Conference on Natural Language Processing (ICON’21). 231239.Google ScholarGoogle Scholar
  8. [8] Das Ringki and Singh Thoudam Doren. 2021. A step towards sentiment analysis of assamese news articles using lexical features. In Proceedings of the International Conference on Computing and Communication Systems (I3CS’20), Vol. 170. Springer, 15.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Das Ringki and Singh Thoudam Doren. 2022. Assamese news image caption generation using attention mechanism. Multimedia Tools Appl. 81, 7 (2022), 1005110069.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Das Ringki and Singh Thoudam Doren. 2022. A multi-stage multimodal framework for sentiment analysis of Assamese in low resource setting. Expert Syst. Appl. (2022), 117575.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Dhaoui Chedia, Webster Cynthia M., and Tan Lay Peng. 2017. Social media sentiment analysis: Lexicon versus machine learning. J. Cons. Market. (2017).Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Hu Minqing and Liu Bing. 2004. Mining opinion features in customer reviews. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI’04), Vol. 4. 755760.Google ScholarGoogle Scholar
  13. [13] Huang Feiran, Zhang Xiaoming, Zhao Zhonghua, Xu Jie, and Li Zhoujun. 2019. Image–text sentiment analysis via deep multimodal attentive fusion. Knowl.-Bas. Syst. 167 (2019), 2637.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Kaur Jasleen and Saini Jatinderkumar R.. 2014. A study and analysis of opinion mining research in Indo-Aryan, Dravidian and Tibeto-Burman language families. Int. J. Data Min. Emerg. Technol. 4, 2 (2014), 5360.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Kim Soo-Min and Hovy Eduard. 2004. Determining the sentiment of opinions. In Proceedings of the 20th International Conference on Computational Linguistics. Association for Computational Linguistics, 1367.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] Le Tuan Anh, Moeljadi David, Miura Yasuhide, and Ohkuma Tomoko. 2016. Sentiment analysis for low resource languages: A study on informal Indonesian tweets. In Proceedings of the 12th Workshop on Asian Language Resources (ALR12’16). 123131.Google ScholarGoogle Scholar
  17. [17] Meetei Loitongbam Sanayai, Singh Thoudam Doren, Borgohain Samir Kumar, and Bandyopadhyay Sivaji. 2021. Low resource language specific pre-processing and features for sentiment analysis task. Lang. Resourc. Eval. (2021), 123.Google ScholarGoogle Scholar
  18. [18] Mehmood Khawar, Essam Daryl, Shafi Kamran, and Malik Muhammad Kamran. 2019. Sentiment analysis for a resource poor language–Roman Urdu. ACM Trans. Asian Low-Resour. Lang. Inf. Proc. 19, 1 (2019), 115.Google ScholarGoogle Scholar
  19. [19] Neethu M. S. and Rajasree R.. 2013. Sentiment analysis in twitter using machine learning techniques. In Proceedings of the 4th International Conference on Computing, Communications and Networking Technologies (ICCCNT’13). IEEE, 15.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Ortis Alessandro, Farinella Giovanni Maria, Torrisi Giovanni, and Battiato Sebastiano. 2020. Exploiting objective text description of images for visual sentiment analysis. Multimedia Tools Appl. (2020), 124.Google ScholarGoogle Scholar
  21. [21] Pang Bo and Lee Lillian. 2004. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 271.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Pang Bo and Lee Lillian. 2005. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 115124.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Pang Bo, Lee Lillian, and Vaithyanathan Shivakumar. 2002. Thumbs up?: Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing-Volume 10. Association for Computational Linguistics, 7986.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Qian Chen, Ragusa Edoardo, Chaturvedi Iti, Cambria Erik, and Zunino Rodolfo. Text-image sentiment analysis.Google ScholarGoogle Scholar
  25. [25] Rani Sujata and Kumar Parteek. 2019. A journey of Indian languages over sentiment analysis: A systematic review. Artif. Intell. Rev. 52, 2 (2019), 14151462.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Saharia Navanath, Das Dhrubajyoti, Sharma Utpal, and Kalita Jugal. 2009. Part of speech tagger for Assamese text. In Proceedings of the ACL-IJCNLP Conference Short Papers. 3336.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Sarkar Kamal and Bhowmick Mandira. 2017. Sentiment polarity detection in bengali tweets using multinomial Naïve Bayes and support vector machines. In Proceedings of the IEEE Calcutta Conference (CALCON’17). IEEE, 3136.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Simonyan Karen and Zisserman Andrew. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556. Retrieved from https://arxiv.org/abs/1409.1556.Google ScholarGoogle Scholar
  29. [29] Singh Alok, Meetei Loitongbam Sanayai, Singh Salam Michael, Singh Thoudam Doren, and Bandyopadhyay Sivaji. 2021. An efficient keyframes selection based framework for video captioning. In Proceedings of the 18th International Conference on Natural Language Processing (ICON’21). 240250.Google ScholarGoogle Scholar
  30. [30] Singh Thoudam Doren, Singh Telem Joyson, Shadang Mirinso, and Thokchom Surmila. 2021. Review comments of manipuri online video: Good, bad or ugly. In Proceedings of the International Conference on Computing and Communication Systems (I3CS’20), Vol. 170. Springer, 45.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Song Kaikai, Yao Ting, Ling Qiang, and Mei Tao. 2018. Boosting image sentiment analysis with visual attention. Neurocomputing 312 (2018), 218228.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Vinyals Oriol, Toshev Alexander, Bengio Samy, and Erhan Dumitru. 2015. Show and tell: A neural image caption generator. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 31563164.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Wang Jingwen, Fu Jianlong, Xu Yong, and Mei Tao. 2016. Beyond object recognition: Visual sentiment analysis with deep coupled adjective and noun neural networks. In Proceedings of the Internationa Joint Conference on Artificial Intelligence (IJCAI’16). 34843490.Google ScholarGoogle Scholar
  34. [34] Wankhade Mayur, Rao Annavarapu Chandra Sekhara, and Kulkarni Chaitanya. 2022. A survey on sentiment analysis methods, applications, and challenges. Artif. Intell. Rev. (2022), 150.Google ScholarGoogle Scholar
  35. [35] Xu C., Cetintas S., Lee K. C., and Li L. J.. [n.d.]. Visual sentiment prediction with deep convolutional neural networks. arXiv:1411.5731. Retrieved from https://arixv.org/abs/1411.5731.Google ScholarGoogle Scholar
  36. [36] Yao Xingxu, She Dongyu, Zhang Haiwei, Yang Jufeng, Cheng Ming-Ming, and Wang Liang. 2020. Adaptive deep metric learning for affective image retrieval and classification. IEEE Trans. Multimedia (2020).Google ScholarGoogle Scholar
  37. [37] You Quanzeng, Cao Liangliang, Jin Hailin, and Luo Jiebo. 2016. Robust visual-textual sentiment analysis: When attention meets tree-structured recursive neural networks. In Proceedings of the 24th ACM International Conference on Multimedia. 10081017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] You Quanzeng, Jin Hailin, and Luo Jiebo. 2017. Visual sentiment analysis by attending on local image regions. In Proceedings of the 31st AAAI Conference on Artificial Intelligence.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] You Quanzeng, Luo Jiebo, Jin Hailin, and Yang Jianchao. 2015. Joint visual-textual sentiment analysis with deep neural networks. In Proceedings of the 23rd ACM International Conference on Multimedia. 10711074.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Yuan Jianbo, McDonough Sean, You Quanzeng, and Luo Jiebo. 2013. Sentribute: Image sentiment analysis from a mid-level perspective. In Proceedings of the 2nd International Workshop on Issues of Sentiment Discovery and Opinion Mining. 18.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Zhang Yaowen, Shang Lin, and Jia Xiuyi. 2015. Sentiment analysis on microblogging by integrating text and image features. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 5263.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Zhao Ziyuan, Zhu Huiying, Xue Zehao, Liu Zhao, Tian Jing, Chua Matthew Chin Heng, and Liu Maofu. 2019. An image-text consistency driven multimodal sentiment analysis approach for social media. Inf. Process. Manage. 56, 6 (2019), 102097.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Zitouni Imed and Florian Radu. 2008. Mention detection crossing the language barrier. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 600609.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Image–Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 6
      June 2023
      635 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3604597
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 June 2023
      • Online AM: 17 February 2023
      • Accepted: 3 February 2023
      • Revised: 26 August 2022
      • Received: 1 September 2021
      Published in tallip Volume 22, Issue 6

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text