skip to main content
10.1145/3123266.3123454acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs

Published: 19 October 2017 Publication History

Abstract

Microblogs have become popular media for news propagation in recent years. Meanwhile, numerous rumors and fake news also bloom and spread wildly on the open social media platforms. Without verification, they could seriously jeopardize the credibility of microblogs. We observe that an increasing number of users are using images and videos to post news in addition to texts. Tweets or microblogs are commonly composed of text, image and social context. In this paper, we propose a novel Recurrent Neural Network with an attention mechanism (att-RNN) to fuse multimodal features for effective rumor detection. In this end-to-end network, image features are incorporated into the joint features of text and social context, which are obtained with an LSTM (Long-Short Term Memory) network, to produce a reliable fused classification. The neural attention from the outputs of the LSTM is utilized when fusing with the visual features. Extensive experiments are conducted on two multimedia rumor datasets collected from Weibo and Twitter. The results demonstrate the effectiveness of the proposed end-to-end att-RNN in detecting rumors with multimodal contents.

References

[1]
Gordon W Allport and Leo Postman. 1947. The psychology of rumor. (1947).
[2]
Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C Lawrence Zitnick, and Devi Parikh. 2015. Vqa: Visual question answering. In Proceedings of the IEEE International Conference on Computer Vision. 2425--2433.
[3]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).
[4]
Yoshua Bengio, Patrice Simard, and Paolo Frasconi. 1994. Learning long-term dependencies with gradient descent is difficult. IEEE transactions on neural networks Vol. 5, 2 (1994), 157--166.
[5]
Christina Boididou, Katerina Andreadou, Symeon Papadopoulos, Duc-Tien Dang-Nguyen, Giulia Boato, Michael Riegler, and Yiannis Kompatsiaris. 2015. Verifying Multimedia Use at MediaEval 2015. In MediaEval 2015 Workshop, Sept. 14--15, 2015, Wurzen, Germany.
[6]
Christina Boididou, Symeon Papadopoulos, and Yiannis Kompatsiaris. 2014. Challenges of computational verification in social multimedia Proceedings of the Companion Publication of the 23rd International Conference on World Wide Web Companion. 743--748.
[7]
Carlos Castillo, Marcelo Mendoza, and Barbara Poblete. 2011. Information credibility on twitter. In Proceedings of the 20th international conference on World Wide Web (WWW). ACM, 675--684.
[8]
Felix A Gers, Nicol N Schraudolph, and Jürgen Schmidhuber. 2002. Learning precise timing with LSTM recurrent networks. Journal of machine learning research Vol. 3, Aug (2002), 115--143.
[9]
Aditi Gupta, Hemank Lamba, Ponnurangam Kumaraguru, and Anupam Joshi. 2013. Faking Sandy: characterizing and identifying fake images on Twitter during Hurricane Sandy Proceedings of the 22nd international conference on World Wide Web companion. International World Wide Web Conferences Steering Committee, 729--736.
[10]
Manish Gupta, Peixiang Zhao, and Jiawei Han. 2012. Evaluating Event Credibility on Twitter. In Proceedings of the SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, 153.
[11]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780.
[12]
Zhiwei Jin, Juan Cao, Han Guo, Yongdong Zhang, Yu Wang, and Jiebo Luo. 2017. Detection and Analysis of 2016 US Presidential Election Related Rumors on Twitter Social, Cultural, and Behavioral Modeling - 10th International Conference, SBP-BRiMS 2017, Washington, DC, USA, July 5-8, 2017, Proceedings. 14--24.
[13]
Zhiwei Jin, Juan Cao, Yu-Gang Jiang, and Yongdong Zhang. 2014. News credibility evaluation on microblog with a hierarchical propagation model 2014 IEEE International Conference on Data Mining (ICDM). IEEE, 230--239.
[14]
Zhiwei Jin, Juan Cao, Yongdong Zhang, and Jiebo Luo. 2016. News Verification by Exploiting Conflicting Social Viewpoints in Microblogs Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17, 2016, Phoenix, Arizona, USA.
[15]
Zhiwei Jin, Juan Cao, Yazi Zhang, and Zhang Yongdong. 2015. MCG-ICT at MediaEval 2015: Verifying Multimedia Use with a Two-Level Classification Model Proceedings of the MediaEval 2015 Multimedia Benchmark Workshop.
[16]
Zhiwei Jin, Juan Cao, Yongdong Zhang, Jianshe Zhou, and Qi Tian. 2017. Novel Visual and Statistical Image Features for Microblogs News Verification. IEEE Trans. Multimedia Vol. 19, 3 (2017), 598--608.
[17]
Andrej Karpathy and Li Fei-Fei. 2015. Deep visual-semantic alignments for generating image descriptions Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3128--3137.
[18]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks Advances in neural information processing systems. 1097--1105.
[19]
Sejeong Kwon, Meeyoung Cha, Kyomin Jung, Wei Chen, and Yajun Wang. 2013. Prominent features of rumor propagation in online social media Data Mining (ICDM), 2013 IEEE 13th International Conference on. IEEE, 1103--1108.
[20]
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang. 2017. Person Search with Natural Language Description. arXiv preprint arXiv:1702.05729 (2017).
[21]
Jing Ma, Wei Gao, Prasenjit Mitra, Sejeong Kwon, Bernard J Jansen, Kam-Fai Wong, and Meeyoung Cha. 2016. Detecting rumors from microblogs with recurrent neural networks Proceedings of IJCAI.
[22]
Tomas Mikolov, Martin Karafiát, Lukas Burget, Jan Cernockỳ, and Sanjeev Khudanpur. 2010. Recurrent neural network based language model. In Interspeech, Vol. Vol. 2. 3.
[23]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality Advances in neural information processing systems. 3111--3119.
[24]
Meredith Ringel Morris, Scott Counts, Asta Roseway, Aaron Hoff, and Julia Schwarz. 2012. Tweeting is believing?: understanding microblog credibility perceptions Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work. ACM, 441--450.
[25]
Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. 2013. On the difficulty of training recurrent neural networks. ICML (3) Vol. 28 (2013), 1310--1318.
[26]
Pierre Sermanet, David Eigen, Xiang Zhang, Michaël Mathieu, Rob Fergus, and Yann LeCun. 2013. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229 (2013).
[27]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[28]
M. Slaney and M. Casey. 2008. Locality-Sensitive Hashing for Finding Nearest Neighbors. IEEE Signal Processing Magazine Vol. 25, 2 (March. 2008), 128--131.
[29]
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks Advances in neural information processing systems. 3104--3112.
[30]
Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2015. Show and tell: A neural image caption generator. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3156--3164.
[31]
Ke Wu, Song Yang, and Kenny Q Zhu. 2015. False rumors detection on sina weibo by propagation structures IEEE International Conference on Data Engineering, ICDE.
[32]
Quanzeng You, Liangliang Cao, Hailin Jin, and Jiebo Luo. 2016. Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, 1008--1017.

Cited By

View all
  • (2025)An Enhanced Misinformation Detection Model Based on an Improved Beluga Whale Optimization Algorithm and Cross-Modal Feature FusionBiomimetics10.3390/biomimetics1003012810:3(128)Online publication date: 20-Feb-2025
  • (2025)Veracity‐Oriented Context‐Aware Large Language Models–Based Prompting Optimization for Fake News DetectionInternational Journal of Intelligent Systems10.1155/int/59201422025:1Online publication date: 15-Jan-2025
  • (2025)MHR: A Multi-Modal Hyperbolic Representation Framework for Fake News DetectionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2025.352895137:4(2015-2028)Online publication date: Apr-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '17: Proceedings of the 25th ACM international conference on Multimedia
October 2017
2028 pages
ISBN:9781450349062
DOI:10.1145/3123266
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. attention mechanism
  2. lstm
  3. microblog
  4. multimodal fusion
  5. rumor detection

Qualifiers

  • Research-article

Funding Sources

  • National Key Research and Development Program of China
  • Beijing Advanced Innovation Center for Imaging Technology
  • National Natural Science Foundation of China

Conference

MM '17
Sponsor:
MM '17: ACM Multimedia Conference
October 23 - 27, 2017
California, Mountain View, USA

Acceptance Rates

MM '17 Paper Acceptance Rate 189 of 684 submissions, 28%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)949
  • Downloads (Last 6 weeks)87
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)An Enhanced Misinformation Detection Model Based on an Improved Beluga Whale Optimization Algorithm and Cross-Modal Feature FusionBiomimetics10.3390/biomimetics1003012810:3(128)Online publication date: 20-Feb-2025
  • (2025)Veracity‐Oriented Context‐Aware Large Language Models–Based Prompting Optimization for Fake News DetectionInternational Journal of Intelligent Systems10.1155/int/59201422025:1Online publication date: 15-Jan-2025
  • (2025)MHR: A Multi-Modal Hyperbolic Representation Framework for Fake News DetectionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2025.352895137:4(2015-2028)Online publication date: Apr-2025
  • (2025)Early Detection of Multimodal Fake News via Reinforced Propagation Path GenerationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.349670137:2(613-625)Online publication date: Feb-2025
  • (2025)Under the Influence: A Survey of Large Language Models in Fake News DetectionIEEE Transactions on Artificial Intelligence10.1109/TAI.2024.34717356:2(458-476)Online publication date: Feb-2025
  • (2025)SmoothDectector: A Smoothed Dirichlet Multimodal Approach for Combating Fake News on Social MediaIEEE Access10.1109/ACCESS.2025.354687613(39289-39305)Online publication date: 2025
  • (2025)A Comprehensive Survey of Fake Text Detection on Misinformation and LM-Generated TextsIEEE Access10.1109/ACCESS.2025.353880513(25301-25324)Online publication date: 2025
  • (2025)Balanced Multi-modal Learning with Hierarchical Fusion for Fake News DetectionPattern Recognition10.1016/j.patcog.2025.111485164(111485)Online publication date: Aug-2025
  • (2025)A unified multimodal classification framework based on deep metric learningNeural Networks10.1016/j.neunet.2024.106747181(106747)Online publication date: Jan-2025
  • (2025)DCCMA-Net: Disentanglement-based cross-modal clues mining and aggregation network for explainable multimodal fake news detectionInformation Processing & Management10.1016/j.ipm.2025.10408962:4(104089)Online publication date: Jul-2025
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media