Abstract
Social platforms are vital for disseminating information, but they also accelerate the spread of fake news and the confusion it causes. Countering this requires better detection technology, particularly for posts that combine text and images, which are often used to present misleading information. However, existing research often fails to extract key features from both modalities: it misses cues such as writing style and image manipulation, which hampers detection accuracy. In response, this work introduces MCAN (Multimodal Cross-Aware Network), which uses BERT and ResNet50 with frozen parameters to extract semantic features from text and images. MCAN also includes a text vocabulary network that captures differences in writing style, and it applies error level analysis to detect image manipulation. These features are integrated by a flexible multimodal fusion subnetwork built from Bimodal Cross-Attention Blocks, and the fused representation is used to identify fake news. Experimental results on two popular datasets show that MCAN outperforms existing baseline models in predictive accuracy.
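As a rough illustration of the cross-attention idea behind the fusion subnetwork, the sketch below lets text tokens attend over image regions, and image regions attend over text tokens, using scaled dot-product attention. It is not the authors' Bimodal Cross-Attention Block: the function names, the absence of learned projections, and the toy 2-dimensional features are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention: each query attends over all keys.

    Hypothetical helper for illustration; no learned projections are applied.
    """
    dim = len(keys[0])
    attended = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(dim).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in keys]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        attended.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return attended


def bimodal_cross_attention(text_feats, image_feats):
    """Run attention in both directions: text -> image and image -> text."""
    text_aware = cross_attention(text_feats, image_feats, image_feats)
    image_aware = cross_attention(image_feats, text_feats, text_feats)
    return text_aware, image_aware


# Toy features (assumed): 3 text tokens and 2 image regions, each 2-dimensional.
text_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
image_feats = [[2.0, 0.0], [0.0, 2.0]]
text_aware, image_aware = bimodal_cross_attention(text_feats, image_feats)
```

Because the softmax weights sum to one, each output row is a convex combination of the other modality's features: `text_aware` has one row per text token and `image_aware` has one row per image region.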
Data availability
The code is available on GitHub: https://github.com/yaozengzhang/MCAN.git.
Funding
This work was supported by the National Natural Science Foundation of China (72174086).
Author information
Contributions
Yaozeng Zhang contributed to conceptualization, methodology, original draft preparation, and software. Jing Ma contributed to supervision. Yuguang Jia contributed to validation and visualization.
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Consent to publish
The images from social networking platforms presented in the text are all sourced from publicly available datasets and have been widely cited, thus allowing their publication.
About this article
Cite this article
Zhang, Y., Ma, J. & Jia, Y. MCAN: multimodal cross-aware network for fake news detection by extracting semantic-physical feature consistency. J Supercomput 81, 299 (2025). https://doi.org/10.1007/s11227-024-06815-1
DOI: https://doi.org/10.1007/s11227-024-06815-1