MCAN: multimodal cross-aware network for fake news detection by extracting semantic-physical feature consistency

Zhang, Yaozeng; Ma, Jing; Jia, Yuguang

doi:10.1007/s11227-024-06815-1

MCAN: multimodal cross-aware network for fake news detection by extracting semantic-physical feature consistency

Published: 16 December 2024

Volume 81, article number 299, (2025)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

274 Accesses
Explore all metrics

Abstract

Social platforms are vital for information dissemination but also contribute to the spread of fake news, causing confusion and misinformation. To combat this, advancements in detection technology are crucial, particularly for posts that combine text and images, as they often present misleading information. However, current research often overlooks the extraction of key features from both modalities, missing critical elements like writing styles and image manipulations, hampering detection accuracy. In response, this work introduces the MCAN (Multimodal Cross-Aware Network), which freezes the parameters of BERT and ResNet50 to extract semantic features from text and images. It includes a text vocabulary network to analyze writing style differences and employs error level analysis to detect image manipulations. By integrating these features through a flexible multimodal fusion subnetwork with Bimodal Cross-Attention Blocks, MCAN effectively identifies fake news. Experimental results on two popular datasets demonstrate that MCAN outperforms existing baseline models in predictive accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fake News Detection Using Heterogeneous Information from Multimedia Content

An Auxiliary Modality Based Text-Image Matching Methodology for Fake News Detection

An image and text-based multimodal model for detecting fake news in OSN’s

Article 30 November 2022

Data availability

The program can be found on GitHub: https://github.com/yaozengzhang/MCAN.git.

References

Allcott H, Gentzkow M (2017) Social media and fake news in the 2016 election. J Econ Perspect 31(2):211–236
Article Google Scholar
Meel P, Vishwakarma DK (2020) Fake news, rumor, information pollution in social media and web: a contemporary survey of state-of-the-arts, challenges and opportunities. Expert Syst Appl 153:112986
Article Google Scholar
Jin Z, Cao J, Zhang Y et al (2016) Novel visual and statistical image features for microblogs news verification. IEEE Trans Multimedia 19(3):598–608
Article Google Scholar
Sadeghi F, Bidgoly AJ, Amirkhani H (2022) Fake news detection on social media using a natural language inference approach. Multimedia Tools Appl 81(23):33801–33821
Article Google Scholar
Mallik A, Kumar S (2024) Word2Vec and LSTM based deep learning technique for context-free fake news detection. Multimedia Tools Appl 83(1):919–940
Article Google Scholar
Ma J, Gao W, Mitra P, et al. (2016) Detecting rumors from microblogs with recurrent neural networks. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI 2016). 3818–3824.
Huang Z, Xiao X, Cao X (Eds.) (2017) Databases Theory and Applications: 28th Australasian Database Conference, ADC 2017; Brisbane, QLD, Australia, September 25–28, 2017; Proceedings, Springer
Cui L, Wang S, Lee D (2019) SAME: sentiment-aware multi-modal embedding for detecting fake news. In: Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. New York, NY, USA: Association for Computing Machinery, pp. 41–48.
Zeng J, Zhang Y, Ma X (2021) Fake news detection for epidemic emergencies via deep correlations between text and images. Sustain Cities Soc 66:102652
Article Google Scholar
Giachanou A, Zhang G, Rosso P (2020) Multimodal multi-image fake news detection. In: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA). pp 647–654
Qian S, Wang J, Hu J, et al. (2021) Hierarchical multi-modal contextual attention network for fake news detection. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY, USA: Association for Computing Machinery, pp 153–162
Ma J, Gao W, Wong K-F (2019) Detect rumors on twitter by promoting information campaigns with generative adversarial learning. In: The World Wide Web Conference. New York, NY, USA: Association for Computing Machinery, pp 3049–3055
Xue J, Wang Y, Tian Y et al (2021) Detecting fake news by exploring the consistency of multimodal data. Inf Process Manage 58(5):102610
Article Google Scholar
Alkhodair SA, Ding SHH, Fung BCM et al (2020) Detecting breaking news rumors of emerging topics in social media. Inf Process Manage 57(2):102018
Article Google Scholar
Zhang X, Cao J, Li X, et al. (2021) Mining dual emotion for fake news detection. In: Proceedings of the Web Conference 2021. New York, NY, USA: Association for Computing Machinery, pp 3465–3476
Yang Z, Wang C, Zhang F, et al. (2015) Emerging rumor identification for social media with hot topic detection. In: 2015 12th Web Information System and Application Conference (WISA). pp 53–58
Rubin VL, Chen Y, Conroy NK (2015) Deception detection for news: three types of fakes. Proc Associat Inf Sci Technol 52(1):1–4
Article Google Scholar
Ruchansky N, Seo S, Liu Y (2021) CSI: a hybrid deep model for fake news detection. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. New York, NY, USA: Association for Computing Machinery, pp 797–806
Zhang C, Gupta A, Kauten C et al (2019) Detecting fake news for reducing misinformation risks using analytics approaches. Eur J Oper Res 279(3):1036–1052
Article Google Scholar
Chen T, Li X, Yin H et al (2018) Call attention to rumors: deep attention based recurrent neural networks for early rumor detection. In: Ganji M, Rashidi L, Fung BCM et al (eds) Trends and applications in knowledge discovery and data mining. Springer International Publishing, Cham, pp 40–52
Chapter Google Scholar
Yu F, Liu Q, Wu S, et al. (2017) A Convolutional Approach for Misinformation Identification, Twenty-Sixth International Joint Conference on Artificial Intelligence. https://doi.org/10.24963/ijcai.2017/545.
Kausar N, AliKhan A, Sattar M (2022) Towards better representation learning using hybrid deep learning model for fake news detection. Soc Netw Anal Min 12(1):165
Article Google Scholar
Taher Y, Moussaoui A, Moussaoui F (2022) Automatic fake news detection based on deep learning, FasTtext and news title. Int J Adv Comput Sci Appl, 13(1)
Wang Y, Wang L, Yang Y et al (2022) Detecting fake news by enhanced text representation with multi-EDU-structure awareness. Expert Syst Appl 206:117781
Article Google Scholar
Qi P, Cao J, Yang T, et al. (2019) Exploiting multi-domain visual information for fake news detection. In: 2019 IEEE International Conference on Data Mining (ICDM). pp 518–527
Singhal S, Shah RR, Chakraborty T, et al. (2019) SpotFake: a multi-modal framework for fake news detection. In: 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM). pp 39–47
Singh VK, Ghosh I, Sonagara D (2021) Detecting fake news stories via multimodal analysis. J Am Soc Inf Sci 72(1):3–17
Google Scholar
Wang Y, Ma F, Jin Z, et al. (2018) EANN: event adversarial neural networks for multi-modal fake news detection. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York, NY, USA: Association for Computing Machinery, pp 849–857
Li S, Yao T, Li S et al (2022) Semantic-enhanced multimodal fusion network for fake news detection. Int J Intell Syst 37(12):12235–12251
Article Google Scholar
Guo Y (2023) A mutual attention based multimodal fusion for fake news detection on social network. Appl Intell 53(12):15311–15320
Article Google Scholar
Chen J, Wu Z, Yang Z et al (2022) Multimodal fusion network with contrary latent topic memory for rumor detection. IEEE Multimedia 29(1):104–113
Article Google Scholar
Jin Z, Cao J, Guo H, et al. (2017) Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In: Proceedings of the 25th ACM International Conference on Multimedia. New York, NY, USA: Association for Computing Machinery, pp 795–816
Zhou X, Wu J, Zafarani R (2020) SAFE: Similarity-Aware Multi-modal Fake News Detection. In: Lauw HW, Wong RC-W, Ntoulas A, et al. (eds.) Advances in Knowledge Discovery and Data Mining. Cham: Springer International Publishing, pp 354–367
Song C, Ning N, Zhang Y et al (2021) A multimodal fake news detection model based on crossmodal attention residual and multichannel convolutional neural networks. Inf Process Manage 58(1):102437
Article Google Scholar
Yadav A, Gaba S, Khan H, et al. (2023) Etma: Efficient transformer-based multilevel attention framework for multimodal fake news detection. IEEE Trans Comput Soc Syst
Devlin J, Chang M W, Lee K, et al. (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805
He K, Zhang X, Ren S, et al. (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 770–778
Chen X et al (2023) Identifying Cantonese rumors with discriminative feature integration in online social networks. Expert Syst Appl 215:119347
Article Google Scholar
Comito C, Caroprese L, Zumpano E (2023) Multimodal fake news detection on social media: a survey of deep learning techniques. Soc Netw Anal Min 13(1):101
Article Google Scholar
Zhang K, Cao J, Pi D (2024) A novel fine-grained rumor detection algorithm with attention mechanism. Neurocomputing 583:127595
Article Google Scholar
Yao L, Mao C, Luo Y (2019) Graph convolutional networks for text classification. Proc AAAI Conf AI 33(01):7370–7377
Google Scholar
Halliday MAK (2019) Linguistic function and literary style: an inquiry into the language of William Golding's' The Inheritors, Essays in modern stylistics. Routledge, pp 325–360
Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907
Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: Xing EP, Jebara T (eds.) Proceedings of the 31st International Conference on Machine Learning. Bejing, China: PMLR, pp 1188–1196
Vaswani A, Shazeer N, Parmar N, et al. (2017) Attention is all you need. Adv Neural Inf Process Syst, 30
Rehman MZU, Nawaz R, Ullah MS et al (2025) A context-aware attention and graph neural network-based multimodal framework for misogyny detection. Inf Process Manage 62(1):103895
Article Google Scholar
Chen C, Han D, Chang CC (2022) CAAN: context-aware attention network for visual question answering. Pattern Recogn 132:108980
Article Google Scholar
Qu L, Liu M, Cao D, Nie L, Tian Q (2020) Context-aware multi-view summarization network for image-text matching. In: Proceedings of the 28th ACM International Conference on Multimedia. New York, NY, USA: Association for Computing Machinery, pp 1047–1055
Loshchilov I, Hutter F (2017) Fixing weight decay regularization in Adam. arXiv preprint, arXiv:1711.05101.
Boididou C, Papadopoulos S, Zampoglou M, Apostolidis L, Papadopoulou O, Kompatsiaris Y (2018) Detection and visualization of misleading content on Twitter. Int J Multimedia Inf Retr 77(1):1–86
Google Scholar
Jin Z, Cao J, Guo H, Zhang Y, Luo J (2017) Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In: Proceedings of the 25th ACM International Conference on Multimedia, ACM, pp 795–816
Khattar D, Goud JS, Gupta M, et al. MVAE: Multimodal Variational Autoencoder for Fake News Detection. In: The World Wide Web Conference. New York, NY, USA: Association for Computing Machinery, pp. 2915–2921.
Antol S, Agrawal A, Lu J, et al. (2015) Vqa: Visual question answering. In: Proceedings of the IEEE International Conference on Computer Vision. pp 2425–2433
Vinyals O, Toshev A, Bengio S, et al. (2015) Show and tell: a neural image caption generator. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 3156–3164
Chen Y, Li D, Zhang P, et al. (2022) Cross-modal ambiguity learning for multimodal fake news detection. In: Proceedings of the ACM Web Conference 2022. New York, NY, USA: Association for Computing Machinery, pp 2897–2905
Yang H, Zhang J, Zhang L et al (2024) MRAN: multimodal relationship-aware attention network for fake news detection. Comput Stand Interfaces 89:103822
Article Google Scholar

Download references

Funding

This work was supported by the National Natural Science Foundation of China (72174086).

Author information

Authors and Affiliations

College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
Yaozeng Zhang & Jing Ma
School of Information Management, Nanjing University, Nanjing, 211106, China
Yuguang Jia

Authors

Yaozeng Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Jing Ma
View author publications
You can also search for this author inPubMed Google Scholar
Yuguang Jia
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Yaozeng Zhang contributed to conceptualization, methodology, original draft preparation, and software. Jing Ma contributed to supervision. Yuguang Jia contributed to validation and visualization.

Corresponding author

Correspondence to Jing Ma.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Consent to publish

The images from social networking platforms presented in the text are all sourced from publicly available datasets and have been widely cited, thus allowing their publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhang, Y., Ma, J. & Jia, Y. MCAN: multimodal cross-aware network for fake news detection by extracting semantic-physical feature consistency. J Supercomput 81, 299 (2025). https://doi.org/10.1007/s11227-024-06815-1

Download citation

Accepted: 06 December 2024
Published: 16 December 2024
DOI: https://doi.org/10.1007/s11227-024-06815-1

Keywords

Profiles

Yaozeng Zhang View author profile
Jing Ma View author profile

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

MCAN: multimodal cross-aware network for fake news detection by extracting semantic-physical feature consistency

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Fake News Detection Using Heterogeneous Information from Multimedia Content

An Auxiliary Modality Based Text-Image Matching Methodology for Fake News Detection

An image and text-based multimodal model for detecting fake news in OSN’s

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Consent to publish

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Profiles

Subscribe and save

Buy Now