Abstract
Currently, news content with images and texts on social media is widely spread, prompting significant interest in multi-modal fake news detection. However, existing research in this field focuses on large-scale annotated data to train models. Furthermore, data scarcity characterizes the initial stages of fake news propagation. Hence, addressing the challenge of few-shot multi-modal fake news detection becomes essential. In scenarios of limited data availability, current research inadequately utilizes the information inherent in each modality, leading to underutilization of modal information. To address the above challenges, in the paper, we propose a novel detection approach called Prompt-based Adaptive Fusion(ProAF). Specifically, to enhance the model’s comprehension of news content, we extract supplementary information from two modalities to facilitate timely guidance for model training. Then the model employs adaptive fusion to integrate the output predictions of different prompts during training, effectively enhancing the robust performance of the model. Experimental results on two datasets illustrate that our model surpasses existing methods, representing a significant advancement in few-shot multi-modal fake news detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
O’Connor, C., Murphy, M.: Going viral: doctors must tackle fake news in the COVID-19 pandemic. BMJ 369(10.1136) (2020)
Grinberg, N., Joseph, K., Friedland, L., Swire-Thompson, B., Lazer, D.: Fake news on twitter during the 2016 US presidential election. Science 363(6425), 374–378 (2019)
Yu, C., Ma, Y., An, L., Li, G.: BCMF: a bidirectional cross-modal fusion model for fake news detection. Inf. Process. Manag. 59(5), 103063 (2022)
Wang, B., Feng, Y., Xiong, X.C., Wang, Y.H., Qiang, B.H.: Multi-modal transformer using two-level visual features for fake news detection. Appl. Intell. 53(9), 10429–10443 (2023)
Bowman, S., Angeli, G., Potts, C., Manning, C.D.: A large annotated corpus for learning natural language inference. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 632–642 (2015)
Wu, Y., Zhan, P., Zhang, Y., Wang, L., Xu, Z.: Multimodal fusion with co-attention networks for fake news detection. In: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pp. 2560–2569 (2021)
Gu, Y., Han, X., Liu, Z., Huang, M.: PPT: pre-trained prompt tuning for few-shot learning. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 8410–8423 (2022)
Wu, J., Li, S., Deng, A., Xiong, M., Hooi, B.: Prompt-and-align: prompt-based social alignment for few-shot fake news detection. In: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pp. 2726–2736 (2023)
Gao, W., Ni, M., Deng, H., Zhu, X., Zeng, P., Hu, X.: Few-shot fake news detection via prompt-based tuning. J. Intell. Fuzzy Syst. (Preprint), 1–10 (2023)
Jiang, Y., Yu, X., Wang, Y., Xu, X., Song, X., Maynard, D.: Similarity-aware multimodal prompt learning for fake news detection. Inf. Sci. 647, 119446 (2023)
Qi, P., Cao, J., Yang, T., Guo, J., Li, J.: Exploiting multi-domain visual information for fake news detection. In: 2019 IEEE International Conference on Data Mining (ICDM), pp. 518–527. IEEE (2019)
Jin, Z., Cao, J., Guo, H., Zhang, Y., Luo, J.: Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In: Proceedings of the 25th ACM International Conference on Multimedia, pp. 795–816 (2017)
Khattar, D., Goud, J.S., Gupta, M., Varma, V.: MVAE: multimodal variational autoencoder for fake news detection. In: The World Wide Web Conference, pp. 2915–2921 (2019)
Kumari, R., Ekbal, A.: AMFB: attention based multimodal factorized bilinear pooling for multimodal fake news detection. Expert Syst. Appl. 184, 115412 (2021)
Zhang, H., Fang, Q., Qian, S., Xu, C.: Multi-modal knowledge-aware event memory network for social media rumor detection. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 1942–1951 (2019)
Floridi, L., Chiriatti, M.: GPT-3: its nature, scope, limits, and consequences. Mind. Mach. 30, 681–694 (2020)
Kenton, J.D.M.W.C., Toutanova, L.K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, pp. 4171–4186 (2019)
Liu, Y., et al.: RoBERTa: a robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019)
Shin, T., Razeghi, Y., Logan IV, R.L., Wallace, E., Singh, S.: Autoprompt: eliciting knowledge from language models with automatically generated prompts. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4222–4235 (2020)
Li, X.L., Liang, P.: Prefix-tuning: optimizing continuous prompts for generation. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics (2021)
Jiang, G., Liu, S., Zhao, Y., Sun, Y., Zhang, M.: Fake news detection via knowledgeable prompt learning. Inf. Process. Manag. 59(5), 103029 (2022)
Yu, Y., Zhang, D., Li, S.: Unified multi-modal pre-training for few-shot sentiment analysis with prompt-based learning. In: Proceedings of the 30th ACM International Conference on Multimedia, pp. 189–198 (2022)
Brock, A., De, S., Smith, S.L.: Characterizing signal propagation to close the performance gap in unnormalized resnets. In: International Conference on Learning Representations (2020)
Yu, Y., Zhang, D.: Few-shot multi-modal sentiment analysis with prompt-based vision-aware language modeling. In: 2022 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2022)
Mokady, R., Hertz, A., Bermano, A.H.: Clipcap: clip prefix for image captioning. arXiv preprint arXiv:2111.09734 (2021)
Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H., Neubig, G.: Pre-train, prompt, and predict: a systematic survey of prompting methods in natural language processing. ACM Comput. Surv. 55(9), 1–35 (2023)
Ferragina, P., Scaiella, U.: TagMe: on-the-fly annotation of short text fragments (by Wikipedia entities). In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 1625–1628 (2010)
Vrandečić, D., Krötzsch, M.: WikiData: a free collaborative knowledgebase. Commun. ACM 57(10), 78–85 (2014)
Chen, Y.: Convolutional neural network for sentence classification. Master’s thesis, University of Waterloo (2015)
Chung, J., Gulcehre, C., Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling. In: NIPS 2014 Workshop on Deep Learning, December 2014 (2014)
Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H.: FakeNewsnet: a data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big data 8(3), 171–188 (2020)
Xiong, S., Zhang, G., Batra, V., Xi, L., Shi, L., Liu, L.: Trimoon: two-round inconsistency-based multi-modal fusion network for fake news detection. Inf. Fusion 93, 150–158 (2023)
Liu, Z., et al.: Swin transformer v2: scaling up capacity and resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12009–12019 (2022)
Acknowledgement
This work was supported by the National Natural Science Foundation of China (No. 62376062), the Ministry of Education of Humanities and Social Science Project (No. 23YJAZH220), the Philosophy and Social Sciences 14th Five-Year Plan Project of Guangdong Province (No. GD23CTS03), and the Guangdong Basic and Applied Basic Research Foundation of China (No. 2023A1515012718).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ouyang, Q., Lin, N., Zhou, Y., Yang, A., Zhou, D. (2024). Enhancing Few-Shot Multi-modal Fake News Detection Through Adaptive Fusion. In: Zhang, W., Tung, A., Zheng, Z., Yang, Z., Wang, X., Guo, H. (eds) Web and Big Data. APWeb-WAIM 2024. Lecture Notes in Computer Science, vol 14964. Springer, Singapore. https://doi.org/10.1007/978-981-97-7241-4_27
Download citation
DOI: https://doi.org/10.1007/978-981-97-7241-4_27
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-7240-7
Online ISBN: 978-981-97-7241-4
eBook Packages: Computer ScienceComputer Science (R0)