Abstract
Fake news in social networks causes disastrous effects on the real world yet effectively detecting newly emerged fake news remains difficult. This problem is particularly pronounced when the testing samples (target domain) are derived from different topics, events, platforms or time periods from the training dataset (source domains). Though efforts have focused on learning domain-invariant features (DIF) across multiple source domains to transfer universal knowledge from the source to the target domain, they ignore the complexity that arises when the number of source domains increases, resulting in unreliable DIF. In this paper, we first point out two challenges faced by learning DIF for fake news detection, (1) high intra-domain correlations, caused by the similarity of news samples within the same domain but different categories can be higher than that in different domains but the same categories, and (2) complex inter-domain correlations, stemming from that news samples in different domains are semantically related. To tackle these challenges, we propose two modules, center-aware feature alignment and likelihood gain-based feature disentanglement, to enhance the multiple domains alignment while enforcing two categories separated and disentangle the domain-specific features in an adversarial supervision manner. By combining these modules, we conduct a label-irrelevant multi-domain feature alignment (LIMFA) framework. Our experiments show that LIMFA can be deployed with various base models and it outperforms the state-of-the-art baselines in 4 cross-domain scenarios. Our source codes will be available upon the acceptance of this manuscript.
Similar content being viewed by others
Data availability
All data, models and code generated or used during this study are available at https://github.com/TAN-OpenLab.
References
Bian T, Xiao X, Xu T et al (2020) Rumor detection on social media with bi-directional graph convolutional networks. In: Proceedings of the AAAI conference on artificial intelligence, pp 549–556, https://doi.org/10.1609/aaai.v34i01.5393, arxiv:2001.06362
Castelo S, Santos A, Almeida T et al (2019) A topic-agnostic approach for identifying fake news pages. In: Proceedings of the web conference 2019—companion of the world wide web conference, WWW 2019, pp 975–980, https://doi.org/10.1145/3308560.3316739, arxiv:1905.00957
Chen X, Zhou F, Trajcevski G et al (2022) Multi-view learning with distinguishable feature fusion for rumor detection. Knowl Based Syst 240:108085. https://doi.org/10.1016/j.knosys.2021.108085
Chen Y, Hu L, Sui J et al (2019) Attention-residual network with CNN for rumor detection. In: Proceedings of the international conference on information and knowledge management, pp 1121–1130, https://doi.org/10.1145/3357384.3357950
Dai SC, Hsu YL, Xiong A et al (2022) Ask to know more: generating counterfactual explanations for fake claims. In: Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining (KDD’22), vol 1. ACM, New York, NY, USA, pp 2800–2810, https://doi.org/10.1145/3534678.3539205
Hu L, Yang T, Zhang L et al (2021) Compare to the knowledge: Graph neural fake news detection with external knowledge. In: Proceedings of the 59th annual meeting of the association for computational linguistics, pp 754–763, https://doi.org/10.18653/v1/2021.acl-long.62
Jin X, Lan C, Zeng W et al (2022) Style normalization and restitution for domain generalization and adaptation. IEEE Trans Multimed 24:3636–3651. https://doi.org/10.1109/TMM.2021.3104379, arxiv:2101.00588
Kaliyar RK, Goswami A, Narang P et al (2020) FNDNet A deep convolutional neural network for fake news detection. Cogn Syst Res 61:32–44. https://doi.org/10.1016/j.cogsys.2019.12.005
Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the EMNLP 2014 - 2014 conference on empirical methods in natural language processing, proceedings of the conference, pp 1746–1751, https://doi.org/10.3115/v1/d14-1181, arxiv:1408.5882
Kumari R, Ashok N, Ghosal T et al (2022) What the fake? Probing misinformation detection standing on the shoulder of novelty and emotion. Inf Process Manag 59(1):102740. https://doi.org/10.1016/j.ipm.2021.102740
Li Y, Gong M, Tian X et al (2018) Domain generalization via conditional invariant representations. 32nd AAAI conference on artificial intelligence, AAAI 2018 pp 3579–3587. https://doi.org/10.1609/aaai.v32i1.11682, arxiv:1807.08479
Lin H, Ma J, Cheng M et al (2021) Rumor detection on twitter with claim-guided hierarchical graph attention networks. In: The 2021 conference on empirical methods in natural language processing, pp 783–787, arxiv:2110.04522
Lin H, Ma J, Chen L et al (2022) Detect rumors in microblog posts for low-resource domains via adversarial contrastive learning. In: 2022 Annual conference of the north American chapter of the association for computational linguistics, pp 2543–2556
Luo Y, Ma J, Yeo CK (2021) BCMM: a novel post-based augmentation representation for early rumour detection on social media. Pattern Recognit 113:107818. https://doi.org/10.1016/j.patcog.2021.107818
Ma J, Gao W, Mitra P et al (2016) Detecting rumors from microblogs with recurrent neural networks. In: Proceedings of the IJCAI international joint conference on artificial intelligence, pp 3818–3824
Ma J, Gao W, Wong KF (2017) Detect rumors in microblog posts using propagation structure via kernel learning. In: Proceedings of the 55th annual meeting of the association for computational linguistics, pp 708–717, https://doi.org/10.18653/v1/P17-1066
Ma J, Gao W, Wong KF (2018) Rumor detection on twitter with tree-structured recursive neural networks. In: Proceedings of the ACL 2018, pp 1980–1989, https://doi.org/10.18653/v1/p18-1184
Mosallanezhad A, Karami M, Shu K et al (2022) Domain adaptive fake news detection via reinforcement learning. In: Proceedings of the ACM web conference 2022, vol 1. ACM, New York, NY, USA, pp 3632–3640, https://doi.org/10.1145/3485447.3512258, arxiv:2202.08159
Nan Q, Cao J, Zhu Y et al (2021) MDFEND: multi-domain fake news detection. In: Proceedings of the 30th ACM international conference on information & knowledge management. ACM, New York, NY, USA, pp 3343–3347, https://doi.org/10.1145/3459637.3482139
Popat K (2017) Assessing the credibility of claims on the web. In: Proceedings of the 26th international world wide web conference 2017, WWW 2017 companion, pp 735–739, https://doi.org/10.1145/3041021.3053379
Potthast M, Kiesel J, Reinartz K et al (2018) A stylometric inquiry into hyperpartisan and fake news. In: Proceedings of the 56th annual meeting of the association for computational linguistics, pp 231–240, https://doi.org/10.18653/v1/p18-1022, arxiv:1702.05638
Przybyła P (2020) Capturing the style of fake news. In: Proceedings of the AAAI 2020—34th AAAI conference on artificial intelligence, pp 490–497, https://doi.org/10.1609/aaai.v34i01.5386
Qian S, Hu J, Fang Q et al (2021) Knowledge-aware multi-modal adaptive graph convolutional networks for fake news detection. ACM Trans Multimed Comput Commun Appl 17(3):3451215. https://doi.org/10.1145/3451215
Shu K, Sliva A, Wang S et al (2017) Fake News Detection on Social Media. ACM SIGKDD Explor Newsl 19(1):22–36. https://doi.org/10.1145/3137597.3137600
Silva A, Luo L, Karunasekera S et al (2021) Embracing domain differences in fake news: cross-domain fake news detection using multi-modal data. In: Proceedings of the AAAI conference on artificial intelligence, pp 557–565
Sun M (2021) Perceiving the narrative style for fake news detection using deep learning. In: 2021 IEEE 23rd Int Conf on high performance computing & communications; 7th Int Conf on data science & systems; 19th Int Conf on Smart City; 7th Int Conf on dependability in sensor, cloud & big data systems & application, pp 1195–1202
Van der Maaten L, Hinton G (2008) Visualizing data using t-sne. J Mach Learn Res 9(11)
Vaswani A, Shazeer N, Parmar N, Uzkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Proceedings of the 31st conference on neural information processing systems (NIPS 2017), pp 8–15, https://doi.org/10.5555/3295222.3295349, arxiv:1706.03762v5
Vosoughi S, Mohsenvand MN, Roy D (2017) Rumor gauge: predicting the veracity of rumors on twitter. ACM Trans Knowl Discov Data 11(4):3070644. https://doi.org/10.1145/3070644
Wang Y, Ma F, Jin Z et al (2018) EANN: event adversarial neural networks for multi-modal fake news detection. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. ACM, New York, NY, USA, pp 849–857, https://doi.org/10.1145/3219819.3219903
Wen Y, Zhang K, Li Z et al (2016) A discriminative feature learning approach for deep face recognition. In: Leibe B, Matas J, Sebe N, et al (eds) Proceedings of the European conference on computer vision, Lecture Notes in Computer Science, vol 9911. Springer International Publishing, Cham, pp 1–7, https://doi.org/10.1007/978-3-319-46478-7
Wolf T, Debut L, Sanh V et al (2019) Transformers: State-of-the-art natural language processing. arxiv:1910.03771
Yuan C, Ma Q, Zhou W et al (2020) Early detection of fake news by utilizing the credibility of news, publishers, and users based on weakly supervised learning. In: Proceedings of the 28th international conference on computational linguistics. International Committee on Computational Linguistics, Stroudsburg, PA, USA, pp 5444–5454, https://doi.org/10.18653/v1/2020.coling-main.475, arxiv:2012.04233
Yue Z, Zeng H, Kou Z et al (2022) Contrastive domain adaptation for early misinformation detection: a case study on COVID-19. International conference on information and knowledge management, proceedings pp 2423–2433. https://doi.org/10.1145/3511808.3557263, arXiv:2208.09578
Zhang H, Fang Q, Qian S et al (2019) Multi-modal knowledge-aware event memory network for social media rumor detection. In: MM 2019 - Proceedings of the 27th ACM International Conference on Multimedia, pp 1942–1951, https://doi.org/10.1145/3343031.3350850
Zhang H, Qian S, Fang Q et al (2021) Multimodal disentangled domain adaption for social media event rumor detection. IEEE Trans Multimed 23:4441–4454. https://doi.org/10.1109/TMM.2020.3042055
Zhang T, Wang D, Chen H et al (2020) BDANN: BERT-based domain adaptation neural network for multi-modal fake news detection. In: Proceedings of the international joint conference on neural networks, pp 1–8, https://doi.org/10.1109/IJCNN48605.2020.9206973
Zhang X, Cao J, Li X et al (2021) Mining dual emotion for fake news detection. In: Proceedings of the web conference 2021—proceedings of the world wide web conference, WWW 2021, pp 3465–3476, https://doi.org/10.1145/3442381.3450004, arxiv:1903.01728
Zhu Y, Sheng Q, Cao J et al (2022) Memory-guided multi-view multi-domain fake news detection. IEEE Trans Knowl Data Eng 14(8):3185151. https://doi.org/10.1109/TKDE.2022.3185151
Zubiaga A, Liakata M, Procter R et al (2016) Analysing how people orient to and spread rumours in social media by looking at conversational threads. PLoS ONE 11(3):1–29. https://doi.org/10.1371/journal.pone.0150989, arXiv:1511.07487
Zubiaga A, Aker A, Bontcheva K et al (2018) Detection and resolution of rumours in social media: a survey. ACM Comput Surv 51(2). https://doi.org/10.1145/3161603, arXiv:1704.00656
Zuo Y, Zhu W, Cai G (2022) Continuous detection, rapidly react: unseen rumors detection based on continual prompt-tuning. In: Proceedings of the COLING 2022, arxiv:2203.11720
Acknowledgements
This work is supported by the National Natural Science Foundation of China under Grants (61772125) and the Fundamental Research Funds for the Central Universities (N2217001).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wu, D., Tan, Z., Zhao, H. et al. LIMFA: label-irrelevant multi-domain feature alignment-based fake news detection for unseen domain. Neural Comput & Applic 36, 5197–5215 (2024). https://doi.org/10.1007/s00521-023-09340-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-09340-z