Rumor detection with self-supervised learning on texts and social graph

Gao, Yuan; Wang, Xiang; He, Xiangnan; Feng, Huamin; Zhang, Yongdong

doi:10.1007/s11704-022-1531-9

Rumor detection with self-supervised learning on texts and social graph

Research Article
Published: 12 December 2022

Volume 17, article number 174611, (2023)
Cite this article

Frontiers of Computer Science Aims and scope Submit manuscript

Yuan Gao¹,
Xiang Wang¹,
Xiangnan He¹,
Huamin Feng² &
…
Yongdong Zhang¹

175 Accesses
41 Citations
1 Altmetric
Explore all metrics

Abstract

Rumor detection has become an emerging and active research field in recent years. At the core is to model the rumor characteristics inherent in rich information, such as propagation patterns in social network and semantic patterns in post content, and differentiate them from the truth. However, existing works on rumor detection fall short in modeling heterogeneous information, either using one single information source only (e.g., social network, or post content) or ignoring the relations among multiple sources (e.g., fusing social and content features via simple concatenation).

Therefore, they possibly have drawbacks in comprehensively understanding the rumors, and detecting them accurately. In this work, we explore contrastive self-supervised learning on heterogeneous information sources, so as to reveal their relations and characterize rumors better. Technically, we supplement the main supervised task of detection with an auxiliary self-supervised task, which enriches post representations via post self-discrimination.

Specifically, given two heterogeneous views of a post (i.e., representations encoding social patterns and semantic patterns), the discrimination is done by maximizing the mutual information between different views of the same post compared to that of other posts. We devise cluster-wise and instance-wise approaches to generate the views and conduct the discrimination, considering different relations of information sources. We term this framework as self-supervised rumor detection (SRD). Extensive experiments on three real-world datasets validate the effectiveness of SRD for automatic rumor detection on social media.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Joint rumour and stance identification based on semantic and structural information in social networks

Article 08 December 2023

Detecting the Rumor Patterns Integrating Features of User, Content, and the Spreading Structure

Harmfulness metrics in digital twins of social network rumors detection in cloud computing environment

Article Open access 08 February 2024

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

Farajtabar M, Yang J, Ye X, Xu H, Trivedi R, Khalil E, Li S, Song L, Zha H. Fake news mitigation via point process based intervention. In: Proceedings of the 34th International Conference on Machine Learning. 2017, 1097–1106
Jin Z, Cao J, Guo H, Zhang Y, Wang Y, Luo J. Detection and analysis of 2016 US presidential election related rumors on twitter. In: Proceedings of the 10th International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation. 2017, 14–24
Ruchansky N, Seo S, Liu Y. CSI: a hybrid deep model for fake news detection. In: Proceedings of 2017 ACM on Conference on Information and Knowledge Management. 2017, 797–806
Shu K, Cui L, Wang S, Lee D, Liu H. dEFEND: explainable fake news detection. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 395–405
Bian T, Xiao X, Xu T, Zhao P, Huang W, Rong Y, Huang J. Rumor detection on social media with bi-directional graph convolutional networks. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 549–556
Wu Z, Pi D, Chen J, Xie M, Cao J. Rumor detection based on propagation graph neural network with attention mechanism. Expert Systems with Applications, 2020, 158: 113595
Article Google Scholar
Ma J, Gao W, Wong K F. Rumor detection on twitter with tree-structured recursive neural networks. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018, 1980–1989
Yang X, Lyu Y, Tian T, Liu Y, Liu Y, Zhang X. Rumor detection on social media with graph structured adversarial learning. In: Proceedings of the 29th International Joint Conference on Artificial Intelligence. 2020, 1417–1423
Nguyen V H, Sugiyama K, Nakov P, Kan M Y. FANG: leveraging social context for fake news detection using graph representation. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2020, 1165–1174
Foroozani A, Ebrahimi M. Anomalous information diffusion in social networks: Twitter and Digg. Expert Systems with Applications, 2019, 134: 249–266
Article Google Scholar
Zubiaga A, Liakata M, Procter R, Hoi G W S, Tolmie P. Analysing how people orient to and spread rumours in social media by looking at conversational threads. PLoS One, 2016, 11(3): e0150989
Article Google Scholar
Chen T, Kornblith S, Norouzi M, Hinton G. A simple framework for contrastive learning of visual representations. In: Proceedings of the 37th International Conference on Machine Learning. 2020, 149
Devlin J, Chang M W, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2019, 4171–4186
He K, Fan H, Wu Y, Xie S, Girshick R. Momentum contrast for unsupervised visual representation learning. In: Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020, 9726–9735
Castillo C, Mendoza M, Poblete B. Information credibility on twitter. In: Proceedings of the 20th International Conference on World Wide Web. 2011, 675–684
Yang F, Liu Y, Yu X, Yang M. Automatic detection of rumor on Sina Weibo. In: Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics. 2012, 13
Ma J, Gao W, Wei Z, Lu Y, Wong K F. Detect rumors using time series of social context information on microblogging websites. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 2015, 1751–1754
Liu X, Nourbakhsh A, Li Q, Fang R, Shah S. Real-time rumor debunking on twitter. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 2015, 1867–1870
Ma J, Gao W, Wong K F. Detect rumors in microblog posts using propagation structure via kernel learning. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017, 708–717
Ma J, Gao W, Mitra P, Kwon S, Jansen B J, Wong K F, Cha M. Detecting rumors from microblogs with recurrent neural networks. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016, 3818–3824
Jin Z, Cao J, Guo H, Zhang Y, Luo J. Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In: Proceedings of the 25th ACM international conference on Multimedia. 2017, 795–816
Chen T, Li X, Yin H, Zhang J. Call attention to rumors: deep attention based recurrent neural networks for early rumor detection. In: Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. 2018, 40–52
Guo H, Cao J, Zhang Y, Guo J, Li J. Rumor detection with hierarchical social attention network. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2018, 943–951
Li Q, Zhang Q, Si L. Rumor detection by exploiting user credibility information, attention and multi-task learning. In: Proceedings of the 57th Conference of the Association for Computational Linguistics. 2019, 1173–1179
Yu F, Liu Q, Wu S, Wang L, Tan T. A convolutional approach for misinformation identification. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017, 3901–3907
Kumar S, Asthana R, Upadhyay S, Upreti N, Akbar M. Fake news detection using deep learning models: a novel approach. Transactions on Emerging Telecommunications Technologies, 2020, 31(2): e3767
Article Google Scholar
Liu Y, Wu Y F B. Early detection of fake news on social media through propagation path classification with recurrent and convolutional networks. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018, 354–361
Rao D, Miao X, Jiang Z, Li R. STANKER: stacking network based on level-grained attention-masked BERT for rumor detection on social media. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 3347–3363
Song Y Z, Chen Y S, Chang Y T, Weng S Y, Shuai H H. Adversary-aware rumor detection. In: Proceedings of the 59th Findings of the Association for Computational Linguistics. 2021, 1371–1382
Li J, Ni S, Kao H Y. Meet the truth: leverage objective facts and subjective views for interpretable rumor detection. In: Proceedings of the 59th Findings of the Association for Computational Linguistics. 2021, 705–715
Kipf T N, Welling M. Semi-supervised classification with graph convolutional networks. In: Proceedings of the 5th International Conference on Learning Representations. 2017
Zhang M, Chen Y. Link prediction based on graph neural networks. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems. 2018, 5171–5181
Pan S, Wu J, Zhu X, Zhang C, Yu P S. Joint structure feature exploration and regularization for multi-task graph classification. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(3): 715–728
Article Google Scholar
Wang D, Cui P, Zhu W. Structural deep network embedding. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016, 1225–1234
You J, Ying R, Ren X, Hamilton W L, Leskovec J. GraphRNN: generating realistic graphs with deep auto-regressive models. In: Proceedings of the 35th International Conference on Machine Learning. 2018
Wu Z, Pan S, Chen F, Long G, Zhang C, Yu P S. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 2021, 32(1): 4–24
Article MathSciNet Google Scholar
Niepert M, Ahmed M, Kutzkov K. Learning convolutional neural networks for graphs. In: Proceedings of the 33rd International Conference on Machine Learning. 2016, 2014–2023
Hamilton W L, Ying R, Leskovec J. Inductive representation learning on large graphs. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 1025–1035
Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y. Graph attention networks. In: Proceedings of the 6th International Conference on Learning Representations. 2018
He X, Deng K, Wang X, Li Y, Zhang Y, Wang M. LightGCN: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 639–648
Wang X, He X, Cao Y, Liu M, Chua T S. KGAT: knowledge graph attention network for recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 950–958
Zeng J, Wang X, Liu J, Chen Y, Liang Z, Chua T S, Chua Z L. SHADEWATCHER: recommendation-guided cyber threat analysis using system audit records. In: Proceedings of 2022 IEEE Symposium on Security and Privacy. 2022
Lin H, Ma J, Cheng M, Yang Z, Chen L, Chen G. Rumor detection on twitter with claim-guided hierarchical graph attention networks. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 10035–10047
van den Oord A, Kalchbrenner N, Vinyals O, Espeholt L, Graves A, Kavukcuoglu K. Conditional image generation with PixelCNN decoders. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. 2016, 4797–4805
Hjelm R D, Fedorov A, Lavoie-Marchildon S, Grewal K, Bachman P, Trischler A, Bengio Y. Learning deep representations by mutual information estimation and maximization. In: Proceedings of the 7th International Conference on Learning Representations. 2019
Goodfellow I J, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y. Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems. 2014, 2672–2680
Gutmann M, Hyvärinen A. Noise-contrastive estimation: a new estimation principle for unnormalized statistical models. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics. 2010, 297–304
Liu X, Zhang F, Hou Z, Mian L, Wang Z, Zhang J, Tang J. Self-supervised learning: generative or contrastive. IEEE Transactions on Knowledge and Data Engineering, 2021, doi: https://doi.org/10.1109/TKDE.2021.3090866
van den Oord A, Li Y, Vinyals O. Representation learning with contrastive predictive coding. 2019, arXiv preprint arXiv: 1807.03748
Tschannen M, Djolonga J, Rubenstein P K, Gelly S, Lucic M. On mutual information maximization for representation learning. In: Proceedings of the 8th International Conference on Learning Representations. 2020
Caron M, Bojanowski P, Joulin A, Douze M. Deep clustering for unsupervised learning of visual features. In: Proceedings of the 15th European Conference on Computer Vision. 2018, 139–156
Caron M, Misra I, Mairal J, Goyal P, Bojanowski P, Joulin A. Unsupervised learning of visual features by contrasting cluster assignments. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 831
Alwassel H, Mahajan D, Korbar B, Torresani L, Ghanem B, Tran D. Self-supervised learning by cross-modal audio-video clustering. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 818
Veličković P, Fedus W, Hamilton W L, Liò P, Bengio Y, Hjelm R D. Deep graph infomax. In: Proceedings of the 7th International Conference on Learning Representations. 2019
Kim D, Oh A. How to find your friendly neighborhood: graph attention design with self-supervision. In: Proceedings of the 9th International Conference on Learning Representations. 2021
Wu J, Wang X, Feng F, He X, Chen L, Lian J, Xie X. Self-supervised graph learning for recommendation. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 726–735
Liu Y, Xu S, Tourassi G D. Detecting rumors through modeling information propagation networks in a social media environment. In: Proceedings of 8th International Conference, SBP 2015. 2015
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez A N, Kaiser Ł, Polosukhin I. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 6000–6010
Bello I, Zoph B, Le Q, Vaswani A, Shlens J. Attention augmented convolutional networks. In: Proceedings of 2019 IEEE/CVF International Conference on Computer Vision. 2019, 3285–3294
Wang X, Girshick R, Gupta A, He K. Non-local neural networks. In: Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018, 7794–7803
Kim Y. Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014, 1746–1751
Zhang Z K, Liu C, Zhan X X, Lu X, Zhang C X, Zhang Y C. Dynamics of information diffusion and its applications on complex networks. Physics Reports, 2016, 651: 1–34
Article MathSciNet Google Scholar
Huang Q, Zhou C, Wu J, Liu L, Wang B. Deep spatial—temporal structure learning for rumor detection on twitter. Neural Computing and Applications, 2020, doi: https://doi.org/10.1007/s00521-020-05236-4
Wei L, Hu D, Zhou W, Yue Z, Hu S. Towards propagation uncertainty: edge-enhanced Bayesian graph convolutional networks for rumor detection. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. 2021, 3845–3854
Kingma D P, Ba J. Adam: a method for stochastic optimization. In: Proceedings of the 3rd International Conference on Learning Representations. 2015
Loshchilov I, Hutter F. SGDR: stochastic gradient descent with warm restarts. In: Proceedings of the 5th International Conference on Learning Representations. 2017
Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013, 3111–3119
Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics. 2010, 249–256
Li Q, Han Z, Wu X M. Deeper insights into graph convolutional networks for semi-supervised learning. In: Proceedings of the 32th AAAI Conference on Artificial Intelligence. 2018, 3538–3545
Wang F, Liu H. Understanding the behaviour of contrastive loss. In: Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021, 2495–2504
Farinneya P, Pour M M A, Hamidian S, Diab M. Active learning for rumor identification on social media. In: Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021. 2021, 4556–4565
Xia R, Xuan K, Yu J. A state-independent and time-evolving network for early rumor detection in social media. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 9042–9051
Chami I, Ying R, Re C, Leskovec J. Hyperbolic graph convolutional neural networks. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems. 2019, 438
Wu Y X, Wang X, Zhang A, He X, Chua T S. Discovering invariant rationales for graph neural networks. In: Proceedings of the ICLR 2022. 2022
Wang X, Wu Y X, Zhang A, He X, Chua T S. Towards multi-grained explainability for graph neural networks. In: Proceedings of the 34th Annual Conference on Neural Information Processing Systems. 2021
Li Y, Wang X, Xiao J, Ji W, Chua T S. Invariant grounding for video question answering. 2022, arXiv preprint arXiv: 2206.02349

Download references

Acknowledgements

This work was supported by the National Key Research and Development Program of China (2020AAA0106000), the National Natural Science Foundation of China (Grant Nos. U21B2026, 62121002), and the CCCD Key Lab of Ministry of Culture and Tourism.

Author information

Authors and Affiliations

School of Information Science and Technology, University of Science and Technology of China, Hefei, 230026, China
Yuan Gao, Xiang Wang, Xiangnan He & Yongdong Zhang
Beijing Electronic Science and Technology Institute, Beijing, 102627, China
Huamin Feng

Authors

Yuan Gao
View author publications
Search author on:PubMed Google Scholar
Xiang Wang
View author publications
Search author on:PubMed Google Scholar
Xiangnan He
View author publications
Search author on:PubMed Google Scholar
Huamin Feng
View author publications
Search author on:PubMed Google Scholar
Yongdong Zhang
View author publications
Search author on:PubMed Google Scholar

Corresponding authors

Correspondence to Xiang Wang or Xiangnan He.

Additional information

Yuan Gao received the MS degree in Electrical and Computer Engineering from University of Michigan, USA in 2019. He is now a PhD student in the School of Cyberspace Science and Technology at the University of Science and Technology of China (USTC), China. His research interest lies in fraud detection, representation learning, and graph learning.

Xiang Wang is now a professor at the University of Science and Technology of China (USTC), China. He received his PhD degree from National University of Singapore, Singapore in 2019. His research interests include recommender systems, graph learning, AI explainability, and AI security. He has published some academic papers on international conferences such as NeurIPS, ICLR, KDD, WWW, SIGIR, and AAAI. He serves as a program committee member for several top conferences such as SIGIR and WWW.

Xiangnan He is a professor at the University of Science and Technology of China (USTC), China. He received his PhD in Computer Science from the National University of Singapore (NUS), Singapore. His research interests span information retrieval, data mining, and multi-media analytics. He has over 80 publications that appeared in several top conferences such as SIGIR, WWW, and MM, and journals including TKDE, TOIS, and TMM. His work has received the Best Paper Award Honorable Mention in WWW 2018 and ACM SIGIR 2016. He is in the editorial board of journals including Frontiers in Big Data, AI Open. Moreover, he has served as the PC chair of CCIS 2019 and SPC/PC member for several top conferences including SIGIR, WWW, KDD, MM, WSDM, ICML, etc., and the regular reviewer for journals including TKDE, TOIS, TMM.

Huamin Feng is now a professor at Beijing Electronic Science and Technology Institute, China. He received his PhD dergree from National University of Singapore, Singapore in 2005. His research interests include multimedia semantic analysis, recommend system, and Web content analysis. He has published some academic papers on international conferences such as WWW, SIGIR, and MMM.

Yongdong Zhang (Senior Member, IEEE) received the PhD degree in electronic engineering from Tianjin University, China in 2002. He is currently a Professor with the University of Science and Technology of China. He has authored more than 100 refereed journal and conference papers. His current research interests include multimedia content analysis and understanding, multimedia content security, video encoding, and streaming media technology. He was a recipient of the Best Paper Award in PCM2013, ICIMCS 2013, and ICME 2010; and the Best Paper Candidate in ICME 2011. He serves as an Editorial Board Member for Multimedia Systems journal and Neurocomputing.

Electronic supplementary material