Early Detection of Fake News with Multi-source Weak Social Supervision

Shu, Kai; Zheng, Guoqing; Li, Yichuan; Mukherjee, Subhabrata; Awadallah, Ahmed Hassan; Ruston, Scott; Liu, Huan

doi:10.1007/978-3-030-67664-3_39

Conference paper
First Online: 25 February 2021

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 12459)

Abstract

Social media has enabled people to participate in online activities at an unprecedented rate. However, this unrestricted access also exacerbates the spread of misinformation and fake news, which can cause confusion and chaos if not detected in a timely manner. Given the rapidly evolving nature of news events and the limited amount of annotated data, state-of-the-art fake news detection systems struggle with early detection. In this work, we exploit multiple weak signals from different sources, derived from user engagements with content (referred to as weak social supervision), and their complementary utilities, to detect fake news. We jointly leverage a limited amount of clean data along with weak signals from social engagements to train a fake news detector in a meta-learning framework that estimates the quality of different weak instances. Experiments on real-world datasets demonstrate that the proposed framework outperforms state-of-the-art baselines for early detection of fake news without using any user engagements at prediction time.
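
To make the idea of estimating the quality of weak sources against a small clean set concrete, here is a minimal sketch. It is not the meta-learning framework described in the abstract: it swaps in a much simpler stand-in, accuracy-weighted voting, where each weak source's weight comes from its agreement with the clean labels. The source names (`source_a`, `source_b`, `source_c`), the function names, and the toy labels are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List


def source_quality(clean_labels: List[int],
                   weak_votes: Dict[str, List[int]]) -> Dict[str, float]:
    """Fraction of clean examples on which each weak source matches the clean label."""
    n = len(clean_labels)
    return {
        name: sum(int(v == y) for v, y in zip(votes, clean_labels)) / n
        for name, votes in weak_votes.items()
    }


def weighted_vote(votes: Dict[str, int], quality: Dict[str, float]) -> int:
    """Combine binary votes (1 = fake, 0 = real).

    Each source is weighted by (quality - 0.5), so a source that is no
    better than chance on the clean set contributes nothing.
    """
    score = sum(
        (quality[name] - 0.5) * (1 if vote == 1 else -1)
        for name, vote in votes.items()
    )
    return 1 if score > 0 else 0


# Hypothetical toy data: four clean-labeled articles and three weak sources.
clean = [1, 0, 1, 0]
weak = {
    "source_a": [1, 0, 1, 1],
    "source_b": [1, 0, 1, 0],
    "source_c": [0, 1, 1, 0],
}
quality = source_quality(clean, weak)

# Label a new article from its weak votes alone.
label = weighted_vote({"source_a": 1, "source_b": 0, "source_c": 1}, quality)
```

In this toy setup the most reliable source outweighs the two less reliable ones when they disagree. A learned approach such as the one the abstract describes would instead estimate quality per weak instance during training rather than per source up front.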

Notes

1. https://bit.ly/39zPnMd
2. https://bit.ly/39xmXT7
3. https://www.gossipcop.com/
4. https://www.politifact.com/
5. https://bit.ly/2WGK6zE
6. All the data and code are available at: this clickable link.

Acknowledgments

This work is supported in part by the Global Security Initiative (GSI) at ASU and by NSF grant #1614576.

Author information

Authors and Affiliations

Department of Computer Science, Illinois Institute of Technology, Chicago, USA
Kai Shu
Microsoft Research AI, Redmond, USA
Guoqing Zheng, Subhabrata Mukherjee & Ahmed Hassan Awadallah
Arizona State University, Tempe, USA
Yichuan Li, Scott Ruston & Huan Liu

Corresponding author

Correspondence to Kai Shu.

Editor information

Editors and Affiliations

Albert-Ludwigs-Universität, Freiburg, Germany
Frank Hutter
TU Darmstadt, Darmstadt, Germany
Kristian Kersting
Ghent University, Ghent, Belgium
Jefrey Lijffijt
Saarland University, Saarbrücken, Germany
Isabel Valera

Cite this paper

Shu, K. et al. (2021). Early Detection of Fake News with Multi-source Weak Social Supervision. In: Hutter, F., Kersting, K., Lijffijt, J., Valera, I. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2020. Lecture Notes in Computer Science, vol 12459. Springer, Cham. https://doi.org/10.1007/978-3-030-67664-3_39

DOI: https://doi.org/10.1007/978-3-030-67664-3_39
Published: 25 February 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67663-6
Online ISBN: 978-3-030-67664-3
eBook Packages: Computer Science, Computer Science (R0)
