Skip to main content

Early Detection of Fake News with Multi-source Weak Social Supervision

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12459))

Abstract

Social media has greatly enabled people to participate in online activities at an unprecedented rate. However, this unrestricted access also exacerbates the spread of misinformation and fake news which cause confusion and chaos if not detected in a timely manner. Given the rapidly evolving nature of news events and the limited amount of annotated data, state-of-the-art systems on fake news detection face challenges for early detection. In this work, we exploit multiple weak signals from different sources from user engagements with contents (referred to as weak social supervision), and their complementary utilities to detect fake news. We jointly leverage limited amount of clean data along with weak signals from social engagements to train a fake news detector in a meta-learning framework which estimates the quality of different weak instances. Experiments on real-world datasets demonstrate that the proposed framework outperforms state-of-the-art baselines for early detection of fake news without using any user engagements at prediction time.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://bit.ly/39zPnMd.

  2. 2.

    https://bit.ly/39xmXT7.

  3. 3.

    https://www.gossipcop.com/.

  4. 4.

    https://www.politifact.com/.

  5. 5.

    https://bit.ly/2WGK6zE.

  6. 6.

    All the data and code are available at: this clickable link.

References

  1. Abbasi, M.-A., Liu, H.: Measuring user credibility in social media. In: Greenberg, A.M., Kennedy, W.G., Bos, N.D. (eds.) SBP 2013. LNCS, vol. 7812, pp. 441–448. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37210-0_48

    Chapter  Google Scholar 

  2. Castillo, C., Mendoza, M., Poblete, B.: Information credibility on twitter. In: WWW (2011)

    Google Scholar 

  3. Cui, L., Wang, S., Lee, D.: Same: sentiment-aware multi-modal embedding for detecting fake news (2019)

    Google Scholar 

  4. Ge, L., Gao, J., Li, X., Zhang, A.: Multi-source deep learning for information trustworthiness estimation. In: KDD, ACM (2013)

    Google Scholar 

  5. Gentzkow, M., Shapiro, J.M., Stone, D.F.: In Handbook of media economics, vol. 1, pp. 623–645. Elsevier (2015)

    Google Scholar 

  6. Hendrycks, D., Mazeika, M., Wilson, D., Gimpel, K.: Using trusted data to train deep networks on labels corrupted by severe noise. In: NeurIPS (2018)

    Google Scholar 

  7. Hutto, C., Gilbert, E.: Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: ICWSM (2014)

    Google Scholar 

  8. Jin, Z., Cao, J., Zhang, Y., Luo, J.: News verification by exploiting conflicting social viewpoints in microblogs. In: AAAI (2016)

    Google Scholar 

  9. Kim, Y.: Convolutional neural networks for sentence classification (2014)

    Google Scholar 

  10. Kulshrestha, J.: Quantifying search bias: Investigating sources of bias for political searches in social media. In: CSCW, ACM (2017)

    Google Scholar 

  11. Li, Y., Yang, J., Song, Y., Cao, L., Luo, J., Li, L.J.: Learning from noisy labels with distillation. In: ICCV (2017)

    Google Scholar 

  12. Yinhan, L.: A robustly optimized bert pretraining approach, Roberta (2019)

    Google Scholar 

  13. Ouyang, W., Chu, X., Wang, X.: Multi-source deep learning for human pose estimation. In: CVPR, IEEE (2014)

    Google Scholar 

  14. Patrini, G., Rozza, A., Krishna Menon, A., Nock, R., Qu, L.: A loss correction approach. In: CVPR, Making Deep Neural Networks Robust to Label Noise (2017)

    Google Scholar 

  15. Pennebaker, J.W., Boyd, R.L., Jordan, K., Blackburn, K.: The development and psychometric properties of liwc2015. Technical report (2015)

    Google Scholar 

  16. Potthast, M., Kiesel, J., Reinartz, K., Bevendorff, J., Stein, B.: A stylometric inquiry into hyperpartisan and fake news. arXiv preprint arXiv:1702.05638 (2017)

  17. Qian, F., Gong, C., Sharma, K., Liu, Y.: Fake news detection with collective user intelligence. In: IJCAI, Neural User Response Generator (2018)

    Google Scholar 

  18. Ratner, A., Bach, S.H., Ehrenberg, H., Fries, J., Wu, S., Ré, C.: Snorkel: rapid training data creation with weak supervision. Proc. VLDB Endowment 11(3), 269–282 (2017)

    Article  Google Scholar 

  19. Ratner, A., Bach, S.H., Ehrenberg, H., Fries, J., Wu, S., Ré, C.: Snorkel: rapid training data creation with weak supervision. In: Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases, vol. 11, no. 3, p. 269. NIH Public Access (2017)

    Google Scholar 

  20. Ratner, A., Hancock, B., Dunnmon, J., Sala, F., Pandey, S., Ré, C.: Training complex models with multi-task weak supervision. arXiv preprint arXiv:1810.02840 (2018)

  21. Ratner, A., Hancock, B., Dunnmon, J., Sala, F., Pandey, S., Ré C.: Training complex models with multi-task weak supervision (2018)

    Google Scholar 

  22. Reed, S., Lee, H., Anguelov, D., Szegedy, C., Erhan, D., Rabinovich, A.: Training deep neural networks on noisy labels with bootstrapping. arXiv preprint arXiv:1412.6596 (2014)

  23. Ren, M., Zeng, W., Yang, B., Urtasun, R.: Learning to reweight examples for robust deep learning. arXiv preprint arXiv:1803.09050 (2018)

  24. Rubin, V.L., Lukoianova, T.: Truth and deception at the rhetorical structure level. J. Assoc. Inf. Sci. Technol. 66(5), 905–917 (2015)

    Article  Google Scholar 

  25. Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H.: Fakenewsnet: A data repository with news content, social context and dynamic information for studying fake news on social media. arXiv preprint arXiv:1809.01286 (2018)

  26. Shu, K., Mukherjee, S., Zheng, G., Awadallah, A.H., Shokouhi, M., Dumais, S.: Learning with weak supervision for email intent detection. In: SIGIR (2020)

    Google Scholar 

  27. Shu, K., Sliva, A., Wang, S., Tang, J., Liu, H.: A data mining perspective. KDD exploration newsletter, Fake news detection on social media (2017)

    Google Scholar 

  28. Shu, K., Wang, S., Liu, H.: The role of social context for fake news detection. In: WSDM, Beyond news contents (2019)

    Google Scholar 

  29. Stewart, R., Ermon, S.: Label-free supervision of neural networks with physics and domain knowledge. In: AAAI (2017)

    Google Scholar 

  30. Sukhbaatar, S., Bruna, J., Paluri, M., Bourdev, L., Fergus, R.: Training convolutional networks with noisy labels. arXiv preprint arXiv:1406.2080 (2014)

  31. Varma, P., Sala, F., He, A., Ratner, A., Ré C.: Learning dependency structures for weak supervision models. In: ICML (2019)

    Google Scholar 

  32. Vosoughi, S., Roy, D., Aral, S.: The spread of true and false news online. Science 359(6380), 1146–1151 (2018)

    Article  Google Scholar 

  33. Wang, Y., et al.: Event adversarial neural networks for multi-modal fake news detection. In: CIKM, Eann (2018)

    Google Scholar 

  34. Zhang, Z.Y., Zhao, P., Jiang, Y., Zhou, Z.H.: Learning from incomplete and inaccurate supervision. In: KDD (2019)

    Google Scholar 

  35. Zheng, G., Awadallah, A.H., Dumais, S.: Meta label correction for learning with weak supervision. arXiv preprint arXiv:1911.03809 (2019)

Download references

Acknowledgments

This work is, in part, supported by Global Security Initiative (GSI) at ASU and by NSF grant # 1614576.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kai Shu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shu, K. et al. (2021). Early Detection of Fake News with Multi-source Weak Social Supervision. In: Hutter, F., Kersting, K., Lijffijt, J., Valera, I. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2020. Lecture Notes in Computer Science(), vol 12459. Springer, Cham. https://doi.org/10.1007/978-3-030-67664-3_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-67664-3_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-67663-6

  • Online ISBN: 978-3-030-67664-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics