skip to main content
10.1145/3366424.3385772acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Proactive Discovery of Fake News Domains from Real-Time Social Media Feeds

Published: 20 April 2020 Publication History

Abstract

The proliferation of web sites that disseminate fake news is a growing problem in our society. Not surprisingly, the problem of identifying whether a web page contains fake news has attracted substantial attention. However, the problem of discovering new sources of fake news has been largely unexplored. Timely discovery of such sources is critical to combat misinformation and minimize its potential harm. In this paper, we present an automatic discovery system that proactively surfaces fake news domains before they are flagged by humans. Our system operates in two-steps: first, it uses Twitter feeds to uncover user co-sharing structures to discover political websites; then it uses a topic-agnostic classifier to score and rank newly discovered domains. To demonstrate the effectiveness of our system, we conduct an experimental evaluation in which we collect tweets related to the 2020 presidential impeachment process in the United States, and show that not only our system is able to discover new sites, but that a large percentage of these sites are indeed publishing fake news. We also design an integrated user interface to support fact-checkers and leverage their knowledge. Through this interface, fact-checkers can visualize domain interaction networks, query domain fakeness score, and tag incorrectly predicted results. Our proactive discovery system will expedite fact-checking process and can be a powerful weapon in the toolbox to combat misinformation.

References

[1]
Alexandre Bovet and Hernán A. Makse. 2019. Influence of fake news in Twitter during the 2016 US presidential election. Nature Communications 10, 1 (2019), 7. https://doi.org/10.1038/s41467-018-07761-2
[2]
Sonia Castelo, Thais Almeida, Anas Elghafari, Aécio Santos, Kien Pham, Eduardo Nakamura, and Juliana Freire. 2019. A Topic-Agnostic Approach for Identifying Fake News Pages. In Companion Proceedings of The 2019 World Wide Web Conference (San Francisco, USA) (WWW ’19). ACM, New York, NY, USA, 975–980. https://doi.org/10.1145/3308560.3316739
[3]
Zhouhan Chen, Rima S. Tanash, Richard Stoll, and Devika Subramanian. 2017. Hunting Malicious Bots on Twitter: An Unsupervised Approach. In Social Informatics. Springer International Publishing, Cham, 501–510.
[4]
Fernando Cardoso Durier da Silva, Rafael Vieira, and Ana Cristina Garcia. 2019. Can Machines Learn to Detect Fake News? A Survey Focused on Social Media. In HICSS.
[5]
Chris Dulhanty, Jason L. Deglint, Ibrahim Ben Daya, and Alexander Wong. 2019. Taking a Stance on Fake News: Towards Automatic Disinformation Assessment via Deep Bidirectional Transformer Language Models for Stance Detection. CoRR abs/1911.11951(2019). arxiv:1911.11951http://arxiv.org/abs/1911.11951
[6]
Emilio Ferrara, Onur Varol, Clayton A. Davis, Filippo Menczer, and Alessandro Flammini. 2016. The rise of social bots. Commun. ACM 59(2016), 96–104.
[7]
Maria Glenski, Ellyn Ayton, Josh Mendoza, and Svitlana Volkova. 2019. Multilingual Multimodal Digital Deception Detection and Disinformation Spread across Social Platforms. CoRR abs/1909.05838(2019). arxiv:1909.05838http://arxiv.org/abs/1909.05838
[8]
Nir Grinberg, Kenneth Joseph, Lisa Friedland, Briony Swire-Thompson, and David Lazer. 2019. Fake news on Twitter during the 2016 U.S. presidential election. Science 363, 6425 (2019), 374–378. https://doi.org/10.1126/science.aau2706 arXiv:https://science.sciencemag.org/content/363/6425/374.full.pdf
[9]
Gisel Bastidas Guacho, Sara Abdali, Neil Shah, and Evangelos E. Papalexakis. 2018. Semi-supervised Content-Based Detection of Misinformation via Tensor Embeddings. 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (Aug 2018). https://doi.org/10.1109/asonam.2018.8508241
[10]
Bin Guo, Yasan Ding, Lina Yao, Yunji Liang, and Zhiwen Yu. 2019. The Future of Misinformation Detection: New Perspectives and Trends. CoRR abs/1909.03654(2019). arxiv:1909.03654http://arxiv.org/abs/1909.03654
[11]
Sebastião Miranda, David Nogueira, Afonso Mendes, Andreas Vlachos, Andrew Secker, Rebecca Garrett, Jeff Mitchel, and Zita Marinho. 2019. Automated Fact Checking in the News Room. In The World Wide Web Conference (San Francisco, CA, USA) (WWW ’19). Association for Computing Machinery, New York, NY, USA, 3579–3583. https://doi.org/10.1145/3308558.3314135
[12]
Kai Nakamura, Sharon Levy, and William Yang Wang. 2019. r/Fakeddit: A New Multimodal Benchmark Dataset for Fine-grained Fake News Detection. ArXiv abs/1911.03854(2019).
[13]
Feng Qian, Chengyue Gong, Karishma Sharma, and Yan Liu. 2018. Neural User Response Generator: Fake News Detection with Collective User Intelligence. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18. International Joint Conferences on Artificial Intelligence Organization, 3834–3840. https://doi.org/10.24963/ijcai.2018/533
[14]
Chengcheng Shao, Giovanni Luca Ciampaglia, Alessandro Flammini, and Filippo Menczer. 2016. Hoaxy: A platform for tracking online misinformation. In Proceedings of the 25th International Conference Companion on World Wide Web. International World Wide Web Conferences Steering Committee, 745–750.
[15]
Kai Shu, Ahmed Hassan Awadallah, Susan Dumais, and Huan Liu. 2019. Detecting Fake News with Weak Social Supervision. arXiv:arXiv:1910.11430
[16]
Kai Shu, Limeng Cui, Suhang Wang, Dongwon Lee, and Huan Liu. 2019. dEFEND: Explainable Fake News Detection. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Anchorage, AK, USA) (KDD ’19). ACM, New York, NY, USA, 395–405. https://doi.org/10.1145/3292500.3330935
[17]
Kai Shu and Huan Liu. 2019. Detecting Fake News on Social Media. Synthesis Lectures on Data Mining and Knowledge Discovery (2019).
[18]
Kai Shu, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu. 2017. Fake News Detection on Social Media: A Data Mining Perspective. SIGKDD Explor. Newsl. 19, 1 (Sept. 2017), 22–36. https://doi.org/10.1145/3137597.3137600
[19]
Kai Shu, Suhang Wang, Dongwon Lee, and Huan Liu. 2020. Mining Disinformation and Fake News: Concepts, Methods, and Recent Advancements. CoRR abs/2001.00623(2020). arxiv:2001.00623http://arxiv.org/abs/2001.00623
[20]
Kate Starbird, Ahmer Arif, and Tom Wilson. 2019. Disinformation As Collaborative Work: Surfacing the Participatory Nature of Strategic Information Operations. Proc. ACM Hum.-Comput. Interact. 3, CSCW, Article 127 (Nov. 2019), 26 pages. https://doi.org/10.1145/3359229
[21]
Leo Graiden Stewart, Ahmer Arif, and Kate Starbird. 2018. Examining Trolls and Polarization with a Retweet Network.
[22]
Maciej Szpakowski. 2020. FakeNewsCorpus. https://github.com/several27/FakeNewsCorpus
[23]
Onur Varol, Emilio Ferrara, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2017. Online Human-Bot Interactions: Detection, Estimation, and Characterization. https://aaai.org/ocs/index.php/ICWSM/ICWSM17/paper/view/15587
[24]
Soroush Vosoughi, Deb Roy, and Sinan Aral. 2018. The spread of true and false news online. Science 359, 6380 (2018), 1146–1151. https://doi.org/10.1126/science.aap9559 arXiv:https://science.sciencemag.org/content/359/6380/1146.full.pdf
[25]
Yaqing Wang, Weifeng Yang, Fenglong Ma, Jin Xu, Bin Zhong, Qiang Deng, and Jing Gao. 2019. Weak Supervision for Fake News Detection via Reinforcement Learning. arXiv e-prints, Article arXiv:1912.12520 (Dec 2019), arXiv:1912.12520 pages. arxiv:1912.12520 [cs.SI]
[26]
Fan Yang, Shiva K. Pentyala, Sina Mohseni, Mengnan Du, Hao Yuan, Rhema Linder, Eric D. Ragan, Shuiwang Ji, and Xia (Ben) Hu. 2019. XFake: Explainable Fake News Detector with Visualizations. In The World Wide Web Conference(San Francisco, CA, USA) (WWW ’19). ACM, New York, NY, USA, 3600–3604. https://doi.org/10.1145/3308558.3314119
[27]
Shuo Yang, Kai Shu, Suhang Wang, Renjie Gu, Fan Wu, and Huan Liu. 2019. Unsupervised Fake News Detection on Social Media: A Generative Approach. In AAAI.

Cited By

View all
  • (2024)Leveraging Exposure Networks for Detecting Fake News SourcesProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671539(5635-5646)Online publication date: 25-Aug-2024
  • (2024)Knowledge Enhanced Vision and Language Model for Multi-Modal Fake News DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.333029626(8312-8322)Online publication date: 2024
  • (2024)Do Sentence-Level Sentiment Interactions Matter? Sentiment Mixed Heterogeneous Network for Fake News DetectionIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.326909011:4(5090-5100)Online publication date: Aug-2024
  • Show More Cited By

Index Terms

  1. Proactive Discovery of Fake News Domains from Real-Time Social Media Feeds
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          WWW '20: Companion Proceedings of the Web Conference 2020
          April 2020
          854 pages
          ISBN:9781450370240
          DOI:10.1145/3366424
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 20 April 2020

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. fake news discovery
          2. misinformation
          3. social network analysis

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          WWW '20
          Sponsor:
          WWW '20: The Web Conference 2020
          April 20 - 24, 2020
          Taipei, Taiwan

          Acceptance Rates

          Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)65
          • Downloads (Last 6 weeks)5
          Reflects downloads up to 02 Mar 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)Leveraging Exposure Networks for Detecting Fake News SourcesProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671539(5635-5646)Online publication date: 25-Aug-2024
          • (2024)Knowledge Enhanced Vision and Language Model for Multi-Modal Fake News DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.333029626(8312-8322)Online publication date: 2024
          • (2024)Do Sentence-Level Sentiment Interactions Matter? Sentiment Mixed Heterogeneous Network for Fake News DetectionIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.326909011:4(5090-5100)Online publication date: Aug-2024
          • (2024)Domain- and category-style clustering for general fake news detection via contrastive learningInformation Processing & Management10.1016/j.ipm.2024.10372561:4(103725)Online publication date: Jul-2024
          • (2023)Who Funds Misinformation? A Systematic Analysis of the Ad-related Profit Routines of Fake News SitesProceedings of the ACM Web Conference 202310.1145/3543507.3583443(2765-2776)Online publication date: 30-Apr-2023
          • (2022)A Review on Identification of Unreliability and Fakeness in Social Media Posts using Blockchain Technology2022 9th International Conference on Computing for Sustainable Global Development (INDIACom)10.23919/INDIACom54597.2022.9763221(831-837)Online publication date: 23-Mar-2022
          • (2022)Combating Fake News: Stakeholder Interventions and Potential SolutionsIEEE Access10.1109/ACCESS.2022.319367010(78268-78289)Online publication date: 2022
          • (2022)I Hardly Lie: A Multistage Fake News Detection SystemBiologically Inspired Techniques in Many Criteria Decision Making10.1007/978-981-16-8739-6_23(253-261)Online publication date: 4-Jun-2022
          • (2022)HACK: A Hierarchical Model for Fake News DetectionWeb Information Systems Engineering – WISE 202110.1007/978-3-030-90888-1_43(565-572)Online publication date: 1-Jan-2022
          • (2021)The Surprising Performance of Simple Baselines for Misinformation DetectionProceedings of the Web Conference 202110.1145/3442381.3450111(3432-3441)Online publication date: 19-Apr-2021
          • Show More Cited By

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media