research-article

ACT : Automatic Fake News Classification Through Self-Attention

Author:
Nujud Aloshban

University of Glasgow, United Kingdom

University of Glasgow, United Kingdom
View Profile

WebSci '20: Proceedings of the 12th ACM Conference on Web ScienceJuly 2020Pages 115–124https://doi.org/10.1145/3394231.3397901

Published:06 July 2020Publication History

WebSci '20: Proceedings of the 12th ACM Conference on Web Science

Pages 115–124

ABSTRACT

Automatic detection of fake news is an important issue given the disproportionate effect of fake news on democratic processes, individuals and institutions. Research on automated fact-checking has proposed different approaches based on traditional machine learning methods, using hand-crafted lexical features. Nevertheless, these approaches focus on analyzing the text claim without considering the facts that are not explicitly given but can be derived from it. For example, external evidence that is retrieved from the Web as a knowledge source of the claim can provide complementary context of the claim and gives convincing reasons from it to support or oppose. Recent approaches study this deficit by incorporating supportive evidence (article) corresponding to the claim. However, these methods are either requiring substantial feature modeling, not considering several supporting evidences, or even not analyzing the language of the supporting evidence deeply.

To this end, we propose an end-to-end framework, named Automatic Fake News Classification Through Self-Attention (ACT), which exploits different supportive articles to a claim which mimics manual fact-checking processes. The model presents an approach that computes the claim credibility by aggregating over the prediction generated by every claim-retrieved article pair. The article input is represented by using self-attention on the top of a bidirectional LSTM neural network. By using the self-attention, the model concentrates on nuanced linguistic features and does not require any feature engineering, lexicons or any other manual intervention. Moreover, different aspects of the supporting article are extracted into multiple vector representations. Hence, different meaningful article representations can be extracted into a two-dimensional matrix to represent the article. In the end, a majority vote over the several external articles of a given claim is applied to assess the claim’s credibility. We conduct experiments on three different real-world datasets, compare them to the state-of-the-art approaches and analyze our results, which shows performance improvements.

Supplemental Material

3394231.3397901.mp4

mp4

27.2 MB

Download

References

Hunt Allcott, Matthew Gentzkow, and Chuan Yu. 2019. Trends in the diffusion of misinformation on social media. Research & Politics 6, 2 (2019), 2053168019848554.Google ScholarCross Ref
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473(2014).Google Scholar
Yoshua Bengio, Patrice Y. Simard, and Paolo Frasconi. 1994. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Networks 5, 2 (1994), 157–166.Google ScholarDigital Library
Petter Bae Brandtzæg and Asbjørn Følstad. 2017. Trust and distrust in online fact-checking services. Commun. ACM 60, 9 (2017), 65–71.Google ScholarDigital Library
Tong Chen, Xue Li, Hongzhi Yin, and Jun Zhang. 2018. Call attention to rumors: Deep attention based recurrent neural networks for early rumor detection. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 40–52.Google ScholarCross Ref
Jianpeng Cheng, Li Dong, and Mirella Lapata. 2016. Long Short-Term Memory-Networks for Machine Reading. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016, Austin, Texas, USA, November 1-4, 2016. 551–561.Google ScholarCross Ref
Jan K Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, and Yoshua Bengio. 2015. Attention-based models for speech recognition. In Advances in neural information processing systems. 577–585.Google Scholar
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel P. Kuksa. 2011. Natural Language Processing (almost) from Scratch. CoRR abs/1103.0398(2011). http://arxiv.org/abs/1103.0398Google Scholar
Cícero Nogueira dos Santos and Maira Gatti. 2014. Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts. In COLING 2014, 25th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, August 23-29, 2014, Dublin, Ireland. 69–78.Google Scholar
Cícero Nogueira dos Santos, Ming Tan, Bing Xiang, and Bowen Zhou. 2016. Attentive Pooling Networks. abs/1602.03609 (2016). http://arxiv.org/abs/1602.03609Google Scholar
Felix A Gers, Nicol N Schraudolph, and Jürgen Schmidhuber. 2002. Learning precise timing with LSTM recurrent networks. Journal of machine learning research 3, Aug (2002), 115–143.Google Scholar
Naeemul Hassan, Bill Adair, James T Hamilton, Chengkai Li, Mark Tremayne, Jun Yang, and Cong Yu. 2015. The quest to automate fact-checking. world (2015).Google Scholar
Naeemul Hassan, Gensheng Zhang, Fatma Arslan, Josue Caraballo, Damian Jimenez, Siddhant Gawsane, Shohedul Hasan, Minumol Joseph, Aaditya Kulkarni, Anil Kumar Nayak, Vikas Sable, Chengkai Li, and Mark Tremayne. 2017. ClaimBuster: The First-ever End-to-end Fact-checking System. PVLDB 10, 12 (2017), 1945–1948.Google ScholarDigital Library
Stefan Helmstetter and Heiko Paulheim. 2018. Weakly Supervised Learning for Fake News Detection on Twitter. In IEEE/ACM 2018 International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2018, Barcelona, Spain, August 28-31, 2018. 274–277.Google Scholar
Karl Moritz Hermann, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. 2015. Teaching machines to read and comprehend. In Advances in Neural Information Processing Systems. 1693–1701.Google Scholar
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780.Google ScholarDigital Library
Georgi Karadzhov, Preslav Nakov, Lluís Màrquez, Alberto Barrón-Cedeño, and Ivan Koychev. 2017. Fully Automated Fact Checking Using External Sources. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, Varna, Bulgaria, September 2 - 8, 2017. 344–353.Google ScholarCross Ref
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980(2014).Google Scholar
Lars Konieczny. 2016. Locality and parsing complexity. Journal of psycholinguistic research 29 (2016), 627–645. Issue 2.Google ScholarCross Ref
Peng Li, Wei Li, Zhengyan He, Xuguang Wang, Ying Cao, Jie Zhou, and Wei Xu. 2016. Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering. CoRR abs/1607.06275(2016). http://arxiv.org/abs/1607.06275 Withdrawn.Google Scholar
Zhouhan Lin, Minwei Feng, Cícero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, and Yoshua Bengio. 2017. A Structured Self-attentive Sentence Embedding. CoRR abs/1703.03130(2017). http://arxiv.org/abs/1703.03130Google Scholar
Charles X Ling, Jin Huang, Harry Zhang, 2003. AUC: a statistically consistent and more discriminating measure than accuracy. In IJCAI, Vol. 3. 519–524.Google Scholar
Jing Ma, Wei Gao, and Kam-Fai Wong. 2017. Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers. 708–717.Google ScholarCross Ref
Delia Mocanu, Luca Rossi, Qian Zhang, Marton Karsai, and Walter Quattrociocchi. 2015. Collective attention in the age of (mis) information. Computers in Human Behavior 51 (2015), 1198–1204.Google ScholarDigital Library
Kashyap Popat. 2017. Assessing the Credibility of Claims on the Web. In Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, April 3-7, 2017. 735–739.Google ScholarDigital Library
Kashyap Popat, Subhabrata Mukherjee, Andrew Yates, and Gerhard Weikum. 2018. DeClarE: Debunking Fake News and False Claims using Evidence-Aware Deep Learning. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018. 22–32.Google ScholarCross Ref
Martin Potthast, Johannes Kiesel, Kevin Reinartz, Janek Bevendorff, and Benno Stein. 2017. A Stylometric Inquiry into Hyperpartisan and Fake News. abs/1702.05638 (2017). http://arxiv.org/abs/1702.05638Google Scholar
Hannah Rashkin, Eunsol Choi, Jin Yea Jang, Svitlana Volkova, and Yejin Choi. 2017. Truth of varying shades: Analyzing language in fake news and political fact-checking. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2931–2937.Google ScholarCross Ref
Natali Ruchansky, Sungyong Seo, and Yan Liu. 2017. CSI: A Hybrid Deep Model for Fake News Detection. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM 2017, Singapore, November 06 - 10, 2017. 797–806.Google ScholarDigital Library
Michela Del Vicario, Alessandro Bessi, Fabiana Zollo, Fabio Petroni, Antonio Scala, Guido Caldarelli, H. Eugene Stanley, and Walter Quattrociocchi. 2016. The spreading of misinformation online. PNAS 113, 3 (2016), 554–559.Google ScholarCross Ref
William Yang Wang. 2017. ”Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 2: Short Papers. 422–426.Google ScholarCross Ref
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. In International conference on machine learning. 2048–2057.Google ScholarDigital Library
Jiawei Zhang, Limeng Cui, Yanjie Fu, and Fisher B Gouza. 2018. Fake news detection with deep diffusive network model. arXiv preprint arXiv:1805.08751(2018).Google Scholar

Recommendations

Attention-based C-BiLSTM for fake news detection
Abstract
Social media platforms have radically transformed the creation and dissemination of news. Users can easily access this news in a fast and efficient manner. However, some users might post negative and fraudulent content in the form of ...
Highlights
- Tackles fake news detection problem in a multi-class environment.
- Employed pre-...
Read More
The Fake News Bible: A guide to fake news. How do they start? Who originates fake news? How do they become viral? Answers to questions on a new phenomenon that is changing our lives.
Read More
Learning Textual Representations from Multiple Modalities to Detect Fake News Through One-Class Learning
WebMedia '21: Proceedings of the Brazilian Symposium on Multimedia and the Web

Fake news can rapidly spread through internet users. Approaches proposed in the literature for content classification usually learn models considering textual and contextual features from real and fake news to minimize the spread of disinformation. One ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

WebSci '20: Proceedings of the 12th ACM Conference on Web Science
July 2020
361 pages
ISBN:9781450379892
DOI:10.1145/3394231

Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 6 July 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
deep learning
fake news classification
recurrent neural networks
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate218of875submissions,25%
Upcoming Conference
Websci '24

Sponsor:

sigweb

16th ACM Web Science Conference

May 21 - 24, 2024

Stuttgart , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 293
  Total Downloads
- Downloads (Last 12 months)32
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

ACT : Automatic Fake News Classification Through Self-Attention

WebSci '20: Proceedings of the 12th ACM Conference on Web Science

ABSTRACT

Supplemental Material

References

Cited By

Recommendations

Attention-based C-BiLSTM for fake news detection

The Fake News Bible: A guide to fake news. How do they start? Who originates fake news? How do they become viral? Answers to questions on a new phenomenon that is changing our lives.

Learning Textual Representations from Multiple Modalities to Detect Fake News Through One-Class Learning

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

ACT : Automatic Fake News Classification Through Self-Attention

WebSci '20: Proceedings of the 12th ACM Conference on Web Science

ABSTRACT

Supplemental Material

References

Cited By

Recommendations

Attention-based C-BiLSTM for fake news detection

The Fake News Bible: A guide to fake news. How do they start? Who originates fake news? How do they become viral? Answers to questions on a new phenomenon that is changing our lives.

Learning Textual Representations from Multiple Modalities to Detect Fake News Through One-Class Learning

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media