ABSTRACT
Today, the increasing ease of publishing information online combined with a gradual shift of paradigm from consuming news via conventional media to non-conventional media calls for a computational and automatic approach to the identification of an article's legitimacy. In this study, we propose an approach for cross-domain fake news detection focusing on the identification of legitimate content from a pool of articles that are of varying degrees of legitimacy. We present a model as a proof of concept as well as data gathered from evaluating the model on Fake-News AMT, a dataset released for cross-domain fake news detection. The results of our model are then compared against a baseline model which has served as the benchmark for the dataset. We find all results in support of our hypothesis. Our proof-of-concept model has also outperformed the benchmark in the domains Technology and Entertainment as well as when it was run on the whole dataset at once.
- A Geiger. 2019. Key findings about the online news landscape in America. Pew Research Center. Retrieved from Pew Research Center: https://www.pewresearch.org/fact-tank/2019/09/11/key-findings-about-the-online-news-landscape-in-america/Google Scholar
- Shadi Shahsavari, Pavan Holur, Timothy R. Tangherlini, Vwani Roychowdhury. 2020. Conspiracy In The Time Of Corona: Automatic Detection Of Covid-19 Conspiracy Theories In Social Media And The News. arXiv:2004.13783 Retrieved from https://arxiv.org/abs/2004.13783Google Scholar
- Martin Potthast, Johannes Kiesel, Kevin Reinartz, Janek Bevendorff, Benno Stein. 2017. A Stylometric Inquiry into Hyperpartisan and Fake News. arXiv:1702.05638. Retrieved from https://arxiv.org/abs/1702.05638Google Scholar
- Victoria L. Rubin, Yimin Chen, Nadia K. Conroy. 2016. Deception detection for news: Three types of fakes. Proc. Assoc. Info. Sci. Tech., 52: 1--4. DOI:10.1002/pra2.2015.145052010083Google ScholarCross Ref
- Jing Ma, Wei Gao, Prasenjit Mitra, Sejeong Kwon, Bernard J. Jansen, Kam-Fai Wong, and Meeyoung Cha. 2016. Detecting rumors from microblogs with recurrent neural networks. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI'16). AAAI Press, 3818--3824.Google ScholarDigital Library
- Pérez-Rosas, Verónica. Kleinberg, Bennett., Lefevre, Alexandra., Mihalcea, Rada. 2017. Automatic Detection of Fake News. arXiv:1708.07104. Retrieved from https://arxiv.org/abs/1708.07104v1Google Scholar
- Saikh, Tanik., De, Arkadipta., Asif, Ekbal., Bhattacharyya, Pushpak. 2020. A Deep Learning Approach for Automatic Detection of Fake News. arXiv:2005.04938v1. Retrieved from https://arxiv.org/abs/2005.04938v1Google Scholar
- Andreas Hanselowski, Avinesh PVS, Benjamin Schiller, Felix Caspelherr, Debanjan Chaudhuri, Christian Meyer, Iryna Gurevych. 2018. A Retrospective Analysis of the Fake News Challenge Stance Detection Task. arXiv:1806.05180. Retrieved from https://arxiv.org/abs/1806.05180Google Scholar
- Lucas Ou-Yang. 2020. Newspaper3k: Article scraping & curation. Retrieved from https://newspaper.readthedocs.ioGoogle Scholar
- Matthew Honnibal and Mark Johnson. 2015. An Improved Non-monotonic Transition System for Dependency Parsing. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (2015). DOI:10.18653/v1/d15-1162Google ScholarCross Ref
Index Terms
- Automatic Differentiation Between Legitimate and Fake News Using Named Entity Recognition
Recommendations
A deep learning-based bilingual Hindi and Punjabi named entity recognition system using enhanced word embeddings
AbstractThe increasing availability of information on the web makes the task of named entity recognition (NER) more challenging. Named entity recognition is an important pre-processor tool that is concerned with the extraction of entities of ...
Highlights- Development of enhanced word embeddings for bilingual NER system is a novel attempt.
Multidimensional Analysis of Fake News Spreaders on Twitter
Computational Data and Social NetworksAbstractSocial media has become a tool to spread false information with the help of its large complex network. The consequences of such misinformation could be very severe. The paper uses the Twitter conversations about the scrapping of Article 370 in ...
Named Entity Recognition Using Gazetteer of Hierarchical Entities
Advances and Trends in Artificial Intelligence. From Theory to PracticeAbstractThis paper presents a named entity recognition method which finds predetermined entities in an unstructured text. The method uses word similarities based on typical word transformations (lemmatization and stemming), word embeddings and character ...
Comments