Abstract
Social media and news outlets facilitate information sharing, while the Web is flooded by information posted online on a daily basis. However, content may be differently transmitted from case to case, based on the authors’ intentions and vocabulary, to the extent that it generates completely opposite points of view. As such, fake news have become a global phenomenon, and recent events highlight a high impact of distorted or fake information, especially on the political side, when candidates’ discourses include tendentious statements that require careful validation before completely trusting the source. This paper proposes an automated analysis of political statements in Romanian by applying different state-of-the-art Natural Language Processing techniques, and evaluating the importance of context in determining their veracity. Our corpus consists of entries from Factual, a recent Romanian fact-checking initiative that assembled a list of public statements, alongside relevant contextual information for their interpretation. Our results are comparable to similar experiments performed on the PolitiFact dataset, and represent a strong baseline for experiments in low-resource languages, like Romanian.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
https://pypi.org/project/beautifulsoup4/ Retrieved March 15, 2021.
References
Allcott, H., Gentzkow, M.: Social media and fake news in the 2016 election. J. Econ. Perspect. 31(2), 211–36 (2017)
Bovet, A., Makse, H.A.: Influence of fake news in twitter during the 2016 us presidential election. Nat. Commun. 10(1), 1–14 (2019)
Deligiannis, N., Huu, T., Nguyen, D.M., Luo, X.: Deep learning for geolocating social media users and detecting fake news. In: NATO Workshop (2018)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, pp. 4171–4186 (2019)
Dizikes, P.: Study: On twitter, false news travels faster than true stories (2018). https://news.mit.edu/2018/study-twitter-false-news-travels-faster-true-stories-0308
Hanselowski, A., Zhang, H., Li, Z., Sorokin, D., Schiller, B., Schulz, C., Gurevych, I.: Ukp-athene: Multi-sentence Textual Entailment for Claim Verification. arXiv:1809.01479 (2018)
Kaliyar, R.K., Goswami, A., Narang, P.: Fakebert: fake news detection in social media with a bert-based deep learning approach. Multimedia Tools Appl. 80, 11765–11788 (2021)
Karimi, H., Roy, P., Saba-Sadiya, S., Tang, J.: Multi-source multi-class fake news detection. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 1546–1557. Association for Computational Linguistics, Santa Fe, New Mexico, USA (Aug 2018). https://www.aclweb.org/anthology/C18-1131
Kirilin, A., Strube, M.: Exploiting a speakers credibility to detect fake news. In: Proceedings of Data Science, Journalism and Media Workshop at KDD (DSJM18) (2018)
Lazer, D.M., Baum, M.A., Benkler, Y., Berinsky, A.J., Greenhill, K.M., Menczer, F., Metzger, M.J., Nyhan, B., Pennycook, G., Rothschild, D., et al.: The science of fake news. Science 359(6380), 1094–1096 (2018)
Masala, M., Ruseti, S., Dascalu, M.: RoBERT—A Romanian BERT Model. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 6626–6637 (2020)
Oshikawa, R., Qian, J., Wang, W.Y.: A survey on natural language processing for fake news detection. arXiv:1811.00770 (2018)
Saikh, T., De, A., Ekbal, A., Bhattacharyya, P.: A deep learning approach for automatic detection of fake news. arXiv:2005.04938 (2020)
Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H.: Fakenewsnet: a data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data 8(3), 171–188 (2020)
Singhal, S., Shah, R.R., Chakraborty, T., Kumaraguru, P., Satoh, S.: Spotfake: A multi-modal framework for fake news detection. In: 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM), pp. 39–47. IEEE (2019)
Thorne, J., Vlachos, A., Christodoulopoulos, C., Mittal, A.: Fever: a large-scale dataset for fact extraction and verification. arXiv:1803.05355 (2018)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017). https://doi.org/10.1017/S0140525X16001837, http://papers.nips.cc/paper/7181-attention-is-all-you-need, http://arxiv.org/abs/1706.03762
Vlachos, A., Riedel, S.: Fact checking: task definition and dataset construction. In: Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science, pp. 18–22 (2014)
Vo, N., Lee, K.: The rise of guardians: fact-checking URL recommendation to combat fake news. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 275–284 (2018)
Vosoughi, S., Roy, D., Aral, S.: The spread of true and false news online. Science 359(6380), 1146–1151 (2018)
Wang, W.Y.: Liar, liar pants on fire: a new benchmark dataset for fake news detection. arXiv:1705.00648 (2017)
Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., Cistac, P., Rault, T., Louf, R., Funtowicz, M., Davison, J., Shleifer, S., von Platen, P., Ma, C., Jernite, Y., Plu, J., Xu, C., Scao, T.L., Gugger, S., Drame, M., Lhoest, Q., Rush, A.M.: Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 38–45. Association for Computational Linguistics, Online (Oct 2020). https://www.aclweb.org/anthology/2020.emnlp-demos.6
Acknowledgements
This work was supported by a grant of the Romanian Ministry of Education and Research, CNCS—UEFISCDI, project number PN-III-P1-1.1-TE-2019-1794, within PNCDI III. We would like to thank Ana Poenariu, the coordinator of the Factual project, for sharing the data and for her ongoing efforts to fight fake news in politics.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Busioc, C., Dumitru, V., Ruseti, S., Terian-Dan, S., Dascalu, M., Rebedea, T. (2022). What Are the Latest Fake News in Romanian Politics? An Automated Analysis Based on BERT Language Models. In: Mealha, Ó., Dascalu, M., Di Mascio, T. (eds) Ludic, Co-design and Tools Supporting Smart Learning Ecosystems and Smart Education. Smart Innovation, Systems and Technologies, vol 249. Springer, Singapore. https://doi.org/10.1007/978-981-16-3930-2_16
Download citation
DOI: https://doi.org/10.1007/978-981-16-3930-2_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3929-6
Online ISBN: 978-981-16-3930-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)