What Are the Latest Fake News in Romanian Politics? An Automated Analysis Based on BERT Language Models

Busioc, Costin; Dumitru, Vlad; Ruseti, Stefan; Terian-Dan, Simina; Dascalu, Mihai; Rebedea, Traian

doi:10.1007/978-981-16-3930-2_16

What Are the Latest Fake News in Romanian Politics? An Automated Analysis Based on BERT Language Models

Costin Busioc⁶,
Vlad Dumitru⁶,
Stefan Ruseti⁶,
Simina Terian-Dan⁷,
Mihai Dascalu^6,8 &
…
Traian Rebedea⁶

Conference paper
First Online: 31 August 2021

334 Accesses
4 Citations

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 249))

Abstract

Social media and news outlets facilitate information sharing, while the Web is flooded by information posted online on a daily basis. However, content may be differently transmitted from case to case, based on the authors’ intentions and vocabulary, to the extent that it generates completely opposite points of view. As such, fake news have become a global phenomenon, and recent events highlight a high impact of distorted or fake information, especially on the political side, when candidates’ discourses include tendentious statements that require careful validation before completely trusting the source. This paper proposes an automated analysis of political statements in Romanian by applying different state-of-the-art Natural Language Processing techniques, and evaluating the importance of context in determining their veracity. Our corpus consists of entries from Factual, a recent Romanian fact-checking initiative that assembled a list of public statements, alongside relevant contextual information for their interpretation. Our results are comparable to similar experiments performed on the PolitiFact dataset, and represent a strong baseline for experiments in low-resource languages, like Romanian.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
https://www.politifact.com/.
2.
https://www.factual.ro/.
3.
https://pypi.org/project/beautifulsoup4/ Retrieved March 15, 2021.

References

Allcott, H., Gentzkow, M.: Social media and fake news in the 2016 election. J. Econ. Perspect. 31(2), 211–36 (2017)
Article Google Scholar
Bovet, A., Makse, H.A.: Influence of fake news in twitter during the 2016 us presidential election. Nat. Commun. 10(1), 1–14 (2019)
Article Google Scholar
Deligiannis, N., Huu, T., Nguyen, D.M., Luo, X.: Deep learning for geolocating social media users and detecting fake news. In: NATO Workshop (2018)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, pp. 4171–4186 (2019)
Google Scholar
Dizikes, P.: Study: On twitter, false news travels faster than true stories (2018). https://news.mit.edu/2018/study-twitter-false-news-travels-faster-true-stories-0308
Hanselowski, A., Zhang, H., Li, Z., Sorokin, D., Schiller, B., Schulz, C., Gurevych, I.: Ukp-athene: Multi-sentence Textual Entailment for Claim Verification. arXiv:1809.01479 (2018)
Kaliyar, R.K., Goswami, A., Narang, P.: Fakebert: fake news detection in social media with a bert-based deep learning approach. Multimedia Tools Appl. 80, 11765–11788 (2021)
Article Google Scholar
Karimi, H., Roy, P., Saba-Sadiya, S., Tang, J.: Multi-source multi-class fake news detection. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 1546–1557. Association for Computational Linguistics, Santa Fe, New Mexico, USA (Aug 2018). https://www.aclweb.org/anthology/C18-1131
Kirilin, A., Strube, M.: Exploiting a speakers credibility to detect fake news. In: Proceedings of Data Science, Journalism and Media Workshop at KDD (DSJM18) (2018)
Google Scholar
Lazer, D.M., Baum, M.A., Benkler, Y., Berinsky, A.J., Greenhill, K.M., Menczer, F., Metzger, M.J., Nyhan, B., Pennycook, G., Rothschild, D., et al.: The science of fake news. Science 359(6380), 1094–1096 (2018)
Article Google Scholar
Masala, M., Ruseti, S., Dascalu, M.: RoBERT—A Romanian BERT Model. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 6626–6637 (2020)
Google Scholar
Oshikawa, R., Qian, J., Wang, W.Y.: A survey on natural language processing for fake news detection. arXiv:1811.00770 (2018)
Saikh, T., De, A., Ekbal, A., Bhattacharyya, P.: A deep learning approach for automatic detection of fake news. arXiv:2005.04938 (2020)
Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H.: Fakenewsnet: a data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data 8(3), 171–188 (2020)
Article Google Scholar
Singhal, S., Shah, R.R., Chakraborty, T., Kumaraguru, P., Satoh, S.: Spotfake: A multi-modal framework for fake news detection. In: 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM), pp. 39–47. IEEE (2019)
Google Scholar
Thorne, J., Vlachos, A., Christodoulopoulos, C., Mittal, A.: Fever: a large-scale dataset for fact extraction and verification. arXiv:1803.05355 (2018)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017). https://doi.org/10.1017/S0140525X16001837, http://papers.nips.cc/paper/7181-attention-is-all-you-need, http://arxiv.org/abs/1706.03762
Vlachos, A., Riedel, S.: Fact checking: task definition and dataset construction. In: Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science, pp. 18–22 (2014)
Google Scholar
Vo, N., Lee, K.: The rise of guardians: fact-checking URL recommendation to combat fake news. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 275–284 (2018)
Google Scholar
Vosoughi, S., Roy, D., Aral, S.: The spread of true and false news online. Science 359(6380), 1146–1151 (2018)
Article Google Scholar
Wang, W.Y.: Liar, liar pants on fire: a new benchmark dataset for fake news detection. arXiv:1705.00648 (2017)
Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., Cistac, P., Rault, T., Louf, R., Funtowicz, M., Davison, J., Shleifer, S., von Platen, P., Ma, C., Jernite, Y., Plu, J., Xu, C., Scao, T.L., Gugger, S., Drame, M., Lhoest, Q., Rush, A.M.: Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 38–45. Association for Computational Linguistics, Online (Oct 2020). https://www.aclweb.org/anthology/2020.emnlp-demos.6

Download references

Acknowledgements

This work was supported by a grant of the Romanian Ministry of Education and Research, CNCS—UEFISCDI, project number PN-III-P1-1.1-TE-2019-1794, within PNCDI III. We would like to thank Ana Poenariu, the coordinator of the Factual project, for sharing the data and for her ongoing efforts to fight fake news in politics.

Author information

Authors and Affiliations

Department of Computer Science, University Politehnica of Bucharest, 313 Splaiul Independentei, 060042, Bucharest, Romania
Costin Busioc, Vlad Dumitru, Stefan Ruseti, Mihai Dascalu & Traian Rebedea
Department of Romance Studies, Lucian Blaga University of Sibiu, 10 Bld Victoriei, 550024, Sibiu, Romania
Simina Terian-Dan
Academy of Romanian Scientists, 3 Str. Ilfov, 050044, Bucharest, Romania
Mihai Dascalu

Authors

Costin Busioc
View author publications
You can also search for this author in PubMed Google Scholar
Vlad Dumitru
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Ruseti
View author publications
You can also search for this author in PubMed Google Scholar
Simina Terian-Dan
View author publications
You can also search for this author in PubMed Google Scholar
Mihai Dascalu
View author publications
You can also search for this author in PubMed Google Scholar
Traian Rebedea
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mihai Dascalu .

Editor information

Editors and Affiliations

Department of Communication and Art/DigiMedia, University of Aveiro, Aveiro, Portugal
Óscar Mealha
Department of Computer Science, University Politehnica of Bucharest, Bucharest, Romania
Mihai Dascalu
Department of Engineering Computer Science and Mathematics, University of L’Aquila, L’Aquila, Italy
Tania Di Mascio

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Busioc, C., Dumitru, V., Ruseti, S., Terian-Dan, S., Dascalu, M., Rebedea, T. (2022). What Are the Latest Fake News in Romanian Politics? An Automated Analysis Based on BERT Language Models. In: Mealha, Ó., Dascalu, M., Di Mascio, T. (eds) Ludic, Co-design and Tools Supporting Smart Learning Ecosystems and Smart Education. Smart Innovation, Systems and Technologies, vol 249. Springer, Singapore. https://doi.org/10.1007/978-981-16-3930-2_16

Download citation

DOI: https://doi.org/10.1007/978-981-16-3930-2_16
Published: 31 August 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3929-6
Online ISBN: 978-981-16-3930-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics