Fake News Detection Utilizing Textual Cues

Chouliara, Vasiliki; Koukaras, Paraskevas; Tjortjis, Christos

doi:10.1007/978-3-031-34111-3_33

Part of the book series: IFIP Advances in Information and Communication Technology (IFIPAICT, volume 675)

Included in the following conference series:

IFIP International Conference on Artificial Intelligence Applications and Innovations

Abstract

Information has spread ever more easily and quickly on the web, and especially on social media, over the past decades. Because content is posted without any verification of its veracity, fake news has become a highly influential problem in our information-driven society. Automated approaches for detecting malicious content have therefore been developed to mitigate the consequences of fake news and its propagation. This paper proposes an effective framework that uses only the text features of news items. We evaluate several features for differentiating fake from real news and apply feature selection techniques to identify the feature set that maximizes performance. Text representation features are also explored as a potential solution. Additionally, the most popular Machine Learning and Deep Learning models are tested to identify the model that achieves the highest accuracy. Our findings reveal that combining linguistic features with text-based word vector representations through ensemble methods can predict fake news with high accuracy. eXtreme Gradient Boosting (XGB) outperformed all other models, while a linear Support Vector Machine (SVM) achieved comparable results.
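The idea of combining linguistic cues with word vector representations can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the six surface cues, the function names, and the tiny two-dimensional embedding table are hypothetical and are not the feature set or embeddings evaluated in the paper. In practice the table would be replaced by pretrained vectors, and the combined vector would be passed to a classifier such as XGB or a linear SVM, which is not shown here.

```python
import re


def linguistic_features(text):
    """Compute a few illustrative surface cues for one news text (hypothetical set)."""
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = len(words)
    return {
        "n_words": float(n_words),
        "avg_word_len": sum(len(w) for w in words) / n_words if n_words else 0.0,
        "type_token_ratio": len({w.lower() for w in words}) / n_words if n_words else 0.0,
        "avg_sentence_len": n_words / len(sentences) if sentences else 0.0,
        "exclamations": float(text.count("!")),
        "all_caps_words": float(sum(1 for w in words if len(w) > 1 and w.isupper())),
    }


def mean_vector(text, embeddings, dim):
    """Average the vectors of the words found in a toy embedding table."""
    tokens = re.findall(r"[a-z']+", text.lower())
    hits = [embeddings[t] for t in tokens if t in embeddings]
    if not hits:
        return [0.0] * dim  # no known word: fall back to the zero vector
    return [sum(v[i] for v in hits) / len(hits) for i in range(dim)]


def combined_features(text, embeddings, dim):
    """Concatenate the linguistic cues (in sorted key order) with the mean word vector."""
    cues = linguistic_features(text)
    return [cues[k] for k in sorted(cues)] + mean_vector(text, embeddings, dim)


# Toy embedding table standing in for pretrained word vectors (assumption).
TOY_EMBEDDINGS = {"news": [1.0, 0.0], "believe": [0.0, 1.0]}

sample = "SHOCKING news! You won't believe this!"
features = combined_features(sample, TOY_EMBEDDINGS, dim=2)
print(len(features))
```

Concatenation is the simplest way to merge the two feature families; it keeps each cue interpretable, which matters if feature selection is applied afterwards.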

Notes

1. https://nlp.stanford.edu/projects/glove/

Acknowledgements

This research is co-financed by Greece and the European Union (European Social Fund, ESF) through the Operational Program “Human Resources Development, Education and Lifelong Learning 2014–2020” in the context of the project “Support for International Actions of the International Hellenic University” (MIS 5154651).

Author information

Authors and Affiliations

The Data Mining and Analytics Research Group, School of Science and Technology, International Hellenic University, Thessaloniki, Greece
Vasiliki Chouliara, Paraskevas Koukaras & Christos Tjortjis

Authors

Vasiliki Chouliara
Paraskevas Koukaras
Christos Tjortjis

Corresponding author

Correspondence to Christos Tjortjis.

Editor information

Editors and Affiliations

University of Piraeus, Piraeus, Greece
Ilias Maglogiannis
Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
University of Sunderland, Sunderland, UK
John MacIntyre
University of Leon, León, Spain
Manuel Dominguez


About this paper

Cite this paper

Chouliara, V., Koukaras, P., Tjortjis, C. (2023). Fake News Detection Utilizing Textual Cues. In: Maglogiannis, I., Iliadis, L., MacIntyre, J., Dominguez, M. (eds) Artificial Intelligence Applications and Innovations. AIAI 2023. IFIP Advances in Information and Communication Technology, vol 675. Springer, Cham. https://doi.org/10.1007/978-3-031-34111-3_33

DOI: https://doi.org/10.1007/978-3-031-34111-3_33
Published: 01 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34110-6
Online ISBN: 978-3-031-34111-3
eBook Packages: Computer Science, Computer Science (R0)
