Abstract
Easy and quick information diffusion on the web and especially in social media has been rapidly proliferating during the past decades. As information is posted without any kind of verification of its veracity, fake news has become a problem of great influence in our information driven society. Thus, to mitigate the consequences of fake news and its propagation, automated approaches to detect malicious content were created. This paper proposes an effective framework that utilizes only the text features of the news. We evaluate several features for differentiating fake from real news and we identify the best performing feature set that maximizes performance, using feature selection techniques. Text representation features were also explored as a potential solution. Additionally, the most popular Machine Learning and Deep Learning models were tested to conclude to the model that achieves the maximum accuracy. Our findings reveal that a combination of linguistic features and text-based word vector representations through ensemble methods can predict fake news with high accuracy. eXtreme Gradient Boosting (XGB) outperformed all other models, while linear Support Vector Machine (SVM) achieved comparable results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Gravanis, G., Vakali, A., Diamantaras, K., Karadais, P.: Behind the cues: a benchmarking study for fake news detection. Expert Syst Appl. 128, 201–213 (2019). https://doi.org/10.1016/j.eswa.2019.03.036
Verma, P.K., Agrawal, P., Amorim, I., Prodan, R.: WELFake: word embedding over linguistic features for fake news detection. IEEE Trans Comput Soc Syst. 8, 881–893 (2021). https://doi.org/10.1109/TCSS.2021.3068519
Kasseropoulos, D.P., Tjortjis, C.: An approach utilizing linguistic features for fake news detection. In: Maglogiannis, I., Macintyre, J., Iliadis, L. (eds.) AIAI 2021. IAICT, vol. 627, pp. 646–658. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-79150-6_51
Kasseropoulos, D.P., Koukaras, P., Tjortjis, C.: Exploiting textual information for fake news detection. Int J Neural Syst. 32, 2250058 (2022). https://doi.org/10.1142/S0129065722500587
Chouliara, V., Kapoteli, E., Koukaras, P., Tjortjis, C.: Social media sentiment analysis related to COVID-19 vaccinations. In: Peng, L.C., Vaidya, A., Chen, Y.-W., Jain, V., Jain, L.C. (eds.) Artificial Intelligence and Machine Learning for Healthcare: Vol. 2: Emerging Methodologies and Trends, pp. 47–69. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-11170-9_3
Zhang, X., Ghorbani, A.A.: An overview of online fake news: Characterization, detection, and discussion. Inf. Process Manag. 57, 102025 (2020). https://doi.org/10.1016/j.ipm.2019.03.004
Zhou, X., Zafarani, R.: A survey of fake news: fundamental theories, detection methods, and opportunities. ACM Comput Surv. 53, 1–40 (2020). https://doi.org/10.1145/3395046
Rashkin, H., Choi, E., Jang, J.Y., Volkova, S., Choi, Y.: Truth of varying shades: analyzing language in fake news and political fact-checking. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 2931–2937. Association for Computational Linguistics, Copenhagen (2017)
Hobbs, R.: Propaganda in an age of algorithmic personalization: expanding literacy research and practice. Read Res Q. 55, 521–533 (2020). https://doi.org/10.1002/rrq.301
Zannettou, S., Sirivianos, M., Blackburn, J., Kourtellis, N.: The web of false information: rumors, fake news, hoaxes, clickbait, and various other shenanigans. J. Data Inf. Qual. 11, 1–37 (2019). https://doi.org/10.1145/3309699
Swami, V.: Social psychological origins of conspiracy theories: the case of the Jewish conspiracy theory in Malaysia. Front Psychol. 3 (2012). https://doi.org/10.3389/fpsyg.2012.00280
Alkhodair, S.A., Ding, S.H.H., Fung, B.C.M., Liu, J.: Detecting breaking news rumors of emerging topics in social media. Inf. Process Manag. 57 (2020). https://doi.org/10.1016/j.ipm.2019.02.016
Horne, B., Adali, S.: This just in: fake news packs a lot in title, uses simpler, repetitive content in text body, more similar to satire than real news. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11 (2017). https://doi.org/10.1609/icwsm.v11i1.14976
Shu, K., Sliva, A., Wang, S., Tang, J., Liu, H.: Fake news detection on social media: a data mining perspective. SIGKDD Explor. Newsl. 19, 22–36 (2017). https://doi.org/10.1145/3137597.3137600
de Beer, D., Matthee, M.: Approaches to identify fake news: a systematic literature review. In: Antipova, T. (ed.) ICIS 2020. LNNS, vol. 136, pp. 13–22. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-49264-9_2
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on twitter. In: Proceedings of the 20th International Conference on World Wide Web, pp. 675–684. Association for Computing Machinery, New York (2011)
Jin, Z., Cao, J., Zhang, Y., Luo, J.: News verification by exploiting conflicting social viewpoints in microblogs. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 30 (2016). https://doi.org/10.1609/aaai.v30i1.10382
Raza, S., Ding, C.: Fake news detection based on news content and social contexts: a transformer-based approach. Int. J. Data Sci. Anal. 13, 335–362 (2022). https://doi.org/10.1007/s41060-021-00302-z
Horne, B.D., NØrregaard, J., Adali, S.: Robust fake news detection over time and attack. ACM Trans. Intell. Syst. Technol. 11 (2019). https://doi.org/10.1145/3363818
Ahmed, H., Traore, I., Saad, S.: Detecting opinion spams and fake news using text classification. Secur. Priv. 1, e9 (2018). https://doi.org/10.1002/spy2.9
Ahmed, H., Traore, I., Saad, S.: Detection of online fake news using N-gram analysis and machine learning techniques. In: Traore, I., Woungang, I., Awad, A. (eds.) ISDDC 2017. LNCS, vol. 10618, pp. 127–138. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69155-8_9
Zhou, L., Burgoon, J.K., Nunamaker, J.F., Twitchell, D.: Automating linguistics-based cues for detecting deception in text-based asynchronous computer-mediated communications. Group Decis. Negot. 13(1), 81–106 (2004). https://doi.org/10.1023/B:GRUP.0000011944.62889.6f
Burgoon, J.K., Blair, J.P., Qin, T., Nunamaker, J.F.: Detecting deception through linguistic analysis. In: Chen, H., Miranda, R., Zeng, D.D., Demchak, C., Schroeder, J., Madhusudan, T. (eds.) ISI 2003. LNCS, vol. 2665, pp. 91–101. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-44853-5_7
Kursa, M.B., Rudnicki, W.R.: Feature selection with the boruta package. J. Stat Softw. 36, 1–13 (2010). https://doi.org/10.18637/jss.v036.i11
Zervopoulos, A., Alvanou, A.G., Bezas, K., Papamichail, A., Maragoudakis, M., Kermanidis, K.: Deep learning for fake news detection on Twitter regarding the 2019 Hong Kong protests. Neural Comput. Appl. 34(2), 969–982 (2021). https://doi.org/10.1007/s00521-021-06230-0
Acknowledgements
This research is co-financed by Greece and the European Union (European Social Fund-SF) through the Operational Program “Human Resources Development, Education and Lifelong Learning 2014–2020” in the context of the project “Support for International Actions of the International Hellenic University”, (MIS 5154651).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 IFIP International Federation for Information Processing
About this paper
Cite this paper
Chouliara, V., Koukaras, P., Tjortjis, C. (2023). Fake News Detection Utilizing Textual Cues. In: Maglogiannis, I., Iliadis, L., MacIntyre, J., Dominguez, M. (eds) Artificial Intelligence Applications and Innovations. AIAI 2023. IFIP Advances in Information and Communication Technology, vol 675. Springer, Cham. https://doi.org/10.1007/978-3-031-34111-3_33
Download citation
DOI: https://doi.org/10.1007/978-3-031-34111-3_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34110-6
Online ISBN: 978-3-031-34111-3
eBook Packages: Computer ScienceComputer Science (R0)