Skip to main content

Machine Learning Methods for Fake News Classification

  • Conference paper
  • First Online:
Intelligent Data Engineering and Automated Learning – IDEAL 2019 (IDEAL 2019)

Abstract

The problem of the fake news publication is not new and it already has been reported in ancient ages, but it has started having a huge impact especially on social media users. Such false information should be detected as soon as possible to avoid its negative influence on the readers and in some cases on their decisions, e.g., during the election. Therefore, the methods which can effectively detect fake news are the focus of intense research. This work focuses on fake news detection in articles published online and on the basis of extensive research we confirmed that chosen machine learning algorithms can distinguish them from reliable information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Sabrina Tavernise, As Fake News Spreads Lies, More Readers Shrug at the Truth, The New York Times, Dec. 6, 2016, https://www.nytimes.com/2016/12/06/us/fake-news-partisan-republican-democrat.html.

  2. 2.

    Michael Peel, Fake news: How Lithuani’s ‘elves’ take on Russian trolls, Financial Times, Feb. 4, 2019, https://www.ft.com/content/b3701b12-2544-11e9-b329-c7e6ceb5ffdf.

  3. 3.

    https://www.kaggle.com/mrisdal/fake-news.

  4. 4.

    https://github.com/xehivs/fakenews.

References

  1. Afroz, S., Brennan, M., Greenstadt, R.: Detecting hoaxes, frauds, and deception in writing style online. In: Proceedings of the 2012 IEEE Symposium on Security and Privacy, SP 2012, Washington, DC, USA, pp. 461–475. IEEE Computer Society (2012). https://doi.org/10.1109/SP.2012.34

  2. Atodiresei, C.S., Tănăselea, A., Iftene, A.: Identifying fake news and fake users on Twitter. Procedia Comput. Sci. 126, 451–461 (2018). https://doi.org/10.1016/j.procs.2018.07.279. http://www.sciencedirect.com/science/article/pii/S1877050918312559. Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 22nd International Conference, KES-2018, Belgrade, Serbia

    Article  Google Scholar 

  3. Bondielli, A., Marcelloni, F.: A survey on fake news and rumour detection techniques. Inf. Sci. 497, 38–55 (2019). https://doi.org/10.1016/j.ins.2019.05.035. http://www.sciencedirect.com/science/article/pii/S0020025519304372

    Article  Google Scholar 

  4. Castillo, C., Mendoza, M., Poblete, B.: Information credibility on Twitter. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, pp. 675–684. ACM, New York (2011). https://doi.org/10.1145/1963405.1963500, http://doi.acm.org/10.1145/1963405.1963500

  5. Chen, C., Wu, K., Venkatesh, S., Zhang, X.: Battling the internet water army: detection of hidden paid posters. In: 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013), pp. 116–120 (2011)

    Google Scholar 

  6. Choraś, M., Giełczyk, A., Demestichas, K., Puchalski, D., Kozik, R.: Pattern recognition solutions for fake news detection. In: Saeed, K., Homenda, W. (eds.) CISIM 2018. LNCS, vol. 11127, pp. 130–139. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-99954-8_12

    Chapter  Google Scholar 

  7. Choraś, M., Pawlicki, M., Kozik, R., Demestichas, K., Kosmides, P., Gupta, M.: Socialtruth project approach to online disinformation (fake news) detection and mitigation. In: Proceedings of the 14th International Conference on Availability, Reliability and Security, p. 68. ACM (2019)

    Google Scholar 

  8. Conroy, N., Rubin, V.L., Chen, Y.: Automatic deception detection: methods for finding fake news. Proc. Assoc. Inf. Sci. Technol. 52, 1–4 (2015). https://doi.org/10.1002/pra2.2015.145052010082

    Article  Google Scholar 

  9. Ferrara, E., Varol, O., Davis, C., Menczer, F., Flammini, A.: The rise of social bots. Commun. ACM 59(7), 96–104 (2016). https://doi.org/10.1145/2818717. http://doi.acm.org/10.1145/2818717

    Article  Google Scholar 

  10. Giełczyk, A., Wawrzyniak, R., Choraś, M.: Evaluation of the existing tools for fake news detection. In: Saeed, K., Chaki, R., Janev, V. (eds.) CISIM 2019. LNCS, vol. 11703, pp. 144–151. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28957-7_13

    Chapter  Google Scholar 

  11. Gravanis, G., Vakali, A., Diamantaras, K., Karadais, P.: Behind the cues: a benchmarking study for fake news detection. Expert Syst. Appl. 128, 201–213 (2019). https://doi.org/10.1016/j.eswa.2019.03.036. http://www.sciencedirect.com/science/article/pii/S0957417419301988

    Article  Google Scholar 

  12. Horne, B.D., Adali, S.: This just. In: Fake News Packs a Lot in Title, Uses Simpler, Repetitive Content in Text Body, More Similar to Satire than Real News. CoRR abs/1703.09398 (2017). http://arxiv.org/abs/1703.09398

  13. Jin, Z., Cao, J., Zhang, Y., Zhou, J., Tian, Q.: Novel visual and statistical image features for microblogs news verification. IEEE Trans. Multimedia 19(3), 598–608 (2017). https://doi.org/10.1109/TMM.2016.2617078

    Article  Google Scholar 

  14. Ksieniewicz, P.: Combining Random Subspace approach with smote oversampling for imbalanced data classification. In: Pérez García, H., Sánchez González, L., Castejón Limas, M., Quintián Pardo, H., Corchado Rodríguez, E. (eds.) HAIS 2019. LNCS, pp. 660–673. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-29859-3_56

    Chapter  Google Scholar 

  15. Ksieniewicz, P., Woźniak, M.: Dealing with the task of imbalanced, multidimensional data classification using ensembles of exposers. In: First International Workshop on Learning with Imbalanced Domains: Theory and Applications, pp. 164–175 (2017)

    Google Scholar 

  16. Sharma, K., Qian, F., Jiang, H., Ruchansky, N., Zhang, M., Liu, Y.: Combating fake news: a survey on identification and mitigation techniques. ACM Trans. Intell. Syst. Technol. 10(3), 21:1–21:42 (2019). https://doi.org/10.1145/3305260. http://doi.acm.org/10.1145/3305260

    Article  Google Scholar 

  17. Shu, K., Sliva, A., Wang, S., Tang, J., Liu, H.: Fake news detection on social media: a data mining perspective. SIGKDD Explor. Newsl. 19(1), 22–36 (2017). https://doi.org/10.1145/3137597.3137600. http://doi.acm.org/10.1145/3137597.3137600

    Article  Google Scholar 

  18. Zhang, X., Ghorbani, A.A.: An overview of online fake news: characterization, detection, and discussion. Inf. Process. Manag. (2019). https://doi.org/10.1016/j.ipm.2019.03.004. http://www.sciencedirect.com/science/article/pii/S0306457318306794

Download references

Acknowledgement

This work is funded under SocialTruth project, which has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 825477.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Paweł Ksieniewicz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ksieniewicz, P., Choraś, M., Kozik, R., Woźniak, M. (2019). Machine Learning Methods for Fake News Classification. In: Yin, H., Camacho, D., Tino, P., Tallón-Ballesteros, A., Menezes, R., Allmendinger, R. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2019. IDEAL 2019. Lecture Notes in Computer Science(), vol 11872. Springer, Cham. https://doi.org/10.1007/978-3-030-33617-2_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-33617-2_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-33616-5

  • Online ISBN: 978-3-030-33617-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics