In Search of Credible News

Hardalov, Momchil; Koychev, Ivan; Nakov, Preslav

doi:10.1007/978-3-319-44748-3_17

In Search of Credible News

Momchil Hardalov¹⁵,
Ivan Koychev¹⁵ &
Preslav Nakov¹⁶

Conference paper
First Online: 18 August 2016

2351 Accesses
43 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9883))

Abstract

We study the problem of finding fake online news. This is an important problem as news of questionable credibility have recently been proliferating in social media at an alarming scale. As this is an understudied problem, especially for languages other than English, we first collect and release to the research community three new balanced credible vs. fake news datasets derived from four online sources. We then propose a language-independent approach for automatically distinguishing credible from fake news, based on a rich feature set. In particular, we use linguistic (n-gram), credibility-related (capitalization, punctuation, pronoun use, sentiment polarity), and semantic (embeddings and DBPedia data) features. Our experiments on three different testsets show that our model can distinguish credible from fake news with very high accuracy.

This is a preview of subscription content, log in via an institution.

Notes

References

Brill, A.M.: Online journalists embrace new marketing function. Newsp. Res. J. 22(2), 28 (2001)
Article Google Scholar
Cassidy, W.P.: Online news credibility: an examination of the perceptions of newspaper journalists. J. Comput.-Mediat. Commun. 12(2), 478–498 (2007)
Article Google Scholar
Castillo, C., Mendoza, M., Poblete, B.: Predicting information credibility in time-sensitive social media. Internet Res. 23(5), 560–588 (2013)
Article Google Scholar
Graves, L.: Deciding what’s true: fact-checking journalism and the new ecology of news. Ph.D. thesis, Columbia University (2013)
Google Scholar
Johnson, T.J., Kaye, B.K., Bichard, S.L., Wong, W.J.: Every blog has its day: politically-interested internet users perceptions of blog credibility. J. Comput.-Mediat. Commun. 13(1), 100–122 (2007)
Article Google Scholar
Kapukaranov, B., Preslav, N.: Fine-grained sentiment analysis for movie reviews in Bulgarian. In: Proceedings of Recent Advances in Natural Language Processing, RANLP 2015, Hissar, Bulgaria, pp. 266–274 (2015)
Google Scholar
Ketterer, S.: Teaching students how to evaluate and use online resources. Journal. Mass Commun. Educ. 52(4), 4 (1998)
Article Google Scholar
Kohonen, T.: Improved versions of learning vector quantization. In: IJCNN International Joint Conference on Neural Networks, pp. 545–550 (1990)
Google Scholar
Liu, D.C., Nocedal, J.: On the limited memory BFGS method for large scale optimization. Math. Program. 45(1–3), 503–528 (1989)
Article MathSciNet MATH Google Scholar
Mihalcea, R., Strapparava, C.: Making computers laugh: investigations in automatic humor recognition. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, HLT-EMNLP 2005, Vancouver, British Columbia, Canada, pp. 531–538 (2005)
Google Scholar
Mihaylov, T., Georgiev, G., Nakov, P.: Finding opinion manipulation trolls in news community forums. In: Proceedings of 19th Conference on Computational Natural Language Learning, CoNLL 2015, Beijing, China, pp. 310–314 (2015)
Google Scholar
Mihaylov, T., Koychev, I., Georgiev, G., Nakov, P.: Exposing paid opinion manipulation trolls. In: Proceedings of International Conference Recent Advances in Natural Language Processing, RANLP 2015, Hissar, Bulgaria, pp. 443–450 (2015)
Google Scholar
Mihaylov, T., Nakov, P.: Hunting for troll comments in news community forums. In: Proceedings of 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, Berlin, Germany (2016)
Google Scholar
Papadopoulos, S., Bontcheva, K., Jaho, E., Lupu, M., Castillo, C.: Overview of the special issue on trust, veracity of information in social media. ACM Trans. Inf. Syst. 34(3), 14:1–14:5 (2016)
Article Google Scholar
Yang, D., Lavie, A., Dyer, C., Hovy, E.: Humor recognition and humor anchor extraction. In: Proceedings of 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, Portugal, pp. 2367–2376 (2015)
Google Scholar
Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, Boston, MA, p. 10 (2010)
Google Scholar
Zou, H., Hastie, T.: Regularization and variable selection via the elastic net. J. Roy. Stat. Soc.: Ser. B (Stat. Methodol.) 67(2), 301–320 (2005)
Article MathSciNet MATH Google Scholar
Zubiaga, A., Hoi, G.W.S., Liakata, M., Procter, R., Tolmie, P.: Analysing how people orient to and spread rumours in social media by looking at conversational threads (2015). arXiv preprint arXiv:1511.07487
Zubiaga, A., Ji, H.: Tweet, but verify: epistemic study of information verification on Twitter. Soc. Netw. Anal. Min. 4(1), 1–12 (2014)
Article Google Scholar

Download references

Acknowledgments

This research was performed by Momchil Hardalov, a student in Computer Science in the Sofia University “St Kliment Ohridski”, as part of his M.Sc. thesis. It is also part of the Interactive sYstems for Answer Search (Iyas) project, which is developed by the Arabic Language Technologies (ALT) group at the Qatar Computing Research Institute (QCRI), HBKU, part of Qatar Foundation in collaboration with MIT-CSAIL.

Author information

Authors and Affiliations

FMI, Sofia University “St. Kliment Ohridski”, Sofia, Bulgaria
Momchil Hardalov & Ivan Koychev
Qatar Computing Research Institute, HBKU, Doha, Qatar
Preslav Nakov

Authors

Momchil Hardalov
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Koychev
View author publications
You can also search for this author in PubMed Google Scholar
Preslav Nakov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Momchil Hardalov .

Editor information

Editors and Affiliations

Winston-Salem State University, Winston Salem, North Carolina, USA
Christo Dichev
Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, Sofia, Bulgaria
Gennady Agre

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hardalov, M., Koychev, I., Nakov, P. (2016). In Search of Credible News. In: Dichev, C., Agre, G. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2016. Lecture Notes in Computer Science(), vol 9883. Springer, Cham. https://doi.org/10.1007/978-3-319-44748-3_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-44748-3_17
Published: 18 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44747-6
Online ISBN: 978-3-319-44748-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics