Abstract
Social media has become influential and affects large public perception. Anyone can post and share messages on social networking sites. However, not all posts are trustworthy. Many online messages contain misleading or false information. There has been an extensive research to assess the credibility of social media data. Previous studies evaluate all online messages, which may be inappropriate due to a large amount of such data that can result in ineffectiveness of the system. This paper studies and presents the effects of the inclusion of such data—namely, non-newsworthy messages—in credibility assessment. Our findings affirm a negative effect of training a model with non-newsworthy data. The degree of performance degradation is also shown to have a strong connection to a degree of non-newsworthiness in training data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
References
Cerón-Guzmán, J.A., León, E.: Detecting social spammers in colombia 2014 presidential election. In: Lagunas, O.P., Alcántara, O.H., Figueroa, G.A. (eds.) MICAI 2015. LNCS (LNAI), vol. 9414, pp. 121–141. Springer, Cham (2015). doi:10.1007/978-3-319-27101-9_9
Aggarwal, A., Rajadesingan, A., Kumaraguru, P.: PhishAri: Automatic realtime phishing detection on Twitter. In: Proceedings of the 2012 eCrime Researchers Summit (eCrime 2012), Las Croabas, PR, USA, 23–24 October 2012, pp. 1–12. IEEE, Piscataway, NJ, USA (2012)
Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Chapman and Hall/CRC, Boca Raton, FL, USA (1984)
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on twitter. Proceedings of the 20th International Conference on World Wide Web (WWW 2011), Hyderabad, India, 28 March–1 April 2011, pp. 675–684. ACM, New York, NY, USA (2011)
Gupta, A., Kumaraguru, P.: Credibility ranking of tweets during high impact events. In: Proceedings of the 1st Workshop on Privacy and Security in Online Social Media (PSOSM 2012), Lyon, France, 17 April 2012, no. 2. ACM, New York, NY, USA (2012)
Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: Real-time credibility assessment of content on twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). doi:10.1007/978-3-319-13734-6_16
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Burlington, MA, USA (2011)
Noyunsan, C., Katanyukul, T., Wu, Y., Runapongsa Saikaew, K.: Social network newsworthiness filter based on topic analysis. In: Proceedings of the 6th KKU International Engineering Conference (KKU-IENC 2016) (2016)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, É.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Runapongsa Saikaew, K., Noyunsan, C.: Features for measuring credibility on Facebook information. Int. Sch. Sci. Res. Innov. 9(1), 174–177 (2015)
Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Addison-Wesley, Boston, MA, USA (2006)
Acknowledgements
This project is partially supported by Natural Sciences and Engineering Research Council of Canada (NSERC).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Noyunsan, C., Katanyukul, T., Leung, C.K., Saikaew, K.R. (2017). Effects of the Inclusion of Non-newsworthy Messages in Credibility Assessment. In: Sidorov, G., Herrera-Alcántara, O. (eds) Advances in Computational Intelligence. MICAI 2016. Lecture Notes in Computer Science(), vol 10061. Springer, Cham. https://doi.org/10.1007/978-3-319-62434-1_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-62434-1_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62433-4
Online ISBN: 978-3-319-62434-1
eBook Packages: Computer ScienceComputer Science (R0)