Effects of the Inclusion of Non-newsworthy Messages in Credibility Assessment

Noyunsan, Chaluemwut; Katanyukul, Tatpong; Leung, Carson K.; Saikaew, Kanda Runapongsa

doi:10.1007/978-3-319-62434-1_15

Chaluemwut Noyunsan¹⁵,
Tatpong Katanyukul¹⁵,
Carson K. Leung¹⁶ &
…
Kanda Runapongsa Saikaew¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10061))

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

1374 Accesses

Abstract

Social media has become influential and affects large public perception. Anyone can post and share messages on social networking sites. However, not all posts are trustworthy. Many online messages contain misleading or false information. There has been an extensive research to assess the credibility of social media data. Previous studies evaluate all online messages, which may be inappropriate due to a large amount of such data that can result in ineffectiveness of the system. This paper studies and presents the effects of the inclusion of such data—namely, non-newsworthy messages—in credibility assessment. Our findings affirm a negative effect of training a model with non-newsworthy data. The degree of performance degradation is also shown to have a strong connection to a degree of non-newsworthiness in training data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Cerón-Guzmán, J.A., León, E.: Detecting social spammers in colombia 2014 presidential election. In: Lagunas, O.P., Alcántara, O.H., Figueroa, G.A. (eds.) MICAI 2015. LNCS (LNAI), vol. 9414, pp. 121–141. Springer, Cham (2015). doi:10.1007/978-3-319-27101-9_9
Chapter Google Scholar
Aggarwal, A., Rajadesingan, A., Kumaraguru, P.: PhishAri: Automatic realtime phishing detection on Twitter. In: Proceedings of the 2012 eCrime Researchers Summit (eCrime 2012), Las Croabas, PR, USA, 23–24 October 2012, pp. 1–12. IEEE, Piscataway, NJ, USA (2012)
Google Scholar
Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Chapman and Hall/CRC, Boca Raton, FL, USA (1984)
MATH Google Scholar
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on twitter. Proceedings of the 20th International Conference on World Wide Web (WWW 2011), Hyderabad, India, 28 March–1 April 2011, pp. 675–684. ACM, New York, NY, USA (2011)
Google Scholar
Gupta, A., Kumaraguru, P.: Credibility ranking of tweets during high impact events. In: Proceedings of the 1st Workshop on Privacy and Security in Online Social Media (PSOSM 2012), Lyon, France, 17 April 2012, no. 2. ACM, New York, NY, USA (2012)
Google Scholar
Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: Real-time credibility assessment of content on twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). doi:10.1007/978-3-319-13734-6_16
Google Scholar
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Burlington, MA, USA (2011)
MATH Google Scholar
Noyunsan, C., Katanyukul, T., Wu, Y., Runapongsa Saikaew, K.: Social network newsworthiness filter based on topic analysis. In: Proceedings of the 6th KKU International Engineering Conference (KKU-IENC 2016) (2016)
Google Scholar
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, É.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
MathSciNet MATH Google Scholar
Runapongsa Saikaew, K., Noyunsan, C.: Features for measuring credibility on Facebook information. Int. Sch. Sci. Res. Innov. 9(1), 174–177 (2015)
Google Scholar
Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Addison-Wesley, Boston, MA, USA (2006)
Google Scholar

Download references

Acknowledgements

This project is partially supported by Natural Sciences and Engineering Research Council of Canada (NSERC).

Author information

Authors and Affiliations

Department of Computer Engineering, Faculty of Engineering, Khon Kaen University, Khon Kaen, 40002, Thailand
Chaluemwut Noyunsan, Tatpong Katanyukul & Kanda Runapongsa Saikaew
Department of Computer Science, Faculty of Science, University of Manitoba, Winnipeg, MB, R3T 2N2, Canada
Carson K. Leung

Authors

Chaluemwut Noyunsan
View author publications
You can also search for this author in PubMed Google Scholar
Tatpong Katanyukul
View author publications
You can also search for this author in PubMed Google Scholar
Carson K. Leung
View author publications
You can also search for this author in PubMed Google Scholar
Kanda Runapongsa Saikaew
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tatpong Katanyukul .

Editor information

Editors and Affiliations

Instituto Politécnico Nacional, Centro de Investigación en Computación, Mexico City, Mexico
Grigori Sidorov
Universidad Autónoma Metropolitana, Mexico City, Mexico
Oscar Herrera-Alcántara

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Noyunsan, C., Katanyukul, T., Leung, C.K., Saikaew, K.R. (2017). Effects of the Inclusion of Non-newsworthy Messages in Credibility Assessment. In: Sidorov, G., Herrera-Alcántara, O. (eds) Advances in Computational Intelligence. MICAI 2016. Lecture Notes in Computer Science(), vol 10061. Springer, Cham. https://doi.org/10.1007/978-3-319-62434-1_15

Download citation

DOI: https://doi.org/10.1007/978-3-319-62434-1_15
Published: 03 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62433-4
Online ISBN: 978-3-319-62434-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics