Skip to main content

Effects of the Inclusion of Non-newsworthy Messages in Credibility Assessment

  • Conference paper
  • First Online:
Advances in Computational Intelligence (MICAI 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10061))

Included in the following conference series:

  • 1374 Accesses

Abstract

Social media has become influential and affects large public perception. Anyone can post and share messages on social networking sites. However, not all posts are trustworthy. Many online messages contain misleading or false information. There has been an extensive research to assess the credibility of social media data. Previous studies evaluate all online messages, which may be inappropriate due to a large amount of such data that can result in ineffectiveness of the system. This paper studies and presents the effects of the inclusion of such data—namely, non-newsworthy messages—in credibility assessment. Our findings affirm a negative effect of training a model with non-newsworthy data. The degree of performance degradation is also shown to have a strong connection to a degree of non-newsworthiness in training data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.facebook.com/.

  2. 2.

    https://www.instagram.com/?hl=en or https://www.instagram.com/?hl=th.

  3. 3.

    https://twitter.com/?lang=en or https://twitter.com/?lang=th.

  4. 4.

    http://newsroom.fb.com/company-info/.

  5. 5.

    https://www.instagram.com/press/?hl=en.

  6. 6.

    https://about.twitter.com/company.

  7. 7.

    http://www.phishtank.com/.

  8. 8.

    https://developers.google.com/safebrowsing/.

  9. 9.

    https://pypi.python.org/pypi/scikit-learn/0.16.0.

  10. 10.

    https://github.com/chaluemwut/FBFilterCML.git.

References

  1. Cerón-Guzmán, J.A., León, E.: Detecting social spammers in colombia 2014 presidential election. In: Lagunas, O.P., Alcántara, O.H., Figueroa, G.A. (eds.) MICAI 2015. LNCS (LNAI), vol. 9414, pp. 121–141. Springer, Cham (2015). doi:10.1007/978-3-319-27101-9_9

    Chapter  Google Scholar 

  2. Aggarwal, A., Rajadesingan, A., Kumaraguru, P.: PhishAri: Automatic realtime phishing detection on Twitter. In: Proceedings of the 2012 eCrime Researchers Summit (eCrime 2012), Las Croabas, PR, USA, 23–24 October 2012, pp. 1–12. IEEE, Piscataway, NJ, USA (2012)

    Google Scholar 

  3. Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Chapman and Hall/CRC, Boca Raton, FL, USA (1984)

    MATH  Google Scholar 

  4. Castillo, C., Mendoza, M., Poblete, B.: Information credibility on twitter. Proceedings of the 20th International Conference on World Wide Web (WWW 2011), Hyderabad, India, 28 March–1 April 2011, pp. 675–684. ACM, New York, NY, USA (2011)

    Google Scholar 

  5. Gupta, A., Kumaraguru, P.: Credibility ranking of tweets during high impact events. In: Proceedings of the 1st Workshop on Privacy and Security in Online Social Media (PSOSM 2012), Lyon, France, 17 April 2012, no. 2. ACM, New York, NY, USA (2012)

    Google Scholar 

  6. Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: Real-time credibility assessment of content on twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). doi:10.1007/978-3-319-13734-6_16

    Google Scholar 

  7. Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Burlington, MA, USA (2011)

    MATH  Google Scholar 

  8. Noyunsan, C., Katanyukul, T., Wu, Y., Runapongsa Saikaew, K.: Social network newsworthiness filter based on topic analysis. In: Proceedings of the 6th KKU International Engineering Conference (KKU-IENC 2016) (2016)

    Google Scholar 

  9. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, É.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  10. Runapongsa Saikaew, K., Noyunsan, C.: Features for measuring credibility on Facebook information. Int. Sch. Sci. Res. Innov. 9(1), 174–177 (2015)

    Google Scholar 

  11. Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Addison-Wesley, Boston, MA, USA (2006)

    Google Scholar 

Download references

Acknowledgements

This project is partially supported by Natural Sciences and Engineering Research Council of Canada (NSERC).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tatpong Katanyukul .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Noyunsan, C., Katanyukul, T., Leung, C.K., Saikaew, K.R. (2017). Effects of the Inclusion of Non-newsworthy Messages in Credibility Assessment. In: Sidorov, G., Herrera-Alcántara, O. (eds) Advances in Computational Intelligence. MICAI 2016. Lecture Notes in Computer Science(), vol 10061. Springer, Cham. https://doi.org/10.1007/978-3-319-62434-1_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62434-1_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62433-4

  • Online ISBN: 978-3-319-62434-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics