ICSH

ICSH 2015: Smart Health, pp 131–142

Information Credibility: A Probabilistic Graphical Model for Identifying Credible Influenza Posts on Social Media

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9545)

Abstract

Social media is an important data source that complements traditional epidemic surveillance. However, misinformation on social media hinders the use of this valuable information. The analysis of information credibility has drawn considerable attention from academia in recent years. In this paper, we focus on analyzing the credibility of influenza posts published on Sina Weibo. We propose a semi-supervised probabilistic graphical model that jointly learns the interactions among user trustworthiness, content reliability, and post credibility. To test the approach, we apply it to identify credible influenza posts published on Sina Weibo from May 2013 to June 2014. Random Forests and a Bayesian Network serve as baselines for evaluation. The results show that our approach performs effectively, with the highest average accuracy of 71.7% and an F-measure of 51%. The proposed framework significantly outperformed the baselines in detecting credible influenza posts on Sina Weibo.
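As a reading aid for the two reported metrics, the following is a minimal sketch of how accuracy and F-measure are commonly computed for a binary "credible vs. not credible" labeling. The abstract does not state which F-measure variant was used; the balanced F1 score with "credible" as the positive class is assumed here. The function name `accuracy_and_f1` and the toy labels are hypothetical and are not data from the paper.

```python
def accuracy_and_f1(y_true, y_pred, positive=1):
    """Return (accuracy, F1) for binary labels.

    Assumption: F-measure means the balanced F1 score, with `positive`
    marking the "credible" class. The abstract does not confirm this.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)

    accuracy = correct / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return accuracy, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1


# Hypothetical toy labels (1 = credible post, 0 = not credible).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
accuracy, f1 = accuracy_and_f1(y_true, y_pred)
print(f"accuracy={accuracy:.3f}, f1={f1:.3f}")
```

Accuracy counts all correct predictions, while F1 considers only the positive class, so the two can diverge when credible posts are a minority of the data.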


Notes

  1. http://www.sina.com.cn

  2. http://ictclas.nlpir.org/

  3. http://www.cnki.net

  4. http://www.chinacdc.cn

  5. http://www.nhfpc.gov.cn

Acknowledgements

We thank Jingwei Li, Qihui Xia, and Lidan Chen for their help with preprocessing and labeling the data. We are also grateful to Professor Hsinchun Chen for his help in revising this paper. Finally, we thank all the reviewers for their suggestions.

Author information

Corresponding author

Correspondence to Qiaozhen Guo.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Guo, Q., Huang, W., Huang, K., Liu, X. (2016). Information Credibility: A Probabilistic Graphical Model for Identifying Credible Influenza Posts on Social Media. In: Zheng, X., Zeng, D., Chen, H., Leischow, S. (eds) Smart Health. ICSH 2015. Lecture Notes in Computer Science, vol 9545. Springer, Cham. https://doi.org/10.1007/978-3-319-29175-8_12

  • DOI: https://doi.org/10.1007/978-3-319-29175-8_12

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-29174-1

  • Online ISBN: 978-3-319-29175-8

  • eBook Packages: Computer Science, Computer Science (R0)
