Health-Related Spammer Detection on Chinese Social Media

  • Conference paper
Smart Health (ICSH 2015)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9545)

Abstract

Weibo, a Chinese microblogging service, has become a popular social media platform for sharing health-related information. However, illegitimate users, or spammers, often generate and spread false or misleading health information to advertise and attract attention. To address this issue, we propose an approach for detecting health-related spammers on Chinese social media. The approach is a model based on a deep belief network (DBN) that incorporates a comprehensive feature set, comprising burstiness-based, profile-based, and content-based features, to identify spammers who spread misleading health-related information. In particular, we build a medical and health domain lexicon to better extract content-based features. Experimental results show that the approach achieves an F1 score of 86% in detecting spammers and significantly outperforms benchmark methods that use baseline features.
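
To make two ideas in the abstract concrete, the following minimal Python sketch shows one plausible lexicon-driven content feature and the standard F1 computation. It is an illustration under stated assumptions, not the authors' implementation: the lexicon entries, the function names lexicon_ratio and f1_score, and the choice of a simple token-coverage ratio are all hypothetical, and posts are assumed to be already segmented into tokens.

```python
from typing import AbstractSet, Sequence

# Hypothetical mini-lexicon for illustration only; the paper's actual
# medical and health domain lexicon is not reproduced on this page.
HEALTH_LEXICON = frozenset({"cure", "diabetes", "detox", "supplement"})


def lexicon_ratio(tokens: Sequence[str],
                  lexicon: AbstractSet[str] = HEALTH_LEXICON) -> float:
    """Fraction of a post's tokens that appear in the lexicon.

    This is an assumed example of a content-based feature, not one
    confirmed by the paper.
    """
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in lexicon)
    return hits / len(tokens)


def f1_score(y_true: Sequence[int], y_pred: Sequence[int],
             positive: int = 1) -> float:
    """Harmonic mean of precision and recall for the positive (spammer) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

In such a setup, a value like lexicon_ratio(tokens) would be one entry in a user's feature vector alongside burstiness-based and profile-based features, and f1_score would be computed on held-out labels, with 1 marking a spammer.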

Acknowledgments

This work was supported by the National High-tech R&D Program of China (Grant No. SS2015AA020102), the National Basic Research Program of China (Grant No. 2011CB302302), the 1000-Talent program, and the Tsinghua University Initiative Scientific Research Program.

Author information

Corresponding author

Correspondence to Xinhuan Chen.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Chen, X., Zhang, Y., Xu, J., Xing, C., Chen, H. (2016). Health-Related Spammer Detection on Chinese Social Media. In: Zheng, X., Zeng, D., Chen, H., Leischow, S. (eds) Smart Health. ICSH 2015. Lecture Notes in Computer Science, vol 9545. Springer, Cham. https://doi.org/10.1007/978-3-319-29175-8_27

  • DOI: https://doi.org/10.1007/978-3-319-29175-8_27

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-29174-1

  • Online ISBN: 978-3-319-29175-8

  • eBook Packages: Computer Science, Computer Science (R0)
