Abstract
Weibo (Chinese microblog) has become a popular social media platform for users to share health-related information. However, illegitimate users or spammers often generate and spread false or misleading health information so as to advertise and attract more attention. To address this issue, we propose a health-related spammer detection approach on Chinese social media. Our approach is a deep belief network (DBN) based model incorporating a comprehensive feature set, including burstiness-based features, profile-based features, and content-based features, to identify spammers who spread misleading health-related information. Especially, we create a medical and health domain lexicon to better extract content-based features. The experimental results show the approach achieves an F1 score of 86 % in detecting spammer and significantly outperforms the benchmark methods using baseline features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amleshwaram, A.A., Reddy, N., Yadav, S., Gu, G., Yang, C.: CATS: characterizing automation of twitter spammers. In: Fifth International Conference on Communication Systems and Networks (COMSNETS), pp. 1–10. IEEE (2013)
Chen, C., Wu, K., Srinivasan, V., Zhang, X.: Battling the internet water army: detection of hidden paid posters. In: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 116–120. ACM (2013)
Fei, G., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M., Ghosh, R.: Exploiting burstiness in reviews for review spammer detection. In: ICWSM. Citeseer (2013)
Gao, Q., Tian, Y., Tu, M.: Exploring factors influencing chinese user’s perceived credibility of health and safety information on weibo. Comput. Hum. Behav. 45, 21–31 (2015)
Ge, L., Gao, J., Li, X., Zhang, A.: Multi-source deep learning for information trustworthiness estimation. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 766–774. ACM (2013)
Heydari, A., ali Tavakoli, M., Salim, N., Heydari, Z.: Detection of review spam: a survey. Expert Syst. Appl. 42(7), 3634–3642 (2015)
Hinton, G., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)
Jindal, N., Liu, B.: Opinion spam and analysis. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 219–230. ACM (2008)
Lin, Y., Zhu, T., Wang, X., Zhang, J., Zhou, A.: Towards online review spam detection. In: Proceedings of the Companion Publication of the 23rd International Conference on World Wide Web Companion, pp. 341–342. International World Wide Web Conferences Steering Committee (2014)
Liu, Y., Wu, B., Wang, B., Li, G.: SDHM: a hybrid model for spammer detection on Weibo. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2014, pp. 942–947. IEEE (2014)
Mukherjee, S., Weikum, G., Danescu-Niculescu-Mizil, C.: People on drugs: credibility of user statements in health communities. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 65–74. ACM (2014)
Rosenblatt, M., et al.: Remarks on some nonparametric estimates of a density function. Ann. Math. Stat. 27(3), 832–837 (1956)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. ACM Press, New York (1986)
Vydiswaran, V., Zhai, C., Roth, D.: Gauging the internet doctor: ranking medical claims based on community knowledge. In: Proceedings of the 2011 Workshop on Data Mining for Medicine and Healthcare, pp. 42–51. ACM (2011)
Wang, G., Xie, S., Liu, B., Yu, P.S.: Review graph based online store review spammer detection. In: IEEE 11th International Conference on Data Mining (ICDM), pp. 1242–1247. IEEE (2011)
Xie, S., Wang, G., Lin, S., Yu, P.S.: Review spam detection via temporal pattern discovery. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 823–831. ACM (2012)
Yang, C., Harkreader, R., Gu, G.: Empirical evaluation and new design for fighting evolving twitter spammers. IEEE Trans. Inf. Forensics Secur. 8(8), 1280–1293 (2013)
Zhang, Y.: Detect spammers in online social networks (2015)
A open source project for Chinese word segmentation. http://code.google.com/p/jcseg/
Acknowledgments
This work was supported by the National High-tech R&D Program of China (Grant No. SS2015AA020102), National Basic Research Program of China (Grant No. 2011CB302302), the 1000-Talent program, Tsinghua University Initiative Scientific Research Program.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Chen, X., Zhang, Y., Xu, J., Xing, C., Chen, H. (2016). Health-Related Spammer Detection on Chinese Social Media. In: Zheng, X., Zeng, D., Chen, H., Leischow, S. (eds) Smart Health. ICSH 2015. Lecture Notes in Computer Science(), vol 9545. Springer, Cham. https://doi.org/10.1007/978-3-319-29175-8_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-29175-8_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-29174-1
Online ISBN: 978-3-319-29175-8
eBook Packages: Computer ScienceComputer Science (R0)