Health-Related Spammer Detection on Chinese Social Media

Chen, Xinhuan; Zhang, Yong; Xu, Jennifer; Xing, Chunxiao; Chen, Hsinchun

doi:10.1007/978-3-319-29175-8_27

Xinhuan Chen¹⁷,
Yong Zhang¹⁷,
Jennifer Xu¹⁸,
Chunxiao Xing¹⁷ &
…
Hsinchun Chen^17,19

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9545))

Included in the following conference series:

ICSH

2424 Accesses

Abstract

Weibo (Chinese microblog) has become a popular social media platform for users to share health-related information. However, illegitimate users or spammers often generate and spread false or misleading health information so as to advertise and attract more attention. To address this issue, we propose a health-related spammer detection approach on Chinese social media. Our approach is a deep belief network (DBN) based model incorporating a comprehensive feature set, including burstiness-based features, profile-based features, and content-based features, to identify spammers who spread misleading health-related information. Especially, we create a medical and health domain lexicon to better extract content-based features. The experimental results show the approach achieves an F1 score of 86 % in detecting spammer and significantly outperforms the benchmark methods using baseline features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Amleshwaram, A.A., Reddy, N., Yadav, S., Gu, G., Yang, C.: CATS: characterizing automation of twitter spammers. In: Fifth International Conference on Communication Systems and Networks (COMSNETS), pp. 1–10. IEEE (2013)
Google Scholar
Chen, C., Wu, K., Srinivasan, V., Zhang, X.: Battling the internet water army: detection of hidden paid posters. In: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pp. 116–120. ACM (2013)
Google Scholar
Fei, G., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M., Ghosh, R.: Exploiting burstiness in reviews for review spammer detection. In: ICWSM. Citeseer (2013)
Google Scholar
Gao, Q., Tian, Y., Tu, M.: Exploring factors influencing chinese user’s perceived credibility of health and safety information on weibo. Comput. Hum. Behav. 45, 21–31 (2015)
Article Google Scholar
Ge, L., Gao, J., Li, X., Zhang, A.: Multi-source deep learning for information trustworthiness estimation. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 766–774. ACM (2013)
Google Scholar
Heydari, A., ali Tavakoli, M., Salim, N., Heydari, Z.: Detection of review spam: a survey. Expert Syst. Appl. 42(7), 3634–3642 (2015)
Article Google Scholar
Hinton, G., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006)
Article MATH MathSciNet Google Scholar
Jindal, N., Liu, B.: Opinion spam and analysis. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 219–230. ACM (2008)
Google Scholar
Lin, Y., Zhu, T., Wang, X., Zhang, J., Zhou, A.: Towards online review spam detection. In: Proceedings of the Companion Publication of the 23rd International Conference on World Wide Web Companion, pp. 341–342. International World Wide Web Conferences Steering Committee (2014)
Google Scholar
Liu, Y., Wu, B., Wang, B., Li, G.: SDHM: a hybrid model for spammer detection on Weibo. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2014, pp. 942–947. IEEE (2014)
Google Scholar
Mukherjee, S., Weikum, G., Danescu-Niculescu-Mizil, C.: People on drugs: credibility of user statements in health communities. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 65–74. ACM (2014)
Google Scholar
Rosenblatt, M., et al.: Remarks on some nonparametric estimates of a density function. Ann. Math. Stat. 27(3), 832–837 (1956)
Article MATH MathSciNet Google Scholar
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. ACM Press, New York (1986)
Google Scholar
Vydiswaran, V., Zhai, C., Roth, D.: Gauging the internet doctor: ranking medical claims based on community knowledge. In: Proceedings of the 2011 Workshop on Data Mining for Medicine and Healthcare, pp. 42–51. ACM (2011)
Google Scholar
Wang, G., Xie, S., Liu, B., Yu, P.S.: Review graph based online store review spammer detection. In: IEEE 11th International Conference on Data Mining (ICDM), pp. 1242–1247. IEEE (2011)
Google Scholar
Xie, S., Wang, G., Lin, S., Yu, P.S.: Review spam detection via temporal pattern discovery. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 823–831. ACM (2012)
Google Scholar
Yang, C., Harkreader, R., Gu, G.: Empirical evaluation and new design for fighting evolving twitter spammers. IEEE Trans. Inf. Forensics Secur. 8(8), 1280–1293 (2013)
Article Google Scholar
Zhang, Y.: Detect spammers in online social networks (2015)
Google Scholar
A open source project for Chinese word segmentation. http://code.google.com/p/jcseg/

Download references

Acknowledgments

This work was supported by the National High-tech R&D Program of China (Grant No. SS2015AA020102), National Basic Research Program of China (Grant No. 2011CB302302), the 1000-Talent program, Tsinghua University Initiative Scientific Research Program.

Author information

Authors and Affiliations

Department of Computer Science and Technology, Research Institute of Information Technology, Tsinghua University, Beijing, China
Xinhuan Chen, Yong Zhang, Chunxiao Xing & Hsinchun Chen
Department of Computer Information Systems, Bentley University, Waltham, USA
Jennifer Xu
MIS Department, University of Arizona, Tucson, USA
Hsinchun Chen

Authors

Xinhuan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Xu
View author publications
You can also search for this author in PubMed Google Scholar
Chunxiao Xing
View author publications
You can also search for this author in PubMed Google Scholar
Hsinchun Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinhuan Chen .

Editor information

Editors and Affiliations

Institute of Automation,Bldg.1004, Chinese Academy of Sciences, Beijing, China
Xiaolong Zheng
University of Arizona, Tucson, Arizona, USA
Daniel Dajun Zeng
University of Arizona, Phoenix, USA
Hsinchun Chen
Mayo Clinic, Scottsdale, USA
Scott J. Leischow

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, X., Zhang, Y., Xu, J., Xing, C., Chen, H. (2016). Health-Related Spammer Detection on Chinese Social Media. In: Zheng, X., Zeng, D., Chen, H., Leischow, S. (eds) Smart Health. ICSH 2015. Lecture Notes in Computer Science(), vol 9545. Springer, Cham. https://doi.org/10.1007/978-3-319-29175-8_27

Download citation

DOI: https://doi.org/10.1007/978-3-319-29175-8_27
Published: 20 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-29174-1
Online ISBN: 978-3-319-29175-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics