Abstract
Social media have become popular communication platforms in recent years. A huge number of users disseminate and share information on these websites. Due to their popularity, social media have attracted numerous malicious users (spammers) to send spams, spread malware and phish scams. It is highly desirable to automatically distinguish legitimate users from spammers. Existing approaches mainly use behavior, content, or profile information as features to characterize the social spammers. However, to avoid being caught by the websites, the spammers pretend to post normal messages sometimes and change their behaviors continuously. This makes the behavior and content based approaches less effective.
In this paper, we propose a novel method to detect social spammers via structural properties. Specifically, we adopt 12 types of topological features in users’ ego network, including average degree, density, modularity, rich club connectivity, centrality, average shortest path, and cluster coefficient, to learn the classification model for spammer detection. Experimental results on a real world microblog data set demonstrate that the proposed method is very effective. It reaches an accuracy of 82.14 % with only structural features. Furthermore, its performance can be significantly improved to 94.00 % when combined with other features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Benevenuto, F., Magno, G., Rodrigues, T., Almeida., V.: Detecting spammers on twitter. In: Proceedings of the 7th CEAS, pp. 6–12 (2010)
Bhat, S.Y., Abulaish, M.: Community-based features for identifying spammers in online social networks. In: Proceedings of the ASONAM, pp. 100–107 (2013)
Chen, C., Wu, K., Srinivasan, V., Zhang, X.: Battling the internet water army: detection of hidden paid posters. In: Proceedings of the ASONAM, pp. 116–120 (2013)
Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: Liblinear: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)
Hu, X., Tang, J., Liu, H.: Leveraging knowledge across media for spammer detection in microblogging. In: Proceedings of the SIGIR, pp. 547–556 (2014)
Hu, X., Tang, J., Zhang, Y., Liu, H.: Social spammer detection in microblogging. In: Proceedings of the IJCAI, pp. 2633–2639 (2013)
Jiang, M., Cui, P., Beutel, A., Faloutsos, C., Yang, S.: CatchSync: catching synchronized behavior in large directed graphs. In: Proceedings of the KDD, pp. 941–950 (2014)
Jindal, N., Liu, B., Lim, E.P.: Finding unusual review patterns using unexpected rules. In: Proceedings of the CIKM, pp. 1549–1552 (2011)
Lee, K., Caverlee, J., Webb, S.: Uncovering social spammers: social honeypots+ machine learning. In: Proceedings of the SIGIR, pp. 435–442 (2010)
Li, F., Huang, M., Yang, Y., Zhu, X.: Learning to identify review spam. In: Proceedings of the IJCAI, pp. 2488–2493 (2011)
Lim, E.P., Nguyen, V.A., Jindal, N., Liu, B., Lauw, H.W.: Detecting product review spammers using rating behaviors. In: Proceedings of the CIKM, pp. 939–948 (2010)
McCord, M., Chuah, M.: Spam detection on Twitter using traditional classifiers. In: Calero, J.M.A., Yang, L.T., Mármol, F.G., García Villalba, L.J., Li, A.X., Wang, Y. (eds.) ATC 2011. LNCS, vol. 6906, pp. 175–186. Springer, Heidelberg (2011). doi:10.1007/978-3-642-23496-5_13
Mukherjee, A., Kumar, A., Liu, B., Wang, J., Hsu, M., Castellanos, M., Ghosh, R.: Spotting opinion spammers using behavioral footprints. In: Proceedings of the KDD, pp. 632–640 (2013)
Ren, Y., Ji, D., Zhang, H., Yin, L.: Positive and unlabeled learning for deceptive reviews detection. In: Proceedings of the EMNLP, pp. 488–498 (2014)
Sedhai, S., Sun, A.: Hspam14: a collection of 14 million tweets for hashtag-oriented spam research. In: Proceedings of the SIGIR, pp. 223–232 (2015)
Stringhini, G., Kruegel, C., Vigna, G.: Detecting spammers on social networks. In: Proceedings of the ACSAC, pp. 1–9 (2010)
Tian, T., Zhu, J., Xia, F., Zhuang, X., Zhang, T.: Crowd fraud detection in internet advertising. In: Proceedings of the 24th WWW, pp. 1100–1110 (2015)
Wu, F., Shu, J., Huang, Y., Yuan, Z.: Social spammer and spam message co-detection in microblogging with social context regularization. In: Proceedings of the CIKM, pp. 1601–1610 (2015)
Yang, C., Harkreader, R., Zhang, J., Shin, S., Gu, G.: Analyzing spammer social networks for fun and profit, a case study of cyber criminal ecosystem on Twitter. In: Proceedings of the WWW, pp. 71–80 (2012)
Yang, Z., Wilson, C., Wang, X., Gao, T., Zhao, B.Y., Dai, Y.: Uncovering social network sybils in the wild. ACM Trans. Knowl. Discov. Data 8, Article No. 2 (2014)
Zafarani, R., Liu, H.: Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of the KDD, pp. 41–49 (2013)
Zhang, X., Li, Z., Zhu, S., Liang, W.: Detecting spam and promoting campaigns in Twitter. ACM Trans. Web 10(1), 4:1–4:28 (2016)
Zhu, Y., Wang, X., Zhong, E., Liu, N.N., Li, H., Yang, Q.: Discovering spammers in social networks. In: Proceedings of the AAAI (2012)
Acknowledgements
The work described in this paper has been supported in part by the NSFC projects (61572376, 61272275), and the 111 project (B07037).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, B., Qian, T., Chen, Y., You, Z. (2016). Social Spammer Detection via Structural Properties in Ego Network. In: Li, Y., Xiang, G., Lin, H., Wang, M. (eds) Social Media Processing. SMP 2016. Communications in Computer and Information Science, vol 669. Springer, Singapore. https://doi.org/10.1007/978-981-10-2993-6_21
Download citation
DOI: https://doi.org/10.1007/978-981-10-2993-6_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2992-9
Online ISBN: 978-981-10-2993-6
eBook Packages: Computer ScienceComputer Science (R0)