Social Spammer Detection via Structural Properties in Ego Network

Zhang, Baochao; Qian, Tieyun; Chen, Yiqi; You, Zhenni

doi:10.1007/978-981-10-2993-6_21

Part of the book series: Communications in Computer and Information Science (CCIS, volume 669)

Included in the following conference series:

Chinese National Conference on Social Media Processing

Abstract

Social media have become popular communication platforms in recent years, and a huge number of users disseminate and share information on these websites. This popularity has attracted numerous malicious users (spammers) who send spam, spread malware, and run phishing scams. It is therefore highly desirable to distinguish legitimate users from spammers automatically. Existing approaches mainly use behavior, content, or profile information as features to characterize social spammers. However, to avoid being caught by the websites, spammers sometimes pretend to post normal messages and continuously change their behavior, which makes behavior- and content-based approaches less effective.

In this paper, we propose a novel method to detect social spammers via structural properties. Specifically, we adopt 12 types of topological features in users’ ego networks, including average degree, density, modularity, rich-club connectivity, centrality, average shortest path, and clustering coefficient, to learn the classification model for spammer detection. Experimental results on a real-world microblog data set demonstrate that the proposed method is very effective: it reaches an accuracy of 82.14% with structural features alone, and its accuracy improves significantly to 94.00% when the structural features are combined with other features.
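
As an illustration of what such structural features look like, the sketch below computes four of the features named above (average degree, density, clustering coefficient, and average shortest path) on a user's ego network. It is a minimal sketch and not the authors' implementation: the abstract does not specify how the ego network is built, so an undirected 1-hop neighborhood is assumed here, and the function names and the toy graph are hypothetical.

```python
from collections import deque
from itertools import combinations


def ego_network(adj, ego):
    """1-hop ego network of `ego` (assumption: undirected, no self-loops)."""
    nodes = {ego} | adj[ego]
    # Keep only edges whose endpoints both lie inside the ego network.
    return {u: adj[u] & nodes for u in nodes}


def average_degree(g):
    return sum(len(nbrs) for nbrs in g.values()) / len(g)


def density(g):
    n = len(g)
    if n < 2:
        return 0.0
    edges = sum(len(nbrs) for nbrs in g.values()) / 2
    return 2 * edges / (n * (n - 1))


def average_clustering(g):
    """Mean local clustering coefficient; nodes with degree < 2 count as 0."""
    total = 0.0
    for nbrs in g.values():
        k = len(nbrs)
        if k < 2:
            continue
        links = sum(1 for a, b in combinations(nbrs, 2) if b in g[a])
        total += 2 * links / (k * (k - 1))
    return total / len(g)


def average_shortest_path(g):
    """Mean hop distance over all reachable ordered pairs, via BFS."""
    total, pairs = 0, 0
    for src in g:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in g[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0


def structural_features(adj, ego):
    """Feature vector for one user (a subset of the 12 features in the paper)."""
    g = ego_network(adj, ego)
    return [
        average_degree(g),
        density(g),
        average_clustering(g),
        average_shortest_path(g),
    ]


# Hypothetical toy graph: user "u" follows/is followed by "a", "b", "c".
toy = {
    "u": {"a", "b", "c"},
    "a": {"u", "b"},
    "b": {"u", "a"},
    "c": {"u", "d"},
    "d": {"c"},
}
features = structural_features(toy, "u")
```

A vector like `features`, computed per user, could then be fed to any standard classifier. The remaining features in the list (modularity, rich-club connectivity, centrality) would need their own definitions, which the abstract does not give.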

Acknowledgements

The work described in this paper has been supported in part by the NSFC projects (61572376, 61272275) and the 111 project (B07037).

Author information

Authors and Affiliations

State Key Laboratory of Software Engineering, Wuhan University, Wuhan, China
Baochao Zhang, Tieyun Qian, Yiqi Chen & Zhenni You

Corresponding author

Correspondence to Tieyun Qian.

Editor information

Editors and Affiliations

Beijing Language and Culture University, Beijing, China
Yuming Li
Jiangxi Normal University, Nanchang, China
Guoxiong Xiang
Dalian University of Technology, Dalian, China
Hongfei Lin
Jiangxi Normal University, Nanchang, China
Mingwen Wang

Cite this paper

Zhang, B., Qian, T., Chen, Y., You, Z. (2016). Social Spammer Detection via Structural Properties in Ego Network. In: Li, Y., Xiang, G., Lin, H., Wang, M. (eds) Social Media Processing. SMP 2016. Communications in Computer and Information Science, vol 669. Springer, Singapore. https://doi.org/10.1007/978-981-10-2993-6_21

DOI: https://doi.org/10.1007/978-981-10-2993-6_21
Published: 19 October 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2992-9
Online ISBN: 978-981-10-2993-6
eBook Packages: Computer Science, Computer Science (R0)