Abstract
Inferring users’ social role on a mobile communication network is of significance for various applications, such as financial fraud detection, viral marketing, and target promotions. Different with the social network, which has lots of user generated contents (UGC) including texts, pictures, and videos, considering the privacy issues, mobile communication network only contains the communication pattern data, such as message frequency and phone call frequency as well as duration. Moreover, the profile data of mobile users is always noisy, ambiguous, and sparse, which makes the task more challenging. In this paper, we use the graph embedding methods as a feature extractor and then combine it with the hand-crafted structure-related features in a feed-forward neural network. Different with previous embedding methods, we consider the label info while sampling the context. To handle the noisy and sparsity challenge, we further project the generated embedding onto a much smaller subspace. Through our experiments, we can increase the precision by up to 10% even with a huge portion of noisy and sparse labeled data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Astudillo, R.F., Amir, S., Lin, W., Silva, M., Trancoso, I.: Learning word representations from scarce and noisy data with embedding sub-spaces. In: ACL 2015 (2015)
Dong, Y., Yang, Y., Tang, J., Yang, Y., Chawla, N.V.: Inferring user demographics and social strategies in mobile social networks. In: KDD 2014, pp. 15–24 (2014)
Hecht, B., Hong, L., Suh, B., Chi, E.H.: Tweets from Justin Bieber’s heart: the dynamics of the location field in user profiles. In: CHI 2011, pp. 237–246 (2011)
Li, R., Wang, S., Deng, H., Wang, R., Chang, K.C.C.: Towards social user profiling: unified and discriminative influence model for inferring home locations. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1023–1031. ACM (2012)
Pennacchiotti, M., Popescu, A.M.: Democrats, republicans and starbucks afficionados: user classification in twitter. In: KDD 2011, pp. 430–438 (2011)
Raghavan, U.N., Albert, R., Kumara, S.: Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76(3), 036106 (2007)
Recht, B., Re, C., Wright, S., Niu, F.: Hogwild: a lock-free approach to parallelizing stochastic gradient descent. In: NIPS 2011 (2011)
Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: large-scale information network embedding. In: WWW 2015, pp. 1067–1077 (2015)
Chen, Y., Li, H., Zhang, J., Miao, G.: Inferring social roles of mobile users based on communication behaviors. In: Cui, B., Zhang, N., Xu, J., Lian, X., Liu, D. (eds.) WAIM 2016. LNCS, vol. 9658, pp. 402–414. Springer, Cham (2016). doi:10.1007/978-3-319-39937-9_31
Zeng, G., Luo, P., Chen, E., Wang, M.: From social user activities to people affiliation. In: ICDM 2013, pp. 1277–1282 (2013)
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. In: NIPS 2004 (2004)
Acknowledgement
This work was supported by Natural Science Foundation of China (No. 61170003).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zhang, J., Chen, Y., Hong, S., Li, H. (2017). REBUILD: Graph Embedding Based Method for User Social Role Identity on Mobile Communication Network. In: Tan, Y., Takagi, H., Shi, Y. (eds) Data Mining and Big Data. DMBD 2017. Lecture Notes in Computer Science(), vol 10387. Springer, Cham. https://doi.org/10.1007/978-3-319-61845-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-61845-6_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-61844-9
Online ISBN: 978-3-319-61845-6
eBook Packages: Computer ScienceComputer Science (R0)