Inferring social network user profiles using a partial social graph

Journal of Intelligent Information Systems

Abstract

User profile inference on online social networks is a key task for targeted advertising and for building recommender systems that rely on social network data. However, current algorithms for user profiling suffer from one or more of the following limitations: (1) assuming that the full social graph or a large training set of crawled data is available for training, (2) not exploiting the rich information that is available in social networks, such as group memberships and likes, (3) treating numeric attributes as nominal attributes, and (4) not assessing the certainty of their predictions. To address these limitations, this paper proposes an algorithm named Partial Graph Profile Inference+ (PGPI+). PGPI+ can accurately infer user profiles under the constraint of a partial social graph. It does not require training, and it lets the user select the trade-off between the amount of information to be crawled for inferring a user profile and the accuracy of the inference. PGPI+ is also designed to use rich information about users when available: user profiles, friendship links, group memberships, and the "views" and "likes" from social networks such as Facebook. Moreover, to address limitations 3 and 4, PGPI+ considers numeric attributes in addition to nominal attributes, and can evaluate the certainty of its predictions. An experimental evaluation with 31,247 user profiles from the Facebook and Pokec social networks shows that PGPI+ predicts user profiles with higher accuracy than several state-of-the-art algorithms, while accessing (crawling) less information from the social graph. Furthermore, an interesting result is that some profile attributes, such as status (student/professor) and gender, can be predicted with more than 95% accuracy using PGPI+.
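To make the ideas in the abstract concrete, the following is a minimal sketch of budget-limited profile inference on a partial social graph. It is not the PGPI+ procedure itself, whose details are not given in the abstract. The function names (`crawl`, `infer_nominal`, `infer_numeric`), the `max_facts` budget, the breadth-first crawl, and the 1/distance weighting are all assumptions made for illustration. The sketch shows three of the properties described above: no training step, a user-chosen cap on how much of the graph is accessed, and separate handling of nominal attributes (weighted vote with a certainty score) and numeric attributes (weighted mean).

```python
from collections import defaultdict, deque


def crawl(target, friends, max_facts):
    """Yield (user, distance) for at most max_facts users, breadth-first from target.

    max_facts is a hypothetical budget on how many profiles may be accessed.
    """
    seen = {target}
    queue = deque([(target, 0)])
    accessed = 0
    while queue:
        user, dist = queue.popleft()
        for friend in friends.get(user, []):
            if friend in seen:
                continue
            if accessed >= max_facts:
                return  # budget exhausted: stop crawling
            seen.add(friend)
            accessed += 1
            queue.append((friend, dist + 1))
            yield friend, dist + 1


def infer_nominal(target, friends, profiles, attribute, max_facts):
    """Weighted vote over crawled users; returns (value, certainty in [0, 1])."""
    votes = defaultdict(float)
    for user, dist in crawl(target, friends, max_facts):
        value = profiles.get(user, {}).get(attribute)
        if value is not None:
            votes[value] += 1.0 / dist  # assumed weighting: closer users count more
    if not votes:
        return None, 0.0
    best = max(votes, key=votes.get)
    # Certainty here is simply the winning value's share of the total weight.
    return best, votes[best] / sum(votes.values())


def infer_numeric(target, friends, profiles, attribute, max_facts):
    """Weighted mean over crawled users, so numeric values are not treated as labels."""
    total = 0.0
    weight_sum = 0.0
    for user, dist in crawl(target, friends, max_facts):
        value = profiles.get(user, {}).get(attribute)
        if value is not None:
            total += value / dist
            weight_sum += 1.0 / dist
    return total / weight_sum if weight_sum else None


# Toy data, invented for this example only.
friends = {"u": ["a", "b", "c"], "a": ["d"]}
profiles = {
    "a": {"status": "student", "year": 2010},
    "b": {"status": "student", "year": 2012},
    "c": {"status": "professor"},
    "d": {"status": "professor", "year": 2000},
}

print(infer_nominal("u", friends, profiles, "status", max_facts=3))
print(infer_numeric("u", friends, profiles, "year", max_facts=3))
```

Raising `max_facts` lets the crawl reach more distant users, which is one simple way to expose the trade-off between the amount of information accessed and the quality of the inference.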

Acknowledgements

This work was financed by a research grant from the Natural Sciences and Engineering Research Council (NSERC) of Canada.

Author information

Corresponding author

Correspondence to Philippe Fournier-Viger.

About this article

Cite this article

Dougnon, R.Y., Fournier-Viger, P., Lin, J.CW. et al. Inferring social network user profiles using a partial social graph. J Intell Inf Syst 47, 313–344 (2016). https://doi.org/10.1007/s10844-016-0402-y

  • DOI: https://doi.org/10.1007/s10844-016-0402-y
