Skip to main content
Log in

Supervised Classification for Link Prediction in Facebook Ego Networks With Anonymized Profile Information

  • Published:
Journal of Classification Aims and scope Submit manuscript

Abstract

Social networks are very dynamic objects where nodes and links are continuously added or removed. Hence, an important but challenging task is link prediction, that is, to predict the likelihood of a future association between any two nodes. We use a classification approach to perform link prediction on data retrieved from Facebook in the typical form of ego networks. In addition to the more traditional topological features, we also consider the attributes of the nodes—i.e., users’ publicly available profile information—to fully assess the similarity between nodes. We propose two new attribute-based features, validating their predictive power through an extensive comparison with natural competitors from the literature. Finally, one of the proposed features is selected when building a state-of-the-art procedure for link prediction that achieves an average AUROC of 96.59% over 85 test ego networks. Valuable insights on the interpretation of the results in the specific context of friendship recommendation in Facebook are also provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Data Availability

The authors declare that all data supporting the findings of this study are available within the article.

Notes

  1. https://www.statista.com/statistics/264810/number-of-monthly-active-facebook-users-worldwide/(Nov. 15, 2021).

  2. http://snap.stanford.edu/socialcircles/.

  3. https://www.kaggle.com/c/learning-social-circles.

References

  • Adamic, L.A., & Adar, E. (2003). Friends and neighbors on the web. Social Networks, 25(3), 211–230.

    Article  Google Scholar 

  • Aiello, L.M., Barrat, A., Schifanella, R., Cattuto, C., Markines, B., & Menczer, F. (2012). Friendship prediction and homophily in social media. ACM Transactions on the Web (TWEB), 6(2), 1–33.

    Article  Google Scholar 

  • Akcora, C.G., Carminati, B., & Ferrari, E. (2013). User similarities on social networks. Social Network Analysis and Mining, 3(3), 475–495.

    Article  Google Scholar 

  • Bhattacharyya, P., Garg, A., & Wu, S.F. (2011). Analysis of user keyword similarity in online social networks. Social Network Analysis and Mining, 1(3), 143–158.

    Article  Google Scholar 

  • Cardoso, F.M., Meloni, S., Santanche, A., & Moreno, Y. (2019). Topical alignment in online social systems. Frontiers in Physics, 7, 58.

    Article  Google Scholar 

  • Crandall, D., Cosley, D., Huttenlocher, D., Kleinberg, J., & Suri, S. (2008). Feedback effects between similarity and social influence in online communities. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 160–168).

  • Elkabani, I., & Khachfeh, R.A.A. (2015). Homophily-Based Link prediction in the facebook online social network: a rough sets approach. Journal of Intelligent Systems, 24(4), 491–503.

    Article  Google Scholar 

  • Guy, I., Jacovi, M., Perer, A., Ronen, I., & Uziel, E. (2010). Same places, same things, same people? mining user similarity on social media. In Proceedings of the 2010 ACM conference on Computer supported cooperative work (pp. 41–50).

  • Han, X., Wang, L., Han, S.N., Chen, C., Crespi, N., & Farahbakhsh, R. (2015). Link prediction for new users in social networks. In 2015 IEEE International Conference on Communications (ICC) (pp. 1250–1255). IEEE.

  • Han, X., Wang, L., Park, S., Cuevas, A., & Crespi, N. (2014). Alike people, alike interests? a large-scale study on interest similarity in social networks. In 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014) (pp. 491–496). IEEE.

  • Hanley, J. A., & Mcneil, B. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143, 29–36.

    Article  Google Scholar 

  • Hasan, M.A., Chaoji, V., Salem, S., & Zaki, M.J. (2006). Link prediction using supervised learning. In Proceedings of SDM Workshop of Link Analysis, Counterterrorism and Security.

  • Hasan, M.A., & Zaki, M.J. (2011). A survey of link prediction in social networks. In C.C. Aggarwal (Ed.) Social Network Data Analytics, chapter 9 (pp. 243–275). Boston: Springer.

  • Jaccard, P. (1901). Étude comparative de la distribution florale dans une portion des Alpes et des Jura. Bulletin del la Société Vaudoides Sciences Naturelles, 37, 547–579.

    Google Scholar 

  • Kumar, A., Singh, S.S., Singh, K., & Biswas, B. (2020). Link prediction techniques, applications, and performance: A survey. Physica A: Statistical Mechanics and its Applications, 553, 124289.

    Article  MathSciNet  Google Scholar 

  • Lü, L., & Zhou, T. (2011). Link prediction in complex networks: A survey. Physica A Statistical Mechanics and its Applications, 390, 1150–1170.

    Article  Google Scholar 

  • Mazhari, S., Fakhrahmad, S.M., & Sadeghbeygi, H. (2015). A user-profile-based friendship recommendation solution in social networks. Journal of Information Science, 41(3), 284–295.

    Article  Google Scholar 

  • McAuley, J., & Leskovec, J. (2014). Discovering social circles in ego networks. ACM Transactions on Knowledge Discovery from Data, 8(1), 4:1–4:28.

    Article  Google Scholar 

  • McPherson, M., Smith-Lovin, L., & Cook, J.M. (2001). Birds of a feather: Homophily in social networks. Annual Review of Sociology, 27(1), 415–444.

    Article  Google Scholar 

  • Naruchitparames, J., Güneş, M.H., & Louis, S.J. (2011). Friend recommendations in social networks using genetic algorithms and network topology. In 2011 IEEE Congress of Evolutionary Computation (CEC) (pp. 2207–2214). IEEE.

  • Salton, G., & McGill, M.J. (1986). Introduction to modern information retrieval. New York: McGraw-Hill, Inc.

    MATH  Google Scholar 

  • Szymkiewic, D. (1934). Une contribution statistique a la geographie floristique. Acta Societatis Botanicorum Poloniae, 34(3), 249–265.

    Google Scholar 

  • Wang, P., Xu, B., Wu, Y., & Zhou, X. (2015). Link prediction in social networks: The state-of-the-art. Science China Information Sciences, 58, 1–38.

    Google Scholar 

  • Zhou, T., Lü, L., & Zhang, Y.-C. (2009). Predicting missing links via local information. The European Physical Journal B, 71(4), 623–630.

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to express their gratitude to the two anonymous reviewers whose comments and suggestions have helped to clarify and improve the overall quality of this manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Riccardo Giubilei.

Ethics declarations

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Giubilei, R., Brutti, P. Supervised Classification for Link Prediction in Facebook Ego Networks With Anonymized Profile Information. J Classif 39, 302–325 (2022). https://doi.org/10.1007/s00357-021-09408-2

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00357-021-09408-2

Keywords

Navigation