Skip to main content
Log in

Application of data mining methods for link prediction in social networks

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Using social networking services is becoming more popular day by day. Social network analysis views relationships in terms of nodes (people) and edges (links or connections—the relationship between the people). The websites of the social networks such as Facebook currently are among the most popular internet services just after giant portals such as Yahoo, MSN and search engines such as Google. One of the main problems in analyzing these networks is the prediction of relationships between people in the network. It is hard to find one method that can identify relation between people in the social network. The purpose of this paper is to forecast the friendship relation between individuals among a social network, especially the likelihood of a relation between an existing member with a new member. For this purpose, we used a few hypotheses to make the graph of relationships between members of social network, and we used the method of logistic regression to complete the graph. Test data from Flickr website are used to evaluate the proposed method. The results show that the method has achieved 99 % accuracy in prediction of friendship relationships.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Boyd DM, Ellison NB (2008) Social network sites: definition, history and scholarship. J Comput Mediat Commun, pp 210–230

  • Corlette D, Shipman FM (2010) Link prediction applied to an open large-scale online social network. In: Proceedings of the 21st ACM conference on Hypertext and hypermedia

  • Cranshaw J, Toch E, Hong J, Kittur A, Sadeh N (2010) Bridging the gap between physical location and online social networks. In: Proceedings of UBICOMP’10, New York, pp 119–128

  • Faloutsos C, McCurley K, Tomkins A (2004) Fast Discovery of Connection Subgraphs. In: International Conference Knowledge Discovery and Data Mining, New York, NY, USA

  • Hoppe B, Reinelt C (2010) Social network analysis and the evaluation of leadership networks., The Leadership Quarterly, vol 21, issue 4. Elsevier, Amsterdam, pp 600–619

    Google Scholar 

  • Hosmer DW, Lemeshow (2000) Applied Logistic Regression, 2nd edn. Wiley, New York

  • Kaggle.com. IJCNN social network challenge (2011) http://www.kaggle.com/socialNetwork

  • Kleinbaum DG, Klein M, Pryor ER (2010) Logistic Regression: a self learning text, Springer Science and Business Media LLC

  • Narayana A, Shi E, Rubinsten BIP (2011) Linke prediction by de-anonymization: how we won the kaggle social network challenge, The 2011 International Joint Conference Neural Networks (IJCNN)

  • Rasekh AH, Liaghat Z (2009) Predictive relationship of friendship in social networks. In: The Fifth 2011 International Conference on Data Mining, pp 14–15, 47–53

  • Safaei M, Sahan M, Ilkan M (2009) Social graph generation & forecasting using social network mining, 33rd Annual IEEE International Computer Software and Applications Conference

  • Wang Y (2005) A multinomial logistic regression modeling approach for anomaly intrusion detection. Comput Secur 24(8):647–662

    Article  Google Scholar 

  • Witten H, Eibe F (2005) Data Mining. Morgan Kauf-mann Publishers, San Francisco

  • Zou KH, James A, Malley O, Mauri L (2007) Receiver-operating characteristic analysis for evaluating diagnostic tests and predictive models. Circ J Am Heart Assoc 115(15):654–657

    Google Scholar 

  • Zhang J, Jin R, Yang Y, Hauptmann (2003) A. G. Modified Logistic Regression: an approximation to svm and it is applications in large-scale text categorization. In: Proceedings of the 20th International Conference on Machine Learning, Washington, vol 21–24, pp 888–895

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zeinab Liaghat.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liaghat, Z., Rasekh, A.H. & Mahdavi, A. Application of data mining methods for link prediction in social networks. Soc. Netw. Anal. Min. 3, 143–150 (2013). https://doi.org/10.1007/s13278-013-0097-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13278-013-0097-9

Keywords

Navigation