ABSTRACT
The link prediction problem is to predict the existence of a link between every node pair in the network based on the past observed networks arising in many practical applications such as recommender systems, information retrieval, and the marketing analysis of social networks. Here, we propose a new mathematical programming approach for predicting a future network utilizing the node degree distribution identified from historical observation of the past networks. We develop an integer programming problem for the link prediction problem, where the objective is to maximize the sum of link scores (probabilities) while respecting the node degree distribution of the networks. The performance of the proposed framework is tested on the real-life Facebook networks. The computational results show that the proposed approach can considerably improve the performance of previously published link prediction methods.
- L. A. Adamic and E. Adar. Friends and neighbors on the web. Social Networks, 25(3):211--230, 2003.Google ScholarCross Ref
- M. Al Hasan, V. Chaoji, S. Salem, and M. Zaki. Link prediction using supervised learning. In SDM'06: Workshop on Link Analysis, Counter-terrorism and Security. Citeseer, 2006.Google Scholar
- R. Anstee. A polynomial algorithm for b-matchings: an alternative approach. Information Processing Letters, 24(3):153--157, 1987. Google ScholarDigital Library
- J. Bader, A. Chaudhuri, J. Rothberg, and J. Chant. Gaining confidence in high-throughput protein interaction networks. Nature biotechnology, 22(1):78--85, 2003.Google ScholarCross Ref
- A. Barabási and R. Albert. Emergence of scaling in random networks. Science, 286(5439):509, 1999.Google ScholarCross Ref
- A. L. Barabasi, H. Jeong, Z. Neda, E. Ravasz, A. Schubert, and T. Vicsek. Evolution of the social network of scientific collaborations. Physica a-Statistical Mechanics and Its Applications, 311(3--4):590--614, 2002.Google Scholar
- V. Boginski, S. Butenko, and P. Pardalos. Mining market data: a network approach. Computers & Operations Research, 33(11):3171--3184, 2006. Google ScholarDigital Library
- A. Bradley. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7):1145--1159, 1997. Google ScholarDigital Library
- A. Clauset, C. Moore, and M. Newman. Hierarchical structure and the prediction of missing links in networks. Nature, 453(7191):98--101, 2008.Google ScholarCross Ref
- W. Cook and W. Pulleyblank. Linear systems for constrained matching problems. Mathematics of Operations Research, 12(1):97--120, 1987. Google ScholarDigital Library
- Z. Huang, X. Li, and H. Chen. Link prediction approach to collaborative filtering. In Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries, pages 141--142. ACM, 2005. Google ScholarDigital Library
- H. Jeong, B. Tombor, R. Albert, Z. Oltvai, and A. Barabási. The large-scale organization of metabolic networks. Nature, 407(6804):651--654, 2000.Google ScholarCross Ref
- B. Karrer and M. Newman. Stochastic blockmodels and community structure in networks. Physical Review E, 83(1):016107, 2011.Google ScholarCross Ref
- L. Katz. A new status index derived from sociometric analysis. Psychometrika, 18(1):39--43, 1953.Google ScholarCross Ref
- H. Kim, I. Kim, Y. Lee, and B. Kahng. Scale-free network in stock markets. Journal of Korean Physical Society, 40:1105--1108, 2002.Google Scholar
- D. Liben-Nowell and J. Kleinberg. The link-prediction problem for social networks. Journal of the American Society for Information Science and Technology, 58(7):1019--1031, 2007. Google ScholarDigital Library
- L. Lu and T. Zhou. Link prediction in complex networks: A survey. Arxiv preprint arXiv:1010.0725, 2010.Google Scholar
- C. Manning, P. Raghavan, H. Schütze, and E. Corporation. Introduction to information retrieval, volume 1. Cambridge University Press Cambridge, UK, 2008. Google ScholarCross Ref
- A. Medina, I. Matta, and J. Byers. On the origin of power laws in Internet topologies. ACM SIGCOMM Computer Communication Review, 30(2):18--28, 2000. Google ScholarDigital Library
- M. E. J. Newman. Clustering and preferential attachment in growing networks. Physical Review E, 64(2):025102, 2001.Google ScholarCross Ref
- M. E. J. Newman. The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences of the United States of America, 98(2):404--409, 2001.Google ScholarCross Ref
- M. E. J. Newman. The structure and function of complex networks. SIAM Review, 45(2):167--256, 2003.Google ScholarDigital Library
- G. Salton. Automatic text processing: the transformation. Analysis and Retrieval of Information by Computer, 1989. Google ScholarDigital Library
- J. Shetty and J. Adibi. The Enron email dataset database schema and brief statistical report. Information Sciences Institute Technical Report, 2004.Google Scholar
- B. Viswanath, A. Mislove, M. Cha, and K. P. Gummadi. On the evolution of user interaction in facebook. In Proceedings of the 2nd ACM SIGCOMM Workshop on Social Networks (WOSN'09), August 2009. Google ScholarDigital Library
- S. Zhou and R. Mondragón. Accurately modeling the Internet topology. Physical Review E, 70(6):066108, 2004.Google ScholarCross Ref
Index Terms
- A novel link prediction approach for scale-free networks
Recommendations
A Network Structural Approach to the Link Prediction Problem
The link prediction problem is an emerging real-life social network problem in which data mining techniques have played a critical role. It arises in many practical applications such as recommender systems, information retrieval, and marketing analysis ...
Link Prediction Across Multiple Social Networks
ICDMW '10: Proceedings of the 2010 IEEE International Conference on Data Mining WorkshopsThe problem of link prediction has been studied extensively in literature. There are various versions of the link prediction problem \textit{e.g.,} link existence problem, link removal problem, predicting edge weights over time etc. In this paper we ...
HEM: An Improved Parametric Link Prediction Algorithm Based on Hybrid Network Evolution Mechanism
Advanced Data Mining and ApplicationsAbstractLink prediction plays an important role in the research of complex networks. Its task is to predict missing links or possible new links in the future via existing information in the network. In recent years, many powerful link prediction ...
Comments