Skip to main content

Supervised Link Prediction Using Random Walks

  • Conference paper
  • First Online:
Social Media Processing (SMP 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 568))

Included in the following conference series:

Abstract

Network structure has become increasingly popular in big-data representation over the last few years. As a result, network based analysis techniques are applied to networks containing millions of nodes. Link prediction helps people to uncover the missing or unknown links between nodes in networks, which is an essential task in network analysis.

Random walk based methods have shown outstanding performance in such task. However, the primary bottleneck for such methods is adapting to networks with different structure and dynamics, and scaling to the network magnitude. Inspired by Random Walk with Restart (RWR), a promising approach for link prediction, this paper proposes a set of path based features and a supervised learning technique, called Supervised Random Walk with Restart (SRWR) to identify missing links. We show that by using these features, a classifier can successfully order the potential links by their closeness to the query node. A new type of heterogeneous network, called Generalized Bi-relation Netowrk (GBN), is defined in this paper, upon which the novel structural features are introduced. Finally experiments are performed on a disease-chemical-gene interaction network, whose result shows SRWR significantly outperforms standard RWR algorithm in terms of the Area Under ROC Curve (AUC) gained and better than or equal to the best algorithms in the field of gene prioritization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Also named “similarity”, “closeness” or other similar words in literature, they will be used interchangable in following text.

References

  1. Backstrom, L., Leskovec, J.: Supervised random walks: predicting and recommending links in social networks. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, WSDM 2011, pp. 635–644. ACM, New York (2011)

    Google Scholar 

  2. Bromberg, Y.: Disease gene prioritization. PLoS Comput. Biol. 9(4), e1002902 (2013). 00014

    Article  Google Scholar 

  3. Chakrabarti, S., Agarwal, A.: Learning parameters in entity relationship graphs from ranking preferences. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) PKDD 2006. LNCS (LNAI), vol. 4213, pp. 91–102. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  4. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Tech. 2(3), 27:1–27:27 (2011). 22106

    Article  Google Scholar 

  5. Cohen, S., Kimelfeld, B., Koutrika, G.: A Survey on Proximity Measures for Social Networks. In: Ceri, S., Brambilla, M. (eds.) Search Computing. LNCS, vol. 7538, pp. 191–206. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  6. Cukierski, W., Hamner, B., Yang, B.: Graph-based features for supervised link prediction. In: The 2011 International Joint Conference on Neural Networks (IJCNN), pp. 1237–1244, July 2011

    Google Scholar 

  7. Davis, A.P., Grondin, C.J., Lennon-Hopkin, K., Saraceni-Richards, C., Sciaky, D., King, B.L., Wiegers, T.C., Mattingly, C.J.: The comparative toxicogenomics database’s 10th year anniversary: update 2015. Nucleic Acids Res. 43(Database issue), D914–D920 (2015)

    Article  Google Scholar 

  8. Fire, M., Tenenboim, L., Lesser, O., Puzis, R., Rokach, L., Elovici, Y.: Link prediction in social networks using computationally efficient topological features. In: 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), pp. 73–80, October 2011

    Google Scholar 

  9. Hasan, M.A., Chaoji, V., Salem, S., Zaki, M.: Link prediction using supervised learning. In: Proceedings of SDM 2006 Workshop on Link Analysis. Counterterrorism and Security (2006). 00358

    Google Scholar 

  10. Hasan, M.A., Zaki, M.J.: A survey of link prediction in social networks. In: Aggarwal, C.C. (ed.) Social Network Data Analytics, pp. 243–275. Springer, USA (2011). 00107

    Chapter  Google Scholar 

  11. Jaccard, P.: Étude comparative de la distribution florale dans une portion des Alpes et du Jura. Bulletin de la Societe Vaudoise des Sciences Naturelles 37(142), 547–579 (1901)

    Google Scholar 

  12. Katz, L.: A new status index derived from sociometric analysis. Psychometrika 18(1), 39–43 (1953)

    Article  MATH  Google Scholar 

  13. Lao, N., Cohen, W.W.: Fast query execution for retrieval models based on path-constrained random walks. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2010, pp. 881–888. ACM, New York (2010)

    Google Scholar 

  14. Li, Y., Patra, J.C.: Genome-wide inferring genecphenotype relationship by walking on the heterogeneous network. Bioinformatics 26(9), 1219–1224 (2010)

    Article  Google Scholar 

  15. Liben-Nowell, D., Kleinberg, J.: The link prediction problem for social networks. In: Proceedings of the Twelfth International Conference on Information and Knowledge Management, CIKM 2003, pp. 556–559. ACM, New York (2003)

    Google Scholar 

  16. Lu, L., Zhou, T.: Link prediction in complex networks: a survey. Physica A Stat. Mech. Appl. 390(6), 1150–1170 (2011)

    Article  Google Scholar 

  17. Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web (1999)

    Google Scholar 

  18. Pan, J.Y., Yang, H.J., Faloutsos, C., Duygulu, P.: Automatic multimedia cross-modal correlation discovery. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2004, pp. 653–658. ACM, New York (2004)

    Google Scholar 

  19. Salton, G.: Introduction to Modern Information Retrieval. Mcgraw-Hill College, New York (1983)

    MATH  Google Scholar 

  20. Tong, H., Faloutsos, C., Pan, J.Y.: Fast random walk with restart and its applications. In: Proceedings of the Sixth International Conference on Data Mining, ICDM 2006, pp. 613–622. IEEE Computer Society, Washington, DC, USA (2006)

    Google Scholar 

  21. Xia, J., Caragea, D., Hsu, W.: Bi-relational network analysis using a fast random walk with restart. In: Ninth IEEE International Conference on Data Mining, ICDM 2009, pp. 1052–1057 (2009). 00011

    Google Scholar 

  22. Xie, M., Hwang, T., Kuang, R.: Prioritizing disease genes by Bi-random walk. In: Tan, P.-N., Chawla, S., Ho, C.K., Bailey, J. (eds.) PAKDD 2012, Part II. LNCS, vol. 7302, pp. 292–303. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  23. Zhang, J., Kong, X., Yu, P.S.: Predicting social links for new users across aligned heterogeneous social networks, October 2013. arXiv: arXiv:1310.3492 [physics]

Download references

Acknowledgement

This material is supported by National Institutes of Health under the grant number R01LM011986. The content of the information in this document does not necessarily reflect the position or the policy of the Government, and no official endorsement should be inferred. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation here on. This work is also supported in part by the National High-Technology Research and Development Program (863 Program) of China under Grand 2013AA01A212, National Science Foundation Grant 61272067, 61370229 and Jiaying University Grant (“Collaboration Mechanism and Application in Social Networks.”).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuechang Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer Science+Business Media Singapore

About this paper

Cite this paper

Liu, Y., Tong, H., Xie, L., Tang, Y. (2015). Supervised Link Prediction Using Random Walks. In: Zhang, X., Sun, M., Wang, Z., Huang, X. (eds) Social Media Processing. SMP 2015. Communications in Computer and Information Science, vol 568. Springer, Singapore. https://doi.org/10.1007/978-981-10-0080-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-0080-5_10

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-0079-9

  • Online ISBN: 978-981-10-0080-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics