Abstract
In recent times, real-world systems have been rapidly growing in size resulting in increased complexity. Networks are an interpretation of the complex system which describes the structure of a complex system by understanding the relations between the elements. Link prediction plays an important role in network analysis through which we can observe the hidden or missing link between the nodes. In this paper, we have proposed an improved hybrid similarity-based link prediction approach. We use five different metrics to experimentally evaluate the performance of the proposed approach such as AUC, precision, recall, F1 score, and accuracy. We also compare the proposed approach with recent and existing link prediction approaches against six real-world datasets. We find that the proposed approach performs well for all the considered metrics as compared to other existing link prediction approaches. Additionally, we also compare the proposed approach against state-of-the-art link prediction approaches using classification-based machine learning algorithms such as logistic regression, random forest, k-nearest neighbors, and naïve Bayes. The results show that the proposed approach outdoes the other link prediction approaches in terms of AUC. The proposed method has potential practical use for network analysis in various networks such as information networks, technological networks, and social networks.
Similar content being viewed by others
References
Adamic LA, Adar E (2003) Friends and neighbors on the Web. Soc Netw 25(3):211–230
Aggarwal CC (2011) An introduction to social network data analytics. Springer, US, pp 1–15
Ahmad I, Akhtar MU, Noor S, Shahnaz A (2020) Missing link prediction using common neighbor and centrality based parameterized algorithm. Sci Rep 10(1):364
Ayoub J, Lotfi D, El Marraki M, Hammouch A (2020) Accurate link prediction method based on path length between a pair of unlinked nodes and their degree. Soc Netw Anal Min 10(1):9. https://doi.org/10.1007/s13278-019-0618-2
Barabási AL, Jeong H, Néda Z, Ravasz E, Schubert A, Vicsek T (2002) Evolution of the social network of scientific collaborations. Phys Stat Mech Its Appl 311(3–4):590–614
Berahmand K, Nasiri E, Forouzandeh S, Li Y (2022) A preference random walk algorithm for link prediction through mutual influence nodes in complex networks. J King Saud Univ-Comput Inf Sci 34(8):5375–5387. https://doi.org/10.1016/j.jksuci.2021.05.006
Cho H, Yu Y (2018) Link prediction for interdisciplinary collaboration via co-authorship network. Soc Netw Anal Min 8(1):25. https://doi.org/10.1007/s13278-018-0501-6
Cormen T H, Leiserson C E, Rivest R L, Stein C (eds) (2001) Introduction to algorithms, 2nd Edition, MIT Press, Cambridge, Massachusetts
Daud NN, Ab Hamid SH, Saadoon M, Sahran F, Anuar NB (2020) Applications of link prediction in social networks: a review. J Network Comput Appl 166:102716. https://doi.org/10.1016/j.jnca.2020.102716
Dong L, Li Y, Yin H, Le H, Rui M (2013) The algorithm of link prediction on social network. Math Probl Eng 2013:1–7
Du X, Yan J, Zhang R, Zha H (2020) Cross-network skip-gram embedding for joint network alignment and link prediction. IEEE Trans Knowl Data Eng 34(3):1080–1095
Gao Z, Rezaeipanah A (2023) A novel link prediction model in multilayer online social networks using the development of katz similarity metric. Neural Process Lett 55(4):4989–5011. https://doi.org/10.1007/s11063-022-11076-1
Grover A, Leskovec J (2016) Node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining (pp 855-864)
Guimerà R, Danon L, Díaz-Guilera A, Giralt F, Arenas A (2003) Self-similar community structure in a network of human interactions. Phys Rev E 68(6):065103. https://doi.org/10.1103/PhysRevE.68.065103
Hossin M, Sulaiman MN (2015) A review on evaluation metrics for data classification evaluations. Int J Data Min Knowl Manag Proc 5(2):1
Katz L (1953) A new status index derived from sociometric analysis. Psychometrika 18(1):39–43
Keikha MM, Rahgozar M, Asadpour M (2021) DeepLink: a novel link prediction framework based on deep learning. J Inf Sci 47(5):642–657
Kumar A, Singh SS, Singh K, Biswas B (2020) Link prediction techniques, applications, and performance: a survey. Phys Stat Mech Its Appl 553:124289
Liben-Nowell D, Kleinberg J (2007) The link-prediction problem for social networks. J Am Soc Inf Sci Technol 58(7):1019–1031
Lusseau D, Schneider K, Boisseau OJ, Haase P, Slooten E, Dawson SM (2003) The bottlenose dolphin community of doubtful sound features a large proportion of long-lasting associations. Behav Ecol Sociobiol 54(4):396–405. https://doi.org/10.1007/s00265-003-0651-y
Martínez V, Berzal F, Cubero JC (2016) A survey of link prediction in complex networks. ACM Comput Surv (CSUR) 49(4):1–33. https://doi.org/10.1145/3012704
Mishra S, Singh SS, Kumar A, Biswas B (2022) MNERLP-MUL: merged node and edge relevance based link prediction in multiplex networks. J Comput Sci 60:101606. https://doi.org/10.1016/j.jocs.2022.101606
Newman M (2010) Networks: an introduction. Oxford University Press, Great Britain
Page L, Brin S, Motwani R, Winograd T (2023) The pagerank citation ranking: bringing order to the web. Accessed 21 Jan 2023. http://ilpubs.stanford.edu:8090/422/
Papadimitriou A, Symeonidis P, Manolopoulos Y (2012) Fast and accurate link prediction in social networking systems. J Syst Softw 85(9):2119–2132
Rai AK, Kumar S (2022) Identifying the leaders and main conspirators of the attacks in terrorist networks. ETRI J 44(6):977–990. https://doi.org/10.4218/etrij.2021-0239
Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabási AL (2002) Hierarchical organization of modularity in metabolic networks. Science 297(5586):1551–1555
Rossi, R., & Ahmed, N. (2015). The Network Data Repository with Interactive Graph Analytics and Visualization. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9277
Sorensen TA (1948) A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. Biol Skar 5:1–34
Kumari A, Behera R K, Sahoo K, Nayyar A, Luhach A K, Sahoo S P (2022) Supervised link prediction using structured‐based feature extraction in social network - Concurrency and Computation: Practice and Experience 34(17): e5839. https://doi.org/10.1002/cpe.5839
Symeonidis P, Tiakas E (2014) Transitive node similarity: predicting and recommending links in signed social networks. World Wide Web 17(4):743–776
Tong, H, Faloutsos C, and Pan J.Y, "Fast Random Walk with Restart and Its Applications," Sixth International Conference on Data Mining (ICDM'06), Hong Kong, China, 2006, pp. 613-622, https://doi.org/10.1109/ICDM.2006.70.
Ullah A, Wang B, Sheng J, Long J, Khan N, Ejaz M (2022) A novel relevance-based information interaction model for community detection in complex networks. Expert Syst Appl 196:116607. https://doi.org/10.1016/j.eswa.2022.116607
Ullah A, Shao J, Yang Q, Khan N, Bernard CM, Kumar R (2023) LSS: a locality-based structure system to evaluate the spreader’s importance in social complex networks. Expert Syst Appl 228:120326. https://doi.org/10.1016/j.eswa.2023.120326
Wang P, Xu B, Wu Y, Zhou X (2015) Link prediction in social networks: the state-of-the-art. Sci China Inf Sci 58(1):1–38. https://doi.org/10.1007/s11432-014-5237-y
Wang L, Ren J, Xu B, Li J, Luo W, Xia F (2020) MODEL: motif-based deep feature learning for link prediction. IEEE Trans Comput Soc Syst 7(2):503–516
Wang H, Liu G, Hu P (2023) TDAN: transferable domain adversarial network for link prediction in heterogeneous social networks. ACM Trans Knowl Discov Data 18(1):1–22. https://doi.org/10.1145/3610229
Yu C, Zhao X, An L, Lin X (2017) Similarity-based link prediction in social networks: a path and node combined approach. J Inf Sci 43(5):683–695. https://doi.org/10.1177/0165551516664039
Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33(4):452–473. https://doi.org/10.1086/jar.33.4.3629752
Zhou T, Lü L, Zhang Y-C (2009) Predicting missing links via local information. Eur Phys J B 71(4):623–630. https://doi.org/10.1140/epjb/e2009-00335-8
Acknowledgements
We are thankful to the anonymous referee for their helpful suggestions which have improved the manuscript significantly. Also, we want to acknowledge fruitful discussions with Dr. Aavishkar Katti during the revision of the manuscript
Author information
Authors and Affiliations
Contributions
A.S did the theoretical modeling, experimental evaluation, and analysis of the results. A.K.R. was responsible for technical guidance in the analysis of results throughout and also guided effectively for the revision of the manuscript. A.K.Y was responsible for technical guidance in many aspects of the research problem.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sharma, A., Yadav, A.K. & Rai, A.K. A novel and precise approach for similarity-based link prediction in diverse networks. Soc. Netw. Anal. Min. 14, 11 (2024). https://doi.org/10.1007/s13278-023-01160-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-023-01160-2