An Adaptive Method for the Efficient Similarity Calculation

Cai, Yuanzhe; Liu, Hongyan; He, Jun; Du, Xiaoyong; Jia, Xu

doi:10.1007/978-3-642-00887-0_31

Yuanzhe Cai^19,20,
Hongyan Liu²¹,
Jun He^19,20,
Xiaoyong Du^19,20 &
…
Xu Jia^19,20

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5463))

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

1554 Accesses

Abstract

SimRank is a well-known algorithm for similarity calculation based on object-to-object relationship. However, it suffers from high computation cost. In this paper, we find that the convergence behavior of different object pairs is different when we use SimRank to compute the similarity of objects. Many similarity scores converge fast, while others need more time before convergence. Based on this observation, we propose an adaptive method called Adaptive-SimRank to speed up similarity calculation. Using this method, we don’t need to recalculate those converged pairs’ similarity. The experiments conducted on web datasets and synthetic dataset show that our new method can reduce the running time by nearly 35%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

SimSky: An Accuracy-Aware Algorithm for Single-Source SimRank Search

SimRank*: effective and scalable pairwise similarity search based on graph topology

Article Open access 11 January 2019

Fast computation of General SimRank on heterogeneous information network

Article Open access 21 May 2024

References

Jeh, G., Widom, J.: SimRank: A measure of structural-context similarity. In: SIGKDD (2002)
Google Scholar
Small, H.: Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science (1973)
Google Scholar
Kessler, M.M.: Bibliographic coupling between scientific papers. American Documentation (1963)
Google Scholar
Amsler, R.: Applications of citation-based automatic classification. Linguistic Research Center (1972)
Google Scholar
Fogaras, D., Racz, B.: Scaling link-base similarity search. In: WWW (2005)
Google Scholar
Yin, X.X., Han, J.W., Yu, P.S.: LinkClus: Efficient Clustering via Heterogeneous Semantic Links. In: VLDB (2006)
Google Scholar
Jeh, G., Widom, J.: Scaling personalized web search, Technical report (2001)
Google Scholar
Page, L., Brin, S., Motwani, R., References, T.: The PageRank citation ranking: Bringing order to the Web, Technical report (1998)
Google Scholar
Langville, A.N., Meyer, C.D.: Deeper inside PageRank. Internet Math. J. (2003)
Google Scholar
Kamvar, S., Haveliwala, T., Golub, G.: Adaptive Methods for the Computation of PageRank, Technical report (2003)
Google Scholar
CMU four university data set, http://www.cs.cmu.edu/afs/cs/project/theo-20/www/data/
Han, J.W., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2001)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Key Labs of Data Engineering and Knowledge Engineering, MOE, P.R. China
Yuanzhe Cai, Jun He, Xiaoyong Du & Xu Jia
School of Information, Renmin University of China, P.R. China
Yuanzhe Cai, Jun He, Xiaoyong Du & Xu Jia
Department of Management Science and Engineering, Tsinghua University, P.R. China
Hongyan Liu

Authors

Yuanzhe Cai
View author publications
You can also search for this author in PubMed Google Scholar
Hongyan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jun He
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyong Du
View author publications
You can also search for this author in PubMed Google Scholar
Xu Jia
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electrical Engineering, The University of Queensland, QLD 4072, Brisbane, Australia
Xiaofang Zhou & Ke Deng &
Tokyo Institute of Technology, Graduate School of Information Science and Engineering, 2-12-1 Oh-Okayama Meguro-ku, 152-8552, Tokyo, Japan
Haruo Yokota
CSIRO, Castray Esplanade, TAS 7000, Hobart, Australia
Qing Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cai, Y., Liu, H., He, J., Du, X., Jia, X. (2009). An Adaptive Method for the Efficient Similarity Calculation. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds) Database Systems for Advanced Applications. DASFAA 2009. Lecture Notes in Computer Science, vol 5463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00887-0_31

Download citation

DOI: https://doi.org/10.1007/978-3-642-00887-0_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00886-3
Online ISBN: 978-3-642-00887-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics