Skip to main content

An Adaptive Method for the Efficient Similarity Calculation

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5463))

Included in the following conference series:

Abstract

SimRank is a well-known algorithm for similarity calculation based on object-to-object relationship. However, it suffers from high computation cost. In this paper, we find that the convergence behavior of different object pairs is different when we use SimRank to compute the similarity of objects. Many similarity scores converge fast, while others need more time before convergence. Based on this observation, we propose an adaptive method called Adaptive-SimRank to speed up similarity calculation. Using this method, we don’t need to recalculate those converged pairs’ similarity. The experiments conducted on web datasets and synthetic dataset show that our new method can reduce the running time by nearly 35%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jeh, G., Widom, J.: SimRank: A measure of structural-context similarity. In: SIGKDD (2002)

    Google Scholar 

  2. Small, H.: Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science (1973)

    Google Scholar 

  3. Kessler, M.M.: Bibliographic coupling between scientific papers. American Documentation (1963)

    Google Scholar 

  4. Amsler, R.: Applications of citation-based automatic classification. Linguistic Research Center (1972)

    Google Scholar 

  5. Fogaras, D., Racz, B.: Scaling link-base similarity search. In: WWW (2005)

    Google Scholar 

  6. Yin, X.X., Han, J.W., Yu, P.S.: LinkClus: Efficient Clustering via Heterogeneous Semantic Links. In: VLDB (2006)

    Google Scholar 

  7. Jeh, G., Widom, J.: Scaling personalized web search, Technical report (2001)

    Google Scholar 

  8. Page, L., Brin, S., Motwani, R., References, T.: The PageRank citation ranking: Bringing order to the Web, Technical report (1998)

    Google Scholar 

  9. Langville, A.N., Meyer, C.D.: Deeper inside PageRank. Internet Math. J. (2003)

    Google Scholar 

  10. Kamvar, S., Haveliwala, T., Golub, G.: Adaptive Methods for the Computation of PageRank, Technical report (2003)

    Google Scholar 

  11. CMU four university data set, http://www.cs.cmu.edu/afs/cs/project/theo-20/www/data/

  12. Han, J.W., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2001)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cai, Y., Liu, H., He, J., Du, X., Jia, X. (2009). An Adaptive Method for the Efficient Similarity Calculation. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds) Database Systems for Advanced Applications. DASFAA 2009. Lecture Notes in Computer Science, vol 5463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00887-0_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-00887-0_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00886-3

  • Online ISBN: 978-3-642-00887-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics