Abstract
The PageRank algorithm demonstrates the significance of the computation of document ranking of general importance or authority in Web information retrieval. However, doing a PageRank computation for the whole Web graph is both time-consuming and costly. State of the art Web crawler based search engines also suffer from the latency in retrieving a complete Web graph for the computation of PageRank. We look into the problem of computing PageRank in a decentralized and timely fashion by making use of SiteRank and aggregating rankings from multiple sites. A SiteRank is basically the ranking generated by applying the classical PageRank algorithm to the graph of Web sites, i.e., the Web graph at the granularity of Web sites instead of Web pages. Our empirical results show that SiteRank also follows a power-law distribution. Our experimental results demonstrate that the decomposition of global Web document ranking computation by making use of SiteRank is a very promising approach for computing global document rankings in a decentralized Web search system. In particular, by sharing SiteRank among member servers, such a search system also obtains a new means to fight link spamming.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aberer, K., Wu, J.: A framework for decentralized ranking in web information retrieval. In: Zhou, X., Zhang, Y., Orlowska, M.E. (eds.) APWeb 2003. LNCS, vol. 2642, pp. 213–226. Springer, Heidelberg (2003)
Abiteboul, S., Preda, M., Cobena, G.: Adaptive on-line page importance computation. In: Proceedings of World Wide Wed Conference 2003 (WWW 2003), Budapest, Hungary, May 2003, May 20-24 (2003)
Bharat, K., Chang, B.-W., Henzinger, M., Ruhl, M.: Who links to whom: Mining linkage between web sites. In: Proceedings of the IEEE International Conference on Data Mining (ICDM 2001), San Jose, USA (November 2001)
Faloutsos, M., Faloutsos, P., Faloutsos, C.: On power-law relationships of the internet topology. In: SIGCOMM, pp. 251–262 (1999)
Harchol-Balter, M., Leighton, T., Lewin, D.: Resource discovery in distributed networks. In: Proceedings of the eighteenth annual ACM symposium on Principles of distributed computing, pp. 229–237. ACM Press, New York (1999)
Sepandar, D., Kamvar, T.H., Haveliwala, C.D.: Manning, and Gene H. Golub. Exploiting the block structure of theweb for computing pagerank. Technical report, Stanford University (March 2003) (submitted on 4th of March 2003)
Kleinberg, J.: Authoritative sources in a hyperlinked environment. In: Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (1998)
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web. Technical report, Stanford University (January 1998)
Pandurangan, G., Raghavan, P., Upfal, E.: Using pagerank to characterize web structure. In: Ibarra, O.H., Zhang, L. (eds.) COCOON 2002. LNCS, vol. 2387, p. 330. Springer, Heidelberg (2002)
Wu, J., Aberer, K.: Using siterank in p2p information retrieval. Technical Report IC/2004/31, Swiss Federal Institute of Technology, Lausanne, Switzerland (March 2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, J., Aberer, K. (2004). Using SiteRank for Decentralized Computation of Web Document Ranking. In: De Bra, P.M.E., Nejdl, W. (eds) Adaptive Hypermedia and Adaptive Web-Based Systems. AH 2004. Lecture Notes in Computer Science, vol 3137. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27780-4_30
Download citation
DOI: https://doi.org/10.1007/978-3-540-27780-4_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22895-0
Online ISBN: 978-3-540-27780-4
eBook Packages: Springer Book Archive