Skip to main content

Improved Link-Based Algorithms for Ranking Web Pages

  • Conference paper
Advances in Web-Age Information Management (WAIM 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3129))

Included in the following conference series:

Abstract

Several link-based algorithms, such as PageRank [7], HITS [4] and SALSA [5], have been developed to evaluate the popularity of web pages. These algorithms can be interpreted as computing the steady-state distribution of various Markov processes over web pages. The PageRank and HITS algorithms tend to over-rank tightly interlinked collections of pages, such as well-organized message boards. We show that this effect can be alleviated using a number of modifications to the underlying Markov process. Specifically, rather than weight all outlinks from a given page equally, greater weight is given to links between pages that are, in other respects, further off in the web, and less weight is given to links between pages that are nearby. We have experimented with a number of variants of this idea, using a number of different measures of ”distance” in the Web, and a number of different weighting schemes. We show that these revised algorithms often do avoid the over-ranking problem and give better overall rankings.

This work is supported by NSF grant #IIS-0097537. There is an extended version of this paper as NYU computer science dept. technical report TR2003-846, online at http://csdocs.cs.nyu.edu/Dienst/UI/2.0/Describe/ncstrl.nyu cs/TR2003-846.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Albert, R., Jeong, H., Barabasi, A.: Diameter of the world-wide-web. Nature 401, 130–131 (1999)

    Article  Google Scholar 

  2. Borodin, A., Roberts, G., Rosenthal, J., Tsaparas, P.: Finding authorities and hubs from link structure on the world wide web. In: Proc. 10th World Wide Web Conference (2001)

    Google Scholar 

  3. Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J.: Graph structure in the web. Computer Networks and ISDN Systems 30, 309–320 (2000)

    Google Scholar 

  4. Kleinberg, L.: Authoritative sources in a hyperlinked environment. In: Proc. 9th ACM-SIAM Symposium on Discrete Algorithms (1998)

    Google Scholar 

  5. Lempel, R., Moran, S.: The stochastic approach for link-structure analysis (salsa) and the tkc effect. In: Proc. 9th International World Wide Web Conference (2000)

    Google Scholar 

  6. Ng, A., Zheng, A., Jordan, M.: Stable algorithms for link analysis. In: Proc. ACM SIGIR (2001)

    Google Scholar 

  7. Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the web. Stanford Digital Library working paper, 1997-0072 (1997)

    Google Scholar 

  8. Wang, Z.: Improved link-based algorithm for ranking web pages. NYU Computer Science Dept. technical report TR2003-846, extended version (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, Z. (2004). Improved Link-Based Algorithms for Ranking Web Pages. In: Li, Q., Wang, G., Feng, L. (eds) Advances in Web-Age Information Management. WAIM 2004. Lecture Notes in Computer Science, vol 3129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27772-9_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27772-9_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22418-1

  • Online ISBN: 978-3-540-27772-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics