Skip to main content

Parallelizing the Computation of PageRank

  • Conference paper
Algorithms and Models for the Web-Graph (WAW 2007)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4863))

Included in the following conference series:

Abstract

This paper presents a technique we call ParaSolve that exploits the sparsity structure of the web graph matrix to improve on the degree of parallelism in a number of distributed approaches for computating PageRank. Specifically, a typical algorithm (such as power iteration or GMRES) for solving the linear system corresponding to PageRank, call it LinearSolve, may be converted to a distributed algorithm, Distrib(LinearSolve), by partitioning the problem and applying a standard technique (i.e., Distrib). By reducing the number of inter-partition multiplications, we may greatly increase the degree of parallelism, while achieving a similar degree of accuracy. This should lead to increasingly better performance as we utilize more processors. For example, using GeoSolve (a variant of Jacobi iteration) as our linear solver and the 2001 web graph from Stanford’s WebBase project, on 12 processors ParaSolve(GeoSolve) outperforms Distrib(GeoSolve) by a factor of 1.4, while on 32 processors the performance ratio improves to 2.8.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://law.dsi.unimi.it/

  2. http://webgraph.dsi.unimi.it/

  3. http://www.cs.brown.edu/rooms/ilab/

  4. http://www.osl.iu.edu/research/mtl/

  5. Balay, S., Buschelman, K., Eijkhout, V., Gropp, W.D., Kaushik, D., Knepley, M.G., McInnes, L.C., Smith, B.F., Zhang, H.: PETSc users manual. Technical Report ANL-95/11 - Revision 2.1.5, Argonne National Laboratory (2004)

    Google Scholar 

  6. Gleich, D., Zhukov, L., Berkhin, P.: Fast parallel pagerank: A linear system approach. In: WWW 2005: Proceedings of the 14th international conference on World Wide Web, ACM Press, New York (2005)

    Google Scholar 

  7. Kamvar, S., Haveliwala, T., Manning, C., Golub, G.: Exploiting the block structure of the web for computing pagerank. Technical report, Stanford University Technical Report (2003)

    Google Scholar 

  8. Kohlschütter, C., Chirita, P.-A., Nejdl, W.: Efficient parallel computation of pagerank. In: Lalmas, M., MacFarlane, A., Rüger, S., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds.) Advances in Information Retrieval. LNCS, vol. 3936, pp. 241–252. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  9. McSherry, F.: A uniform approach to accelerated pagerank computation. In: WWW 2005: Proceedings of the 14th international conference on World Wide Web, pp. 575–582. ACM Press, New York (2005)

    Chapter  Google Scholar 

  10. Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Anthony Bonato Fan R. K. Chung

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wicks, J., Greenwald, A. (2007). Parallelizing the Computation of PageRank. In: Bonato, A., Chung, F.R.K. (eds) Algorithms and Models for the Web-Graph. WAW 2007. Lecture Notes in Computer Science, vol 4863. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77004-6_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77004-6_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77003-9

  • Online ISBN: 978-3-540-77004-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics