Skip to main content

Scaling the PageRank Algorithm for Very Large Graphs on the Fugaku Supercomputer

  • Conference paper
  • First Online:
Computational Science – ICCS 2022 (ICCS 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13350))

Included in the following conference series:

Abstract

The PageRank algorithm is a widely used linear algebra method with many applications. As graphs with billions or more of nodes become increasingly common, being able to scale this algorithm on modern HPC architectures is of prime importance. While most existing approaches have explored distributed computing to compute an approximation of the PageRank scores, we focus on the numerical computation using the power iteration method. We develop and implement a distributed parallel version of the PageRank. This application is deployed on the supercomputer Fugaku, using up to one thousand compute nodes to assess scalability on random stochastic matrices. These large-scale experiments show that the network-on-chip of the A64FX processor acts as an additional level of computation, in between nodes and cores.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Code is available at https://github.com/jgurhem/TBSLA/tree/dev_array.

References

  1. Ajima, Y., et al.: The tofu interconnect D. In: 2018 IEEE International Conference on Cluster Computing (CLUSTER), pp. 646–654 (2018). https://doi.org/10.1109/CLUSTER.2018.00090

  2. Alappat, C.L., et al.: Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX. CoRR abs/2009.13903 (2020). https://arxiv.org/abs/2009.13903

  3. Dai, L., Freris, N.M.: Fully distributed pagerank computation with exponential convergence. arXiv preprint arXiv:1705.09927 (2017)

  4. De Jager, D.: PageRank: three distributed algorithms. Master’s thesis, Imperial College London, London, pubs. doc. ic. ac. uk/pagerank-algorithms (2004)

    Google Scholar 

  5. Dongarra, J.: Report on the Fujitsu Fugaku system. University of Tennessee-Knoxville Innovative Computing Laboratory, Technical Report ICLUT-20-06 (2020)

    Google Scholar 

  6. Dongarra, J., Heroux, M.A., Luszczek, P.: High-performance conjugate-gradient benchmark: a new metric for ranking high-performance computing systems. Int. J. High Perform. Comput. Appl. 30(1), 3–10 (2016)

    Article  Google Scholar 

  7. Guo, T., Cao, X., Cong, G., Lu, J., Lin, X.: Distributed algorithms on exact personalized PageRank. In: Proceedings of the 2017 ACM International Conference on Management of Data, pp. 479–494 (2017)

    Google Scholar 

  8. Gurhem, J., Vandromme, M., Tsuji, M., Petiton, S.G., Sato, M.: Sequences of sparse matrix-vector multiplication on Fugaku’s A64FX processors. In: 2021 IEEE International Conference on Cluster Computing (CLUSTER), pp. 751–758. IEEE (2021)

    Google Scholar 

  9. Hugues, M.R., Petiton, S.G.: Sparse matrix formats evaluation and optimization on a GPU. In: 2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC), pp. 122–129. IEEE (2010)

    Google Scholar 

  10. Ihde, N., et al.: A survey of big data, high performance computing, and machine learning benchmarks (2021)

    Google Scholar 

  11. Ishii, H., Tempo, R., Bai, E.W.: A web aggregation approach for distributed randomized PageRank algorithms. IEEE Trans. Autom. Control 57(11), 2703–2717 (2012)

    Article  MathSciNet  Google Scholar 

  12. Klicpera, J., Bojchevski, A., Günnemann, S.: Predict then propagate: graph neural networks meet personalized PageRank. arXiv preprint arXiv:1810.05997 (2018)

  13. Lin, W.: Distributed algorithms for fully personalized PageRank on large graphs. In: The World Wide Web Conference, pp. 1084–1094 (2019)

    Google Scholar 

  14. Lumsdaine, A., Gregor, D., Hendrickson, B., Berry, J.: Challenges in parallel graph processing. Parallel Process. Lett. 17(01), 5–20 (2007)

    Article  MathSciNet  Google Scholar 

  15. Ma, N., Guan, J., Zhao, Y.: Bringing PageRank to the citation analysis. Inf. Process. Manage. 44(2), 800–810 (2008)

    Article  Google Scholar 

  16. Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web. Technical report, Stanford InfoLab (1999)

    Google Scholar 

  17. Pop, F., Dobre, C.: An efficient PageRank approach for urban traffic optimization. Mathematical Problems in Engineering 2012 (2012)

    Google Scholar 

  18. Das Sarma, A., Molla, A.R., Pandurangan, G., Upfal, E.: Fast distributed PageRank computation. In: Frey, D., Raynal, M., Sarkar, S., Shyamasundar, R.K., Sinha, P. (eds.) ICDCN 2013. LNCS, vol. 7730, pp. 11–26. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-35668-1_2

    Chapter  Google Scholar 

  19. Sato, M., et al.: Co-design for A64FX manycore processor and “Fugaku". In: SC20: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–15 (2020). https://doi.org/10.1109/SC41405.2020.00051

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maxence Vandromme .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vandromme, M., Gurhem, J., Tsuji, M., Petiton, S., Sato, M. (2022). Scaling the PageRank Algorithm for Very Large Graphs on the Fugaku Supercomputer. In: Groen, D., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A. (eds) Computational Science – ICCS 2022. ICCS 2022. Lecture Notes in Computer Science, vol 13350. Springer, Cham. https://doi.org/10.1007/978-3-031-08751-6_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-08751-6_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-08750-9

  • Online ISBN: 978-3-031-08751-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics