Abstract
The PageRank algorithm is a widely used linear algebra method with many applications. As graphs with billions or more of nodes become increasingly common, being able to scale this algorithm on modern HPC architectures is of prime importance. While most existing approaches have explored distributed computing to compute an approximation of the PageRank scores, we focus on the numerical computation using the power iteration method. We develop and implement a distributed parallel version of the PageRank. This application is deployed on the supercomputer Fugaku, using up to one thousand compute nodes to assess scalability on random stochastic matrices. These large-scale experiments show that the network-on-chip of the A64FX processor acts as an additional level of computation, in between nodes and cores.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Code is available at https://github.com/jgurhem/TBSLA/tree/dev_array.
References
Ajima, Y., et al.: The tofu interconnect D. In: 2018 IEEE International Conference on Cluster Computing (CLUSTER), pp. 646–654 (2018). https://doi.org/10.1109/CLUSTER.2018.00090
Alappat, C.L., et al.: Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX. CoRR abs/2009.13903 (2020). https://arxiv.org/abs/2009.13903
Dai, L., Freris, N.M.: Fully distributed pagerank computation with exponential convergence. arXiv preprint arXiv:1705.09927 (2017)
De Jager, D.: PageRank: three distributed algorithms. Master’s thesis, Imperial College London, London, pubs. doc. ic. ac. uk/pagerank-algorithms (2004)
Dongarra, J.: Report on the Fujitsu Fugaku system. University of Tennessee-Knoxville Innovative Computing Laboratory, Technical Report ICLUT-20-06 (2020)
Dongarra, J., Heroux, M.A., Luszczek, P.: High-performance conjugate-gradient benchmark: a new metric for ranking high-performance computing systems. Int. J. High Perform. Comput. Appl. 30(1), 3–10 (2016)
Guo, T., Cao, X., Cong, G., Lu, J., Lin, X.: Distributed algorithms on exact personalized PageRank. In: Proceedings of the 2017 ACM International Conference on Management of Data, pp. 479–494 (2017)
Gurhem, J., Vandromme, M., Tsuji, M., Petiton, S.G., Sato, M.: Sequences of sparse matrix-vector multiplication on Fugaku’s A64FX processors. In: 2021 IEEE International Conference on Cluster Computing (CLUSTER), pp. 751–758. IEEE (2021)
Hugues, M.R., Petiton, S.G.: Sparse matrix formats evaluation and optimization on a GPU. In: 2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC), pp. 122–129. IEEE (2010)
Ihde, N., et al.: A survey of big data, high performance computing, and machine learning benchmarks (2021)
Ishii, H., Tempo, R., Bai, E.W.: A web aggregation approach for distributed randomized PageRank algorithms. IEEE Trans. Autom. Control 57(11), 2703–2717 (2012)
Klicpera, J., Bojchevski, A., Günnemann, S.: Predict then propagate: graph neural networks meet personalized PageRank. arXiv preprint arXiv:1810.05997 (2018)
Lin, W.: Distributed algorithms for fully personalized PageRank on large graphs. In: The World Wide Web Conference, pp. 1084–1094 (2019)
Lumsdaine, A., Gregor, D., Hendrickson, B., Berry, J.: Challenges in parallel graph processing. Parallel Process. Lett. 17(01), 5–20 (2007)
Ma, N., Guan, J., Zhao, Y.: Bringing PageRank to the citation analysis. Inf. Process. Manage. 44(2), 800–810 (2008)
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web. Technical report, Stanford InfoLab (1999)
Pop, F., Dobre, C.: An efficient PageRank approach for urban traffic optimization. Mathematical Problems in Engineering 2012 (2012)
Das Sarma, A., Molla, A.R., Pandurangan, G., Upfal, E.: Fast distributed PageRank computation. In: Frey, D., Raynal, M., Sarkar, S., Shyamasundar, R.K., Sinha, P. (eds.) ICDCN 2013. LNCS, vol. 7730, pp. 11–26. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-35668-1_2
Sato, M., et al.: Co-design for A64FX manycore processor and “Fugaku". In: SC20: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1–15 (2020). https://doi.org/10.1109/SC41405.2020.00051
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Vandromme, M., Gurhem, J., Tsuji, M., Petiton, S., Sato, M. (2022). Scaling the PageRank Algorithm for Very Large Graphs on the Fugaku Supercomputer. In: Groen, D., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A. (eds) Computational Science – ICCS 2022. ICCS 2022. Lecture Notes in Computer Science, vol 13350. Springer, Cham. https://doi.org/10.1007/978-3-031-08751-6_28
Download citation
DOI: https://doi.org/10.1007/978-3-031-08751-6_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08750-9
Online ISBN: 978-3-031-08751-6
eBook Packages: Computer ScienceComputer Science (R0)