Abstract
The use of High Level Synthesis (HLS) tools is on the rise; however, performance modelling research has been mainly focused on regular applications with uniform memory access patterns. These performance models fail to accurately capture the performance of graph applications with irregular memory access patterns. This paper presents a domain-specific performance model targeting graph applications synthesized using HLS tools for FPGAs. The performance model utilizes information from the hardware specification, the application’s kernel, and the graph input. While the compilation process of HLS tools takes hours, the information required by the performance model can be extracted from the intermediate compilation report, which compiles in seconds. The goal of this work is to provide FPGA users with a performance modelling framework for graph applications, to estimate performance and explore the optimization space. We tested the framework on Intel’s new Devcloud platform and achieved speedup up to \(3.4\times \) by applying our framework’s recommended optimization strategy compared to the single pipeline implementation. The framework recommended the best optimization strategy in 90% of the test cases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alon, N., Babai, L., Itai, A.: A fast and simple randomized parallel algorithm for the maximal independent set problem. J. Algorithms 7(4), 567–583 (1986)
Che, S., Beckmann, B.M., Reinhardt, S.K., Skadron, K.: Pannotia: understanding irregular GPGPU graph applications. In: 2013 IEEE International Symposium on Workload Characterization (IISWC) (2013)
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Section 22.4: topological sort. In: Introduction to Algorithms, 2nd edn., pp. 549–552. MIT Press and McGraw-Hill (2001)
Da Silva, B., Braeken, A., D’Hollander, E.H., Touhafi, A.: Performance modeling for FPGAs: extending the roofline model with high-level synthesis tools. Int. J. Reconfigurable Comput. 2013 (2013)
Hassan, M.W., Helal, A.E., Athanas, P.M., Feng, W.C., Hanafy, Y.Y.: Exploring FPGA-specific optimizations for irregular openCL applications. In: 2018 International Conference on ReConFigurable Computing and FPGAs (ReConFig) (2018)
Hasssan, M.W., Athanas, P.M.: Graph analytics on hybrid system (GAHS) case study: Pagerank. In: 2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (2021)
Intel: Intel devcloud for oneAPI projects (2019). https://software.intel.com/en-us/devcloud/oneapi
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)
Rossi, R.A., Ahmed, N.K.: The network data repository with interactive graph analytics and visualization. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (2015). http://networkrepository.com
Umuroglu, Y., Morrison, D., Jahre, M.: Hybrid breadth-first search on a single-chip FPGA-CPU heterogeneous platform. In: 2015 25th International Conference on Field Programmable Logic and Applications (FPL), pp. 1–8, September 2015. https://doi.org/10.1109/FPL.2015.7293939
Wang, Y.: Accelerating Graph Processing on a Shared-Memory FPGA System. Ph.D. thesis, Carnegie Mellon University, Pittsburgh, USA (2018)
Wang, Z., He, B., Zhang, W., Jiang, S.: A performance analysis framework for optimizing openCL applications on FPGAs. In: IEEE International Symposium on High Performance Computer Architecture (HPCA), pp. 114–125, March 2016
Williams, S., Waterman, A., Patterson, D.: Roofline: an insightful visual performance model for multicore architectures. Commun. ACM 52(4), 65–76 (2009)
Zhao, J., Feng, L., Sinha, S., Zhang, W., Liang, Y., He, B.: Performance modeling and directives optimization for high-level synthesis on FPGA. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 39(7) (2019)
Zohouri, H.R., Maruyama, N., Smith, A., Matsuda, M., Matsuoka, S.: Evaluating and optimizing openCL kernels for high performance computing with FPGAs. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 35:1–35:12 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Hassan, M.W., Athanas, P.M., Hanafy, Y.Y. (2021). Domain-Specific Modeling and Optimization for Graph Processing on FPGAs. In: Derrien, S., Hannig, F., Diniz, P.C., Chillet, D. (eds) Applied Reconfigurable Computing. Architectures, Tools, and Applications. ARC 2021. Lecture Notes in Computer Science(), vol 12700. Springer, Cham. https://doi.org/10.1007/978-3-030-79025-7_24
Download citation
DOI: https://doi.org/10.1007/978-3-030-79025-7_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-79024-0
Online ISBN: 978-3-030-79025-7
eBook Packages: Computer ScienceComputer Science (R0)