Domain-Specific Modeling and Optimization for Graph Processing on FPGAs

Hassan, Mohamed W.; Athanas, Peter M.; Hanafy, Yasser Y.

doi:10.1007/978-3-030-79025-7_24

Mohamed W. Hassan¹²,
Peter M. Athanas¹² &
Yasser Y. Hanafy¹²

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12700))

Included in the following conference series:

International Symposium on Applied Reconfigurable Computing

872 Accesses
1 Citations

Abstract

The use of High Level Synthesis (HLS) tools is on the rise; however, performance modelling research has been mainly focused on regular applications with uniform memory access patterns. These performance models fail to accurately capture the performance of graph applications with irregular memory access patterns. This paper presents a domain-specific performance model targeting graph applications synthesized using HLS tools for FPGAs. The performance model utilizes information from the hardware specification, the application’s kernel, and the graph input. While the compilation process of HLS tools takes hours, the information required by the performance model can be extracted from the intermediate compilation report, which compiles in seconds. The goal of this work is to provide FPGA users with a performance modelling framework for graph applications, to estimate performance and explore the optimization space. We tested the framework on Intel’s new Devcloud platform and achieved speedup up to \(3.4\times \) by applying our framework’s recommended optimization strategy compared to the single pipeline implementation. The framework recommended the best optimization strategy in 90% of the test cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alon, N., Babai, L., Itai, A.: A fast and simple randomized parallel algorithm for the maximal independent set problem. J. Algorithms 7(4), 567–583 (1986)
Article MathSciNet Google Scholar
Che, S., Beckmann, B.M., Reinhardt, S.K., Skadron, K.: Pannotia: understanding irregular GPGPU graph applications. In: 2013 IEEE International Symposium on Workload Characterization (IISWC) (2013)
Google Scholar
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Section 22.4: topological sort. In: Introduction to Algorithms, 2nd edn., pp. 549–552. MIT Press and McGraw-Hill (2001)
Google Scholar
Da Silva, B., Braeken, A., D’Hollander, E.H., Touhafi, A.: Performance modeling for FPGAs: extending the roofline model with high-level synthesis tools. Int. J. Reconfigurable Comput. 2013 (2013)
Google Scholar
Hassan, M.W., Helal, A.E., Athanas, P.M., Feng, W.C., Hanafy, Y.Y.: Exploring FPGA-specific optimizations for irregular openCL applications. In: 2018 International Conference on ReConFigurable Computing and FPGAs (ReConFig) (2018)
Google Scholar
Hasssan, M.W., Athanas, P.M.: Graph analytics on hybrid system (GAHS) case study: Pagerank. In: 2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (2021)
Google Scholar
Intel: Intel devcloud for oneAPI projects (2019). https://software.intel.com/en-us/devcloud/oneapi
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)
Google Scholar
Rossi, R.A., Ahmed, N.K.: The network data repository with interactive graph analytics and visualization. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (2015). http://networkrepository.com
Umuroglu, Y., Morrison, D., Jahre, M.: Hybrid breadth-first search on a single-chip FPGA-CPU heterogeneous platform. In: 2015 25th International Conference on Field Programmable Logic and Applications (FPL), pp. 1–8, September 2015. https://doi.org/10.1109/FPL.2015.7293939
Wang, Y.: Accelerating Graph Processing on a Shared-Memory FPGA System. Ph.D. thesis, Carnegie Mellon University, Pittsburgh, USA (2018)
Google Scholar
Wang, Z., He, B., Zhang, W., Jiang, S.: A performance analysis framework for optimizing openCL applications on FPGAs. In: IEEE International Symposium on High Performance Computer Architecture (HPCA), pp. 114–125, March 2016
Google Scholar
Williams, S., Waterman, A., Patterson, D.: Roofline: an insightful visual performance model for multicore architectures. Commun. ACM 52(4), 65–76 (2009)
Article Google Scholar
Zhao, J., Feng, L., Sinha, S., Zhang, W., Liang, Y., He, B.: Performance modeling and directives optimization for high-level synthesis on FPGA. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 39(7) (2019)
Google Scholar
Zohouri, H.R., Maruyama, N., Smith, A., Matsuda, M., Matsuoka, S.: Evaluating and optimizing openCL kernels for high performance computing with FPGAs. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 35:1–35:12 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

ECE department, Virginia Tech, Blacksburg, USA
Mohamed W. Hassan, Peter M. Athanas & Yasser Y. Hanafy

Authors

Mohamed W. Hassan
View author publications
You can also search for this author in PubMed Google Scholar
Peter M. Athanas
View author publications
You can also search for this author in PubMed Google Scholar
Yasser Y. Hanafy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohamed W. Hassan .

Editor information

Editors and Affiliations

IRISA, University of Rennes 1, Rennes, France
Steven Derrien
Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Frank Hannig
INESC-ID, Lisboa, Portugal
Pedro C. Diniz
ENSSAT, University of Rennes 1, Lannion, France
Daniel Chillet

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hassan, M.W., Athanas, P.M., Hanafy, Y.Y. (2021). Domain-Specific Modeling and Optimization for Graph Processing on FPGAs. In: Derrien, S., Hannig, F., Diniz, P.C., Chillet, D. (eds) Applied Reconfigurable Computing. Architectures, Tools, and Applications. ARC 2021. Lecture Notes in Computer Science(), vol 12700. Springer, Cham. https://doi.org/10.1007/978-3-030-79025-7_24

Download citation

DOI: https://doi.org/10.1007/978-3-030-79025-7_24
Published: 23 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-79024-0
Online ISBN: 978-3-030-79025-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Domain-Specific Modeling and Optimization for Graph Processing on FPGAs