Skip to main content

Domain-Specific Modeling and Optimization for Graph Processing on FPGAs

  • Conference paper
  • First Online:
Applied Reconfigurable Computing. Architectures, Tools, and Applications (ARC 2021)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12700))

Included in the following conference series:

Abstract

The use of High Level Synthesis (HLS) tools is on the rise; however, performance modelling research has been mainly focused on regular applications with uniform memory access patterns. These performance models fail to accurately capture the performance of graph applications with irregular memory access patterns. This paper presents a domain-specific performance model targeting graph applications synthesized using HLS tools for FPGAs. The performance model utilizes information from the hardware specification, the application’s kernel, and the graph input. While the compilation process of HLS tools takes hours, the information required by the performance model can be extracted from the intermediate compilation report, which compiles in seconds. The goal of this work is to provide FPGA users with a performance modelling framework for graph applications, to estimate performance and explore the optimization space. We tested the framework on Intel’s new Devcloud platform and achieved speedup up to \(3.4\times \) by applying our framework’s recommended optimization strategy compared to the single pipeline implementation. The framework recommended the best optimization strategy in 90% of the test cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alon, N., Babai, L., Itai, A.: A fast and simple randomized parallel algorithm for the maximal independent set problem. J. Algorithms 7(4), 567–583 (1986)

    Article  MathSciNet  Google Scholar 

  2. Che, S., Beckmann, B.M., Reinhardt, S.K., Skadron, K.: Pannotia: understanding irregular GPGPU graph applications. In: 2013 IEEE International Symposium on Workload Characterization (IISWC) (2013)

    Google Scholar 

  3. Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Section 22.4: topological sort. In: Introduction to Algorithms, 2nd edn., pp. 549–552. MIT Press and McGraw-Hill (2001)

    Google Scholar 

  4. Da Silva, B., Braeken, A., D’Hollander, E.H., Touhafi, A.: Performance modeling for FPGAs: extending the roofline model with high-level synthesis tools. Int. J. Reconfigurable Comput. 2013 (2013)

    Google Scholar 

  5. Hassan, M.W., Helal, A.E., Athanas, P.M., Feng, W.C., Hanafy, Y.Y.: Exploring FPGA-specific optimizations for irregular openCL applications. In: 2018 International Conference on ReConFigurable Computing and FPGAs (ReConFig) (2018)

    Google Scholar 

  6. Hasssan, M.W., Athanas, P.M.: Graph analytics on hybrid system (GAHS) case study: Pagerank. In: 2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (2021)

    Google Scholar 

  7. Intel: Intel devcloud for oneAPI projects (2019). https://software.intel.com/en-us/devcloud/oneapi

  8. Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)

    Google Scholar 

  9. Rossi, R.A., Ahmed, N.K.: The network data repository with interactive graph analytics and visualization. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (2015). http://networkrepository.com

  10. Umuroglu, Y., Morrison, D., Jahre, M.: Hybrid breadth-first search on a single-chip FPGA-CPU heterogeneous platform. In: 2015 25th International Conference on Field Programmable Logic and Applications (FPL), pp. 1–8, September 2015. https://doi.org/10.1109/FPL.2015.7293939

  11. Wang, Y.: Accelerating Graph Processing on a Shared-Memory FPGA System. Ph.D. thesis, Carnegie Mellon University, Pittsburgh, USA (2018)

    Google Scholar 

  12. Wang, Z., He, B., Zhang, W., Jiang, S.: A performance analysis framework for optimizing openCL applications on FPGAs. In: IEEE International Symposium on High Performance Computer Architecture (HPCA), pp. 114–125, March 2016

    Google Scholar 

  13. Williams, S., Waterman, A., Patterson, D.: Roofline: an insightful visual performance model for multicore architectures. Commun. ACM 52(4), 65–76 (2009)

    Article  Google Scholar 

  14. Zhao, J., Feng, L., Sinha, S., Zhang, W., Liang, Y., He, B.: Performance modeling and directives optimization for high-level synthesis on FPGA. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 39(7) (2019)

    Google Scholar 

  15. Zohouri, H.R., Maruyama, N., Smith, A., Matsuda, M., Matsuoka, S.: Evaluating and optimizing openCL kernels for high performance computing with FPGAs. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 35:1–35:12 (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamed W. Hassan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hassan, M.W., Athanas, P.M., Hanafy, Y.Y. (2021). Domain-Specific Modeling and Optimization for Graph Processing on FPGAs. In: Derrien, S., Hannig, F., Diniz, P.C., Chillet, D. (eds) Applied Reconfigurable Computing. Architectures, Tools, and Applications. ARC 2021. Lecture Notes in Computer Science(), vol 12700. Springer, Cham. https://doi.org/10.1007/978-3-030-79025-7_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-79025-7_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-79024-0

  • Online ISBN: 978-3-030-79025-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics