skip to main content
10.1145/3543622.3573152acmconferencesArticle/Chapter ViewAbstractPublication PagesfpgaConference Proceedingsconference-collections
poster

Graph-OPU: An FPGA-Based Overlay Processor for Graph Neural Networks

Published: 12 February 2023 Publication History

Abstract

Graph Neural Networks (GNNs) have outstanding performance on graph-structured data and have been extensively accelerated by field-programmable gate array (FPGA) in various ways. However, existing accelerators significantly lack flexibility, especially in the following two aspects: 1) Many FPGA-based accelerators only support one GNN model. 2) The processes of re-synthesizing and bitstream re-generating are very time-consuming for new GNN models. To this end, we propose a highly integrated FPGA-based overlay processor for general GNN accelerations named Graph-OPU. Regarding the data structure and operation irregularity, we customize the instruction sets to support irregular operation patterns in the inference process of GNN models. Then, we customize our datapath and optimize the data format in the microarchitecture to take full advantage of high bandwidth memory (HBM). Moreover, we design the computation module to ensure a unified and fully-pipelined process of sparse matrix multiplication (SpMM) and general matrix multiplication (GEMM). Users can avoid the process of FPGA reconfiguration or RTL regeneration for the newly invented GNN models. We implement the hardware prototype on Xilinx Alveo U50 and test the mainstream GNN models with 9 datasets. Graph-OPU can achieve an average of 435× and 18× speedup, while 2013× and 109× better energy efficiency, compared with the Intel I7-12700KF processor and NVIDIA RTX3090 GPU, respectively. To the best of our knowledge, Graph-OPU is the first in-depth study on FPGA-based general processors for GNN acceleration with high speedup and energy efficiency.

Cited By

View all
  • (2023)Efficient Implementation of Activation Function on FPGA for Accelerating Neural Networks2023 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS46773.2023.10181406(1-5)Online publication date: 21-May-2023
  • (2023)FET-OPU: A Flexible and Efficient FPGA-Based Overlay Processor for Transformer Networks2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD)10.1109/ICCAD57390.2023.10323752(1-9)Online publication date: 28-Oct-2023
  • (2023)LTrans-OPU: A Low-Latency FPGA-Based Overlay Processor for Transformer Networks2023 33rd International Conference on Field-Programmable Logic and Applications (FPL)10.1109/FPL60245.2023.00048(283-287)Online publication date: 4-Sep-2023

Index Terms

  1. Graph-OPU: An FPGA-Based Overlay Processor for Graph Neural Networks

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    FPGA '23: Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays
    February 2023
    283 pages
    ISBN:9781450394178
    DOI:10.1145/3543622
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 February 2023

    Check for updates

    Author Tags

    1. field-programmable gate array
    2. graph neural network
    3. high bandwidth memory
    4. overlay processor

    Qualifiers

    • Poster

    Funding Sources

    Conference

    FPGA '23
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 125 of 627 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Efficient Implementation of Activation Function on FPGA for Accelerating Neural Networks2023 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS46773.2023.10181406(1-5)Online publication date: 21-May-2023
    • (2023)FET-OPU: A Flexible and Efficient FPGA-Based Overlay Processor for Transformer Networks2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD)10.1109/ICCAD57390.2023.10323752(1-9)Online publication date: 28-Oct-2023
    • (2023)LTrans-OPU: A Low-Latency FPGA-Based Overlay Processor for Transformer Networks2023 33rd International Conference on Field-Programmable Logic and Applications (FPL)10.1109/FPL60245.2023.00048(283-287)Online publication date: 4-Sep-2023

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media