skip to main content
10.1145/3624354.3630083acmconferencesArticle/Chapter ViewAbstractPublication PagesconextConference Proceedingsconference-collections
poster

Poster: A Fast, Scalable, and Energy-efficient Edge Acceleration Architecture based on GPU Cluster

Published:05 December 2023Publication History

ABSTRACT

GPU-aided servers are widely used to accelerate the increasing computing workload on edge network. However, massive computing requests can cause delays in transmission and processing at edge servers, leading to long response time and high system energy consumption. Moreover, in distributed scenarios, GPUs from different vendors can hardly communicate directly through the network. As a rare open-source GPU architecture, Vortex has been emerged to enable an FPGA accelerator to serve as a PCIe-based soft GPU. On this basis, Vortex GPUs in our edge acceleration architecture further integrate the low-latency RDMA function. Compared with the original Vortex GPU-aided servers cluster, our preliminary study shows that our Vortex GPU cluster is fast, scalable, and energy-efficient.

References

  1. Ching-Hsiang Chu, Xiaoyi Lu, Ammar A. Awan, Hari Subramoni, Bracy Elton, and Dhabaleswar K. Panda. 2019. Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast. IEEE Transactions on Parallel and Distributed Systems 30, 3 (2019), 575--588. https://doi.org/10.1109/TPDS.2018.2867222Google ScholarGoogle ScholarCross RefCross Ref
  2. Blaise Tine, Krishna Praveen Yalamarthy, Fares Elsabbagh, and Kim Hyesoon. 2021. Vortex: Extending the RISC-V ISA for GPGPU and 3D-Graphics. In MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture (Virtual Event, Greece) (MICRO '21). Association for Computing Machinery, New York, NY, USA, 754--766. https://doi.org/10.1145/3466752.3480128Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Hong Zhang, Yupeng Tang, Anurag Khandelwal, and Ion Stoica. 2023. SHEPHERD: Serving DNNs in the Wild. In 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23). USENIX Association, Boston, MA, 787--808. https://www.usenix.org/conference/nsdi23/presentation/zhang-hongGoogle ScholarGoogle Scholar

Index Terms

  1. Poster: A Fast, Scalable, and Energy-efficient Edge Acceleration Architecture based on GPU Cluster

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CoNEXT 2023: Companion of the 19th International Conference on emerging Networking EXperiments and Technologies
      December 2023
      80 pages
      ISBN:9798400704079
      DOI:10.1145/3624354

      Copyright © 2023 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 5 December 2023

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      Overall Acceptance Rate198of789submissions,25%
    • Article Metrics

      • Downloads (Last 12 months)41
      • Downloads (Last 6 weeks)6

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader