skip to main content
10.1145/3489525.3511681acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
short-paper

Extending SYCL's Programming Paradigm with Tensor-based SIMD Abstractions

Authors Info & Claims
Published:09 April 2022Publication History

ABSTRACT

Heterogeneous computing has emerged as an important method for supporting more than one kind of processors or accelerators in a program. There is generally a trade off between source code portability and device performance for heterogeneous programming. Thus, new programming abstractions to assist programmers to reduce their development efforts while minimizing performance penalties is extremely valuable.

The Khronos SYCL standard defines an abstract single-programmultiple- data (SPMD) programming model for heterogeneous computing. This paper presents a language extension on top of the SYCL standard to enable flexibility for programmers. We introduce a set of single-instruction-multiple-data (SIMD) abstractions based on multi-dimensional arrays (Tensors) in conjuction with the existing SPMD programming paradigm.

Our work is based on a C++ language and a set new of LLVM intermediate representation (IR) for representing the SIMD programs. This also includes a set of custom optimization passes that performs instruction lowering, automatic address allocation, and synchronization insertion. We show how our work can be used in conjunction with conventional SYCL SPMD programming for various benchmarks such as general matrix multiplication (GEMM) and lower upper (LU) inverse and evaluate its hardware utilization performance.

References

  1. 2021. Intel's LLVM Project. https://github.com/intel/llvm/.Google ScholarGoogle Scholar
  2. 2021. SYCL Khronos Group. https://khronos.org/sycl/.Google ScholarGoogle Scholar
  3. Wilson Feng, Rasool Maghareh, and Kai-Ting AmyWang. 2021. Extending DPC++ with Support for Huawei Ascend AI Chipset. In InternationalWorkshop on OpenCL (Munich, Germany) (IWOCL'21). Association for Computing Machinery, New York, NY, USA, Article 13, 4 pages. https://doi.org/10.1145/3456669.3456684Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Heng Liao, Jiajin Tu, Jing Xia, and Xiping Zhou. 2019. DaVinci: A Scalable Architecture for Neural Network Computing. In 2019 IEEE Hot Chips 31 Symposium (HCS). 1--44. https://doi.org/10.1109/HOTCHIPS.2019.8875654Google ScholarGoogle Scholar
  5. Florent Lopez and Theo Mary. 2020. Mixed Precision LU Factorization on GPU Tensor Cores: Reducing Data Movement and Memory Footprint. Technical Report ICL-UT-20--13.Google ScholarGoogle Scholar
  6. Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, and Jeremy Kepner. 2021. AI Accelerator Survey and Trends. arXiv:2109.08957 [cs.AR]Google ScholarGoogle Scholar
  7. Philippe Tillet, H. T. Kung, and David Cox. 2019. Triton: An Intermediate Language and Compiler for Tiled Neural Network Computations. In Proceedings of the 3rd ACM SIGPLAN InternationalWorkshop on Machine Learning and Programming Languages (Phoenix, AZ, USA) (MAPL 2019). Association for Computing Machinery, New York, NY, USA, 10 19. https://doi.org/10.1145/3315508.3329973Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Extending SYCL's Programming Paradigm with Tensor-based SIMD Abstractions

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICPE '22: Proceedings of the 2022 ACM/SPEC on International Conference on Performance Engineering
        April 2022
        242 pages
        ISBN:9781450391436
        DOI:10.1145/3489525

        Copyright © 2022 Owner/Author

        This work is licensed under a Creative Commons Attribution International 4.0 License.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 9 April 2022

        Check for updates

        Qualifiers

        • short-paper

        Acceptance Rates

        ICPE '22 Paper Acceptance Rate14of58submissions,24%Overall Acceptance Rate252of851submissions,30%

        Upcoming Conference

      • Article Metrics

        • Downloads (Last 12 months)44
        • Downloads (Last 6 weeks)5

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader