RV-GEMM: Neural Network Inference Acceleration with Near-Memory GEMM Instructions on RISC-V
Abstract
References
Index Terms
- RV-GEMM: Neural Network Inference Acceleration with Near-Memory GEMM Instructions on RISC-V
Recommendations
Automatic generation of fast BLAS3-GEMM: a portable compiler approach
CGO '17: Proceedings of the 2017 International Symposium on Code Generation and OptimizationGEMM is the main computational kernel in BLAS3. Its micro-kernel is either hand-crafted in assembly code or generated from C code by general-purpose compilers (guided by architecture-specific directives or auto-tuning). Therefore, either performance or ...
A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices
We present an interface and an implementation of the General Matrix Multiply (GEMM) routine for multiple small matrices processed simultaneously on NVIDIA graphics processing units (GPUs). We focus on matrix sizes under 16. The implementation can be ...
A Reconfigurable Architecture for Binary Acceleration of Loops with Memory Accesses
This article presents a reconfigurable hardware/software architecture for binary acceleration of embedded applications. A Reconfigurable Processing Unit (RPU) is used as a coprocessor of the General Purpose Processor (GPP) to accelerate the execution of ...
Comments
Information & Contributors
Information
Published In
![cover image ACM Conferences](/cms/asset/27ee1c89-e18e-43cb-863d-5336f742f02c/3649153.cover.jpg)
Sponsors
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Extended-abstract
- Research
- Refereed limited
Conference
Acceptance Rates
Upcoming Conference
- Sponsor:
- sigmicro
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 171Total Downloads
- Downloads (Last 12 months)171
- Downloads (Last 6 weeks)27
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in