Abstract:
Deep neural networks (DNNs) have gained popularity for their state-of-the-art accuracy and relative ease of use. DNNs rely on a growing variety of matrix multiply operations (e.g., dense to sparse, FP32 to N-bit). We propose an OpenCL-based matrix multiply design template, which enables automated design exploration to generate optimized FPGA matrix accelerators for DNN applications. Given the desired matrix operations (e.g., sparsity, data types), our template rapidly produces performance and area estimates for a variety of design variants and/or FPGA platforms. Upon identifying compelling design points and target platforms, FPGA implementations can then be generated automatically using the Intel® OpenCL™ FPGA SDK. We show the effectiveness of the template with a comparison to hand-tuned RTL, a design space exploration, and a DNN case study.
Date of Conference: 11-13 December 2017
Date Added to IEEE Xplore: 05 February 2018