Abstract:
Accelerators for DNN inference in embedded applications using fixed-point arithmetic are attractive from the perspectives of hardware complexity and power consumption. While techniques have been proposed for DNN inference with constrained values, they typically incur a loss of inference accuracy. We propose instead an inferencing architecture predicated on tuning the weights, with minimal impact on accuracy, to facilitate sharing of shift and add operations across different weight computations in a multiplier-less manner. The distribution of ones and zeros in the binary encoding of the multiplier weights, which bears a highly nonlinear relationship to the weight values, is exploited to share shift-add operations. A systolic array architecture supporting this computation paradigm is developed. Experimental results on hardware savings and power and latency trade-offs are presented and demonstrate the benefits of the proposed scheme.
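To make the multiplier-less scheme concrete, the following is a minimal sketch (not the paper's architecture or RTL) of how a fixed-point weight's binary encoding turns a multiplication into shifts and adds, and how shift results for one activation can be reused across several weights. The function names and the simple cache are illustrative assumptions, not from the paper.

```python
def po2_terms(w):
    """Decompose an integer weight w into signed power-of-two terms
    (shift, sign) from its binary encoding, e.g. 6 -> [(1, +1), (2, +1)],
    so that x * w = sum(sign * (x << shift))."""
    sign = 1 if w >= 0 else -1
    w = abs(w)
    terms, shift = [], 0
    while w:
        if w & 1:
            terms.append((shift, sign))
        w >>= 1
        shift += 1
    return terms


def shift_add_outputs(x, weights):
    """Multiply one activation x by many weights without a multiplier.
    Each shift of x is computed once and cached, so weights whose
    encodings share a bit position share the shift operation
    (a software stand-in for sharing shift hardware)."""
    shifted = {}  # shift amount -> x << shift, computed once
    outs = []
    for w in weights:
        acc = 0
        for shift, sign in po2_terms(w):
            if shift not in shifted:
                shifted[shift] = x << shift  # shared across weights
            acc += sign * shifted[shift]
        outs.append(acc)
    return outs


# Sanity check against ordinary multiplication
x, ws = 3, [6, 11, -9, 4]
assert shift_add_outputs(x, ws) == [x * w for w in ws]
```

In this toy model, tuning a weight so that its encoding reuses bit positions already needed by other weights reduces the number of distinct shifts, which is the kind of sharing the abstract describes; the paper's systolic array realizes this sharing in hardware.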
Date of Conference: 07-10 August 2022
Date Added to IEEE Xplore: 22 August 2022