
Design and Analysis of a Neural Network Inference Engine Based on Adaptive Weight Compression


Abstract:

Neural networks generally require significant memory capacity and bandwidth to store and access a large number of synaptic weights. This paper presents the design of an energy-efficient neural network inference engine based on adaptive weight compression using a JPEG image encoding algorithm. To maximize the compression ratio with minimal accuracy loss, the quality factor of the JPEG encoder is adaptively controlled depending on the accuracy impact of each block. With 1% accuracy loss, the proposed approach achieves 63.4× compression for a multilayer perceptron (MLP) and 31.3× for LeNet-5 with the MNIST dataset, and 15.3× for AlexNet and 10.2× for ResNet-50 with ImageNet. The reduced memory requirement leads to higher throughput and lower energy for neural network inference (3× effective memory bandwidth and 22× lower system energy for MLP).
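
The abstract's key mechanism is per-block JPEG encoding of weight matrices, with the quality factor chosen adaptively from each block's accuracy impact. Below is a minimal sketch of that idea, not the authors' implementation: it uses the standard IJG luminance quantization table and quality scaling, treats a max-absolute reconstruction-error budget as a stand-in for the paper's per-block accuracy metric, and assumes weights are rescaled to an 8-bit-like range before encoding, as a fixed-point front end would do. The function names and the `err_budget` parameter are illustrative.

```python
import numpy as np
from scipy.fft import dctn, idctn

# Standard JPEG luminance quantization table; the base for all quality factors.
JPEG_Q = np.array([
    [16, 11, 10, 16, 24, 40, 51, 61],
    [12, 12, 14, 19, 26, 58, 60, 55],
    [14, 13, 16, 24, 40, 57, 69, 56],
    [14, 17, 22, 29, 51, 87, 80, 62],
    [18, 22, 37, 56, 68, 109, 103, 77],
    [24, 35, 55, 64, 81, 104, 113, 92],
    [49, 64, 78, 87, 103, 121, 120, 101],
    [72, 92, 95, 98, 112, 100, 103, 99],
], dtype=np.float64)

def scaled_table(quality):
    """Scale the base table by the JPEG quality factor (IJG convention)."""
    quality = max(1, min(100, quality))
    s = 5000 / quality if quality < 50 else 200 - 2 * quality
    return np.clip(np.round(JPEG_Q * s / 100.0), 1, None)

def roundtrip_block(block, quality):
    """Lossy DCT -> quantize -> dequantize -> inverse DCT for one 8x8 block."""
    table = scaled_table(quality)
    coeffs = dctn(block, norm='ortho')
    return idctn(np.round(coeffs / table) * table, norm='ortho')

def adaptive_compress(weights, err_budget=0.01, qualities=(5, 10, 25, 50, 75, 90)):
    """Per 8x8 block, pick the lowest JPEG quality whose reconstruction error
    stays within err_budget (a stand-in for the paper's per-block accuracy
    impact), and return the weights as the inference engine would see them."""
    scale = 127.0 / max(np.abs(weights).max(), 1e-12)  # map to 8-bit-like range
    h, w = (d - d % 8 for d in weights.shape)          # whole 8x8 blocks only
    out = weights.copy()
    for i in range(0, h, 8):
        for j in range(0, w, 8):
            block = weights[i:i+8, j:j+8] * scale
            for q in qualities:  # try the most aggressive (lowest) quality first
                rec = roundtrip_block(block, q)
                if np.abs(rec - block).max() / scale <= err_budget:
                    break        # falls back to the highest quality if none pass
            out[i:i+8, j:j+8] = rec / scale
    return out
```

In the paper, the per-block control signal is the measured accuracy impact rather than a raw reconstruction error, and the quantized DCT coefficients would additionally be entropy-coded (run-length and Huffman coding, as in JPEG) to realize the reported compression ratios; the sketch only shows the lossy round trip whose output the inference engine computes with.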
Page(s): 109 - 121
Date of Publication: 02 February 2018

