Abstract:
State-of-the-art convolutional neural network accelerators show growing interest in exploiting bit-level sparsity and eliminating the ineffectual computations on zero bits. However, excessive redundancy and the irregular distribution of nonzero bits limit the real speedup achievable in these accelerators. To address this, we propose an algorithm-architecture codesign, named structured term pruning (STP), to boost the computational efficiency of neural network inference. Specifically, we enhance bit sparsity by guiding the weights toward values with fewer power-of-two terms. Then, we structure the terms with layer-wise group budgets. Retraining is adopted to recover the accuracy drop. We also design the hardware of the group processing element and the fast signed-digit encoder for efficient implementation of STP networks. The system design of STP is realized with a few simple alterations to an input-stationary systolic array design. Extensive evaluation results demonstrate that STP significantly reduces inference computation costs, achieving a 2.35× computational energy saving for the ResNet18 network on the ImageNet dataset.
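To make the core idea concrete, below is a minimal Python sketch of the two algorithmic steps the abstract describes: decomposing quantized integer weights into signed power-of-two terms (canonical signed-digit form) and enforcing a per-group term budget. The function names (`to_signed_digits`, `prune_group`) and the greedy keep-largest-terms policy are illustrative assumptions, not the paper's exact formulation, and the sketch omits the retraining step used to recover accuracy.

```python
import numpy as np

def to_signed_digits(w: int) -> list[int]:
    """Canonical signed-digit (CSD) expansion: digits in {-1, 0, +1},
    least-significant first, using the minimal number of nonzero
    power-of-two terms for the given integer."""
    digits = []
    while w != 0:
        if w & 1:
            d = 2 - (w & 3)  # +1 if w % 4 == 1, -1 if w % 4 == 3
            digits.append(d)
            w -= d
        else:
            digits.append(0)
        w >>= 1
    return digits

def prune_group(weights: np.ndarray, budget: int) -> np.ndarray:
    """Keep only the `budget` largest power-of-two terms across the
    whole group and drop the rest (an illustrative greedy policy;
    the paper's exact budget allocation may differ)."""
    terms = []  # (term magnitude, weight index, signed term value)
    for i, w in enumerate(weights):
        for pos, d in enumerate(to_signed_digits(int(w))):
            if d != 0:
                terms.append((1 << pos, i, d * (1 << pos)))
    terms.sort(key=lambda t: t[0], reverse=True)
    pruned = np.zeros_like(weights)
    for _, i, val in terms[:budget]:
        pruned[i] += val
    return pruned

# Example: a quantized weight group (10 terms total) pruned to 6 terms.
group = np.array([37, -22, 5, 96])
print(prune_group(group, budget=6))  # -> [ 36 -24   0  96]
```

Enforcing the budget per group rather than per weight is what makes the resulting sparsity hardware-friendly: each group processing element then handles a fixed, predictable number of terms, sidestepping the irregular nonzero-bit distribution that limits speedup in prior bit-sparse accelerators.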
Published in: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (Volume 42, Issue 1, January 2023)