Hybrid-Grained Pruning and Hardware Acceleration for Convolutional Neural Networks


Abstract:

Across various convolutional neural network (CNN) models, sparsity increases as the network deepens, which offers significant potential for model compression and hardware acceleration. In this paper, a dual-factor hybrid-grained pruning method is introduced to strike a good balance between model compression and accuracy preservation. The proposed pruning method combines hardware-friendly unstructured vector-level pruning with structured filter-level pruning to exploit multiple grains of sparsity in CNNs. The architecture of the corresponding hardware accelerator is then proposed based on a row-based convolution dataflow, which fully utilizes the hybrid sparsity to accelerate CNN processing. Experimental results demonstrate that, on VGG16, the proposed method improves the compression rate by 1.08× with only a 0.21% accuracy loss compared with the state-of-the-art filter pruning method, and incurs only a 2.39% hardware resource increase compared with an accelerator without sparsity optimization.
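The two pruning grains described in the abstract can be pictured with a short sketch: structured filter-level pruning removes whole output filters, and vector-level pruning then zeroes fixed-length weight vectors inside the surviving filters. The function name, the L1-norm saliency, the keep ratios, and the vector length below are illustrative assumptions for a minimal sketch, not the paper's actual dual-factor criterion, which the abstract does not detail.

```python
import numpy as np

def hybrid_grained_prune(weights, filter_keep_ratio=0.75,
                         vector_keep_ratio=0.5, vector_len=8):
    """Minimal sketch of two-stage hybrid-grained pruning (illustrative only).

    weights: conv kernel of shape (out_channels, in_channels, kH, kW).
    Stage 1 (structured, filter-level): zero whole output filters with the
    smallest L1 norms.
    Stage 2 (vector-level): inside each surviving filter, split the flattened
    kernel into fixed-length vectors and zero the weakest vectors.
    """
    out_ch = weights.shape[0]
    pruned = weights.copy()

    # --- Stage 1: filter-level pruning by L1 norm (assumed saliency) ---
    filter_norms = np.abs(pruned).reshape(out_ch, -1).sum(axis=1)
    n_keep = max(1, int(round(filter_keep_ratio * out_ch)))
    dropped = set(np.argsort(filter_norms)[: out_ch - n_keep].tolist())
    for f in dropped:
        pruned[f] = 0.0

    # --- Stage 2: vector-level pruning inside surviving filters ---
    for f in range(out_ch):
        if f in dropped:
            continue
        flat = pruned[f].reshape(-1)
        pad = (-len(flat)) % vector_len              # pad to a whole number of vectors
        padded = np.concatenate([flat, np.zeros(pad)])
        vectors = padded.reshape(-1, vector_len)     # view into `padded`
        v_norms = np.abs(vectors).sum(axis=1)
        v_keep = max(1, int(round(vector_keep_ratio * len(vectors))))
        weak = np.argsort(v_norms)[: len(vectors) - v_keep]
        vectors[weak] = 0.0                          # zero the weakest vectors
        pruned[f] = padded[: len(flat)].reshape(pruned[f].shape)

    return pruned

# Example: prune a random 64x32x3x3 conv layer and report sparsity.
if __name__ == "__main__":
    w = np.random.randn(64, 32, 3, 3)
    w_pruned = hybrid_grained_prune(w)
    print("sparsity:", 1.0 - np.count_nonzero(w_pruned) / w_pruned.size)
```

Fixed-length vectors are what makes the unstructured stage hardware-friendly: the accelerator can skip zeroed vectors in whole blocks rather than tracking individual zero weights.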
Date of Conference: 19-22 May 2024
Date Added to IEEE Xplore: 02 July 2024
Conference Location: Singapore, Singapore

