Abstract
In recent years, convolutional neural networks (CNNs) have been applied to many fields because of their strong ability to extract complex features. However, these CNN models achieve their robustness at the cost of high computational complexity. As a result, many studies have investigated architectures and data flows that optimize throughput and energy efficiency. This paper presents a reused data flow and a shift and difference-add Booth multiplier to reduce energy consumption. The evaluation uses the pre-trained VGG16 model with a batch size of three as a benchmark. The results show that the proposed design reduces the number of state toggles in the Booth multiplier by 1.96 times and reduces DRAM and global buffer accesses to 61.6% and 74.7% of those in prior work, respectively.
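As a rough intuition for the shift-and-add arithmetic such a multiplier builds on, the sketch below models conventional radix-4 (modified) Booth recoding in Python. It is only an illustrative software model under assumed parameters (a 16-bit signed operand width and the hypothetical function name `booth_radix4_multiply`); it does not reproduce the paper's accumulation-aware difference-add optimization, which restructures the hardware operations to cut state toggles.

```python
# Minimal radix-4 (modified) Booth multiplication sketch.
# Illustrative only: this is classic shift-and-add Booth recoding,
# NOT the accumulation-aware difference-add design proposed in the paper.

def booth_radix4_multiply(a: int, b: int, width: int = 16) -> int:
    """Multiply two signed `width`-bit integers via radix-4 Booth recoding."""
    mask = (1 << width) - 1
    b_u = b & mask                      # two's-complement view of the multiplier
    product = 0
    prev_bit = 0                        # implicit bit to the right of b[0]
    for i in range(0, width, 2):
        # Overlapping 3-bit group: (b[i+1], b[i], b[i-1]).
        group = (((b_u >> i) & 0b11) << 1) | prev_bit
        prev_bit = (b_u >> (i + 1)) & 1
        # Recoded digit in {-2, -1, 0, +1, +2}.
        digit = {0b000: 0, 0b001: 1, 0b010: 1, 0b011: 2,
                 0b100: -2, 0b101: -1, 0b110: -1, 0b111: 0}[group]
        product += digit * (a << i)     # shift-and-add of one partial product
    return product

# Quick self-check against exact products.
assert booth_radix4_multiply(-123, 456) == -123 * 456
assert booth_radix4_multiply(31000, -2) == 31000 * -2
```

Each two-bit step recodes the multiplier into a single digit in {−2, −1, 0, +1, +2}, so only one shifted partial product is added or subtracted per step; this halving of partial products is the property that energy-oriented Booth multiplier designs exploit.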
Data Availability
The input dataset is publicly available and detailed output data are given in the manuscript.
Acknowledgements
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Cite this article
Wu, ZD., Ruan, SJ. & Yan, BK. Accumulation-Aware Shift and Difference-Add Booth Multiplier for Energy-Efficient Convolutional Neural Network Inference. Circuits Syst Signal Process 40, 6050–6066 (2021). https://doi.org/10.1007/s00034-021-01751-4