DOI: 10.1145/3287624.3287628

Simulate-the-hardware: training accurate binarized neural networks for low-precision neural accelerators

Published: 21 January 2019

Abstract

This work investigates how to effectively train binarized neural networks (BNNs) for specialized low-precision neural accelerators. When BNNs are mapped onto accelerators that adopt fixed-point feature-data representations and binary parameters, the short fixed-point coding causes operation overflow, so inference results computed by deep learning frameworks on CPUs/GPUs become inconsistent with those computed by the accelerators. This creates a large deviation between the training environment and the inference implementation, and causes potential accuracy losses when models are deployed on the accelerators. We therefore present a series of methods that contain the overflow phenomenon and enable typical deep learning frameworks such as TensorFlow to train BNNs that run on the specialized accelerators with high accuracy and fast convergence.

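To make the abstract's idea concrete, below is a minimal, hypothetical TensorFlow sketch (not the authors' code) of simulating the hardware during training: binary weights are applied with a straight-through estimator, and the layer's accumulator is wrapped into a signed fixed-point range in the forward pass, so training sees the same overflow the accelerator would produce. The bit width `n_bits` and the two's-complement wrap-around behavior are assumptions; a saturating accelerator would clip instead.

```python
import tensorflow as tf

def binarize_ste(x):
    # Forward: constrain values to +/-1 (zero mapped to +1, the common BNN convention).
    # Backward: straight-through estimator, i.e. identity gradient.
    b = tf.where(x >= 0, tf.ones_like(x), -tf.ones_like(x))
    return x + tf.stop_gradient(b - x)

def simulate_overflow(acc, n_bits=8):
    # Wrap the accumulator into the signed n-bit range [-2^(n-1), 2^(n-1) - 1],
    # mimicking two's-complement wrap-around (an assumption; the paper's
    # accelerator may handle overflow differently).
    # The straight-through trick lets gradients pass as if no wrap occurred.
    low, span = -(2.0 ** (n_bits - 1)), 2.0 ** n_bits
    wrapped = tf.math.floormod(acc - low, span) + low
    return acc + tf.stop_gradient(wrapped - acc)

def binary_dense(x, w_real, n_bits=8):
    # Dense layer with binary weights whose accumulation overflows during
    # training the way the fixed-point hardware would at inference.
    acc = tf.matmul(x, binarize_ste(w_real))
    return simulate_overflow(acc, n_bits)
```

Swapping tf.clip_by_value in for the floormod line would model a saturating accumulator instead; either way, the point is that the training graph and the accelerator share the same low-precision numerics.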

      Published In

      ASPDAC '19: Proceedings of the 24th Asia and South Pacific Design Automation Conference
      January 2019
      794 pages
ISBN: 9781450360074
DOI: 10.1145/3287624

      In-Cooperation

      • IEICE ESS: Institute of Electronics, Information and Communication Engineers, Engineering Sciences Society
      • IEEE CAS
      • IEEE CEDA
      • IPSJ SIG-SLDM: Information Processing Society of Japan, SIG System LSI Design Methodology

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. binarized neural networks
      2. containing
      3. overflow
      4. simulating

      Qualifiers

      • Research-article

      Funding Sources

      • National Natural Science Foundation of China
      • Strategic Priority Research Program of the Chinese Academy of Sciences
      • Beijing Municipal Science & Technology Commission

      Conference

      ASPDAC '19

      Acceptance Rates

Overall acceptance rate: 466 of 1,454 submissions (32%)

Cited By

• (2022) Document image analysis and recognition: a survey. Computer Optics 46(4), 567-589. DOI: 10.18287/2412-6179-CO-1020
• (2021) ELC-ECG: Efficient LSTM Cell for ECG Classification based on Quantized Architecture. 2021 IEEE International Symposium on Circuits and Systems (ISCAS), 1-5. DOI: 10.1109/ISCAS51556.2021.9401261
• (2021) ResNet-like Architecture with Low Hardware Requirements. 2020 25th International Conference on Pattern Recognition (ICPR), 6204-6211. DOI: 10.1109/ICPR48806.2021.9413186
• (2021) Bipolar Morphological Neural Networks: Gate-Efficient Architecture for Computer Vision. IEEE Access 9, 97569-97581. DOI: 10.1109/ACCESS.2021.3094484
