POLYBiNN: Binary Inference Engine for Neural Networks using Decision Trees

Published in: Journal of Signal Processing Systems

Abstract

Convolutional Neural Networks (CNNs) and Deep Neural Networks (DNNs) have gained significant popularity in many classification and regression applications. The massive computation and memory requirements of DNN and CNN architectures pose particular challenges for their FPGA implementation. Moreover, programming FPGAs requires hardware-specific knowledge that many machine-learning researchers do not possess. To make the power and versatility of FPGAs available to a wider deep learning user community and to improve DNN design efficiency, we introduce POLYBiNN, an efficient FPGA-based inference engine for DNNs and CNNs. POLYBiNN is composed of a stack of decision trees, which are inherently binary classifiers, and it utilizes AND-OR gates instead of multipliers and accumulators. POLYBiNN is a memory-free inference engine that drastically cuts hardware costs. We also propose a tool for the automatic generation of a low-level hardware description of the trained POLYBiNN for a given application. We evaluate POLYBiNN and the tool on several datasets that are typically handled with fully connected layers. On the MNIST dataset, when implemented on a ZYNQ-7000 ZC706 FPGA, the system achieves a throughput of up to 100 million image classifications per second with 90 ns latency and 97.26% accuracy. Moreover, POLYBiNN consumes 8× less power than the best previously published implementations, and it does not require any memory access. We also show how POLYBiNN can be used instead of the fully connected layers of a CNN and apply this approach to the CIFAR-10 dataset.
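To make the core idea concrete, the sketch below shows in Python, rather than the authors' actual hardware-generation flow, how a decision tree over binarized inputs collapses into a sum-of-products boolean expression that can be evaluated with AND/OR logic alone, with no multipliers, accumulators, or memory lookups. All names (`Node`, `Leaf`, `flatten_to_sop`, `infer`) and the tiny example tree are illustrative assumptions, not POLYBiNN's data structures.

```python
from dataclasses import dataclass
from typing import Union

@dataclass
class Leaf:
    label: int  # 1 if this leaf votes for the target class, else 0

@dataclass
class Node:
    feature: int               # index of the binary input bit tested here
    low: Union["Node", Leaf]   # branch taken when the bit is 0
    high: Union["Node", Leaf]  # branch taken when the bit is 1

def flatten_to_sop(node, path=()):
    """Collect every root-to-leaf path that ends in a positive leaf.

    Each path is one product (AND) term, a tuple of (feature, value)
    literals; the tree's decision is the OR of all such terms.
    """
    if isinstance(node, Leaf):
        return [path] if node.label == 1 else []
    return (flatten_to_sop(node.low, path + ((node.feature, 0),)) +
            flatten_to_sop(node.high, path + ((node.feature, 1),)))

def infer(terms, bits):
    """Evaluate the sum-of-products using only AND/OR logic, no MACs."""
    return any(all(bits[f] == v for f, v in term) for term in terms)

# Tiny example: a 2-level tree over 3 binary input features.
tree = Node(0, Leaf(0), Node(2, Leaf(1), Leaf(0)))
terms = flatten_to_sop(tree)    # [((0, 1), (2, 0))], i.e. x0 AND NOT x2
print(infer(terms, [1, 0, 0]))  # True
print(infer(terms, [1, 0, 1]))  # False
```

Each AND term of such an expression maps naturally onto FPGA LUT logic, which is consistent with the abstract's claim of multiplier-free, memory-free inference; the exact mapping and the stacking of multiple trees are described in the article itself.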



Acknowledgments

The authors would like to thank Safa Berrima, Imad Benacer, Jeferson Santiago da Silva, Thibaut Stimpfling and Thomas Luinaud for their insightful comments.

Author information

Corresponding author

Correspondence to Ahmed M. Abdelsalam.



About this article

Cite this article

Abdelsalam, A.M., Elsheikh, A., Chidambaram, S. et al. POLYBiNN: Binary Inference Engine for Neural Networks using Decision Trees. J Sign Process Syst 92, 95–107 (2020). https://doi.org/10.1007/s11265-019-01453-w

