Skip to main content

Comparison of Deep Learning Libraries on the Problem of Handwritten Digit Classification

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 542))

Abstract

This paper presents a comparative analysis of several popular and freely available deep learning frameworks. We compare functionality and usability of the frameworks trying to solve popular computer vision problems like hand-written digit recognition. Four libraries have been chosen for the detailed study: Caffe, Pylearn2, Torch, and Theano. We give a brief description of these libraries, consider key features and capabilities, and provide case studies. We also investigate the performance of the libraries. This study allows making a decision which deep learning framework suites us best and will be used for our future research.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Kustikova, V.D., Druzhkov, P.N.: A survey of deep learning methods and software for image classification and object detection. In: Proceedings of the 9th Open German-Russian Workshop on Pattern Recognition and Image Understanding (2014)

    Google Scholar 

  2. Caffe. http://caffe.berkeleyvision.org

  3. Pylearn2. http://deeplearning.net/software/pylearn2

  4. Torch. http://www.torch.ch

  5. Theano. http://deeplearning.net/software/theano

  6. Hinton, G.E.: Learning multiple layers of representation. Trends Cogn. Sci. 11, 428–434 (2007)

    Article  Google Scholar 

  7. LeCun, Y., Kavukcuoglu, K., Farabet, C.: Convolutional networks and applications in vision. In: Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS), pp. 253–256 (2010)

    Google Scholar 

  8. Hayat, M., Bennamoun, M., An, S.: Learning non-linear reconstruction models for image set classification. In: Proceedings of the IEEE Conference on CVPR (2014)

    Google Scholar 

  9. Restricted Boltzmann Machines (RBMs). http://www.deeplearning.net/tutorial/rbm.html

  10. DeapLearnToolbox. https://github.com/rasmusbergpalm/DeepLearnToolbox

  11. Deepnet Library. https://github.com/nitishsrivastava/deepnet

  12. DeepMat Library. https://github.com/kyunghyuncho/deepmat

  13. Package Darch. http://cran.r-project.org/web/packages/darch/index.html

  14. nnForge Library. http://milakov.github.io/nnForge

  15. CXXNET. https://github.com/antinucleon/cxxnet

  16. Cuda-convnet - high-performance C++/CUDA implementation of convolutional neural networks. http://code.google.com/p/cuda-convnet

  17. Cuda CNN Library. http://www.mathworks.com/matlabcentral/fileexchange/24291-cnn-convolutional-neural-network-class

  18. Caffes repository. https://github.com/BVLC/caffe

  19. Unofficial version of library Caffe to the Windows. https://github.com/niuzhiheng/caffe

  20. NVIDIA(R) cuDNN - GPU Accelerated Machine Learning. https://developer.nvidia.com/cuDNN

  21. Bottou, L.: Stochastic gradient descent tricks. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) NN: Tricks of the Trade. LNCS, vol. 7700, 2nd edn, pp. 421–436. Springer, Heidelberg (2012). http://research.microsoft.com/pubs/192769/tricks-2012.pdf

    Chapter  Google Scholar 

  22. Duchi, J., Hazan, E., Singer, Y.: Adaptive subgradient methods for online learning and stochastic optimization. J. ML Res. 12, 2121–2159 (2011)

    MathSciNet  MATH  Google Scholar 

  23. Sutskever, I., Martens, J., Dahl, G., Hinton, G.: On the importance of initialization and momentum in deep learning. In: Proceedings of the 30th International Conference on ML (2013)

    Google Scholar 

  24. The MNIST database of handwritten digits. http://yann.lecun.com/exdb/mnist

  25. Pylearn2s repository. https://github.com/lisa-lab/pylearn2

  26. Torch cheatsheet. https://github.com/torch/torch7/wiki/Cheatsheet

  27. Example of logistic regression (Theano). https://github.com/ITLab-Vision/DNN-develop/blob/master/theano/src/mnist/logistic_sgd.py

  28. Ciresan, D., Meier, U., Schmidhuber J.: Multi-column Deep Neural Networks for Image Classification. http://arxiv.org/pdf/1202.2745v1.pdf

Download references

Acknowledgments

The work has been performed in Information Technologies Laboratory at Computational Mathematics and Cybernetics Department, Lobachevsky State University of Nizhni Novgorod under support by Itseez Co. and Argus Center for Computer Vision Co. Ltd.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Valentina Kustikova .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Kruchinin, D., Dolotov, E., Kornyakov, K., Kustikova, V., Druzhkov, P. (2015). Comparison of Deep Learning Libraries on the Problem of Handwritten Digit Classification. In: Khachay, M., Konstantinova, N., Panchenko, A., Ignatov, D., Labunets, V. (eds) Analysis of Images, Social Networks and Texts. AIST 2015. Communications in Computer and Information Science, vol 542. Springer, Cham. https://doi.org/10.1007/978-3-319-26123-2_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26123-2_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26122-5

  • Online ISBN: 978-3-319-26123-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics