Abstract
Restricted Boltzmann Machines (RBMs) are generative neural networks that have recently received much attention. In particular, choosing the appropriate number of hidden units is important, as too few can limit their representational power. According to the literature, RBMs require numerous hidden units to approximate any distribution properly. In this paper, we present an experiment to determine whether such a number of hidden units is required in a classification context. We then propose an incremental algorithm that trains RBMs by reusing the previously trained parameters, using a trade-off measure to determine the appropriate number of hidden units. Results on the MNIST and OCR letters databases show that a number of hidden units one order of magnitude smaller than the literature estimate suffices to achieve similar performance. Moreover, the proposed algorithm allows the required number of hidden units to be estimated without training many RBMs from scratch.
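The incremental scheme the abstract outlines can be sketched roughly as follows. This is an illustrative assumption of ours, not the authors' implementation: it uses a Bernoulli-Bernoulli RBM trained with one-step contrastive divergence (CD-1), and a `grow` step that keeps the trained weights and appends freshly initialized hidden units. The paper's trade-off measure for deciding when to stop growing is omitted here.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Bernoulli-Bernoulli RBM trained with one-step contrastive divergence."""

    def __init__(self, n_visible, n_hidden):
        self.W = rng.normal(0.0, 0.01, size=(n_visible, n_hidden))
        self.b = np.zeros(n_visible)   # visible biases
        self.c = np.zeros(n_hidden)    # hidden biases

    def cd1_step(self, v0, lr=0.1):
        """One CD-1 parameter update on a batch of binary visible vectors v0."""
        h0 = sigmoid(v0 @ self.W + self.c)                 # positive phase
        h0_sample = (rng.random(h0.shape) < h0).astype(float)
        v1 = sigmoid(h0_sample @ self.W.T + self.b)        # reconstruction
        h1 = sigmoid(v1 @ self.W + self.c)                 # negative phase
        n = len(v0)
        self.W += lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b += lr * (v0 - v1).mean(axis=0)
        self.c += lr * (h0 - h1).mean(axis=0)

    def grow(self, n_new):
        """Add n_new hidden units while reusing all previously trained parameters."""
        n_visible = self.W.shape[0]
        self.W = np.hstack([self.W, rng.normal(0.0, 0.01, size=(n_visible, n_new))])
        self.c = np.concatenate([self.c, np.zeros(n_new)])

# Toy usage: train a small RBM, then grow it and continue training.
data = (rng.random((32, 16)) > 0.5).astype(float)
rbm = RBM(n_visible=16, n_hidden=4)
for _ in range(5):
    rbm.cd1_step(data)
rbm.grow(4)          # incremental step: previous weights are kept
for _ in range(5):
    rbm.cd1_step(data)
```

The point of the `grow` step is that each larger model starts from the smaller one's parameters, so candidate sizes can be evaluated without retraining every RBM from scratch.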
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
Cite this paper
Romero, A., Gatta, C. (2013). Do We Really Need All These Neurons?. In: Sanches, J.M., Micó, L., Cardoso, J.S. (eds) Pattern Recognition and Image Analysis. IbPRIA 2013. Lecture Notes in Computer Science, vol 7887. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38628-2_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38627-5
Online ISBN: 978-3-642-38628-2