Integration of Unsupervised and Supervised Criteria for Deep Neural Networks Training

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 9887)

Abstract

Training deep neural networks has long been a difficult task. Recently, diverse approaches have been proposed to tackle these difficulties, showing that deep models outperform shallow ones in areas such as signal processing, signal classification, and signal segmentation, regardless of the signal type, e.g. video, audio, or images. One of the most important methods is greedy layer-wise unsupervised pre-training followed by a fine-tuning phase. Despite the advantages of this procedure, it does not fit scenarios where real-time learning is needed, such as the adaptation of some time-series models. This paper proposes to couple both phases into one by modifying the loss function to mix the unsupervised and supervised criteria. Benchmark experiments with the MNIST database demonstrate the viability of the idea for simple image tasks, and experiments with time-series forecasting encourage its incorporation into on-line learning approaches. The interest of this method for time-series forecasting is motivated by the study of predictive models for domotic (smart) houses with intelligent control systems.
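
As a rough illustration of this coupling, the sketch below trains a single shared hidden layer with two heads, one reconstructing the input (the unsupervised part) and one predicting the label (the supervised part), and sums both criteria into one loss. It is a minimal sketch in PyTorch, not the authors' implementation: the class name, layer sizes, and the weight lambda_coef (standing in for the \(\varLambda\) of the notes below) are illustrative assumptions.

```python
import torch
import torch.nn as nn

class JointAutoencoderClassifier(nn.Module):
    """One hidden layer shared by an unsupervised head (decoder)
    and a supervised head (classifier). Illustrative architecture,
    not taken from the paper."""
    def __init__(self, n_in=784, n_hidden=256, n_classes=10):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_in, n_hidden), nn.Sigmoid())
        self.decoder = nn.Linear(n_hidden, n_in)          # reconstructs the input
        self.classifier = nn.Linear(n_hidden, n_classes)  # predicts the label

    def forward(self, x):
        h = self.encoder(x)
        return self.classifier(h), self.decoder(h)

model = JointAutoencoderClassifier()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
supervised = nn.CrossEntropyLoss()   # supervised criterion
reconstruction = nn.MSELoss()        # unsupervised (reconstruction) criterion
lambda_coef = 0.1                    # mixing weight; assumed value, not from the paper

def train_step(x, y):
    """One update on a mini-batch of flattened inputs x with labels y."""
    optimizer.zero_grad()
    logits, x_hat = model(x)
    # Both criteria are optimized in a single phase: no separate
    # layer-wise pre-training followed by fine-tuning.
    loss = supervised(logits, y) + lambda_coef * reconstruction(x_hat, x)
    loss.backward()
    optimizer.step()
    return loss.item()
```

Because both terms share the same backward pass, a single gradient step updates the shared representation for both objectives at once, which is what makes this kind of mixed criterion compatible with on-line learning.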

Notes

  1. Note that \(\varLambda = 0\) is equivalent to training without the proposed integrated method.
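
In the sketch after the abstract, this corresponds to setting lambda_coef = 0: the reconstruction term vanishes and the update reduces to plain supervised backpropagation.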

Acknowledgments

This work has been financed by the regional government Generalitat Valenciana under project GV/2015/088 and by Universidad CEU Cardenal Herrera.

Author information

Corresponding author

Correspondence to Francisco Zamora-Martínez.


Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Zamora-Martínez, F., Muñoz-Almaraz, J., Pardo, J. (2016). Integration of Unsupervised and Supervised Criteria for Deep Neural Networks Training. In: Villa, A., Masulli, P., Pons Rivero, A. (eds) Artificial Neural Networks and Machine Learning – ICANN 2016. Lecture Notes in Computer Science, vol 9887. Springer, Cham. https://doi.org/10.1007/978-3-319-44781-0_7

  • DOI: https://doi.org/10.1007/978-3-319-44781-0_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-44780-3

  • Online ISBN: 978-3-319-44781-0
