Fast and Stable Learning Utilizing Singular Regions of Multilayer Perceptron

Satoh, Seiya; Nakano, Ryohei

doi:10.1007/s11063-013-9283-z

Fast and Stable Learning Utilizing Singular Regions of Multilayer Perceptron

Published: 03 February 2013

Volume 38, pages 99–115, (2013)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

Seiya Satoh¹ &
Ryohei Nakano¹

263 Accesses
11 Citations
Explore all metrics

Abstract

In the parameter space of MLP(J), multilayer perceptron with J hidden units, there exist flat areas called singular regions created by applying reducibility mappings to the optimal solution of MLP(\(J-1\)). Since such singular regions cause serious stagnation of learning, a learning method to avoid singular regions has been desired. However, such avoiding does not guarantee the quality of the final solutions. This paper proposes a new learning method which does not avoid but makes good use of singular regions to stably and successively find excellent solutions commensurate with MLP(J). The proposed method worked well in our experiments using artificial and real data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Scientific Machine Learning Through Physics–Informed Neural Networks: Where we are and What’s Next

Article Open access 26 July 2022

Development and Application of Artificial Neural Network

Article 30 December 2017

Fundamentals of Artificial Neural Networks and Deep Learning

References

Amari S (1998) Natural gradient works efficiently in learning. Neural Comput 10(2):251–276
Article MathSciNet Google Scholar
Amari S, Park H, Fukumizu K (2000) Adaptive method of realizing natural gradient learning for multilayer perceptrons. Neural Comput 12(6):1399–1409
Article Google Scholar
Cousseau F, Oseki T, Amari S (2008) Dynamics of learning in multilayer perceptrons near singularities. IEEE Trans Neural Networks 19(8):1313–1328
Article Google Scholar
Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York
MATH Google Scholar
Fukumizu K, Amari S (2000) Local minima and plateaus in hierarchical structure of multilayer perceptrons. Neural Netw 1(3):317–327
Article Google Scholar
Hamey LGC (1998) XOR has no local minima: a case study in neural network error surface. Neural Netw 11(4):669–681
Article Google Scholar
Hecht-Nielsen H (1990) Neurocomputing. Addison–Wesley Publishing Company, Reading
Google Scholar
Minnett RCJ, Smith AT, Hecht-Nielsen R (2011) Neural network tomography: network replication from output surface geometry. Neural Netw 24(5):484–492
Article Google Scholar
Luenberger DG (1984) Linear and nonlinear programming. Addison–Wesley Publishing Company, Reading
Nakano R, Saito K (2002) Discovering polynomials to fit multivariate data having numeric and nominal variables. LNAI 2281:482–493
Google Scholar
Nakano R, Satoh S, Ohwaki T (2011) Learning method utilizing singular region of multilayer perceptron. In: Proceedings of the 3rd International Conference on Neural Computation Theory and Applications, Paris, pp. 106–111
Saito K, Nakano R (1997) Partial BFGS update and efficient step-length calculation for three-layer neural networks. Neural Comput 9(1):239–257
Article Google Scholar
Sussmann HJ (1992) Uniqueness of the weights for minimal feedforward nets with a given input-output map. Neural Netw 5(4):589–593
Article Google Scholar
Wan W (2006) Implementing online natural gradient learning: problems and solutions. IEEE Trans Neural Netw 17(2):317–329
Article Google Scholar
Watanabe S (2008) A formula of equations of states in singular learning machines. In: Proceedings of the International Joint Conference on Neural Networks 2008, Hong Kong, pp. 2099–2106
Watanabe S (2009) Algebraic geometry and statistical learning theory. Cambridge University Press, Cambridge
Book MATH Google Scholar

Download references

Acknowledgments

This work was supported by Grants-in-Aid for Scientific Research (C) 22500212 and Chubu University Grant 24IS27A.

Author information

Authors and Affiliations

Department of Computer Science, Chubu University, 1200 Matsumoto-cho, Kasugai, 487-8501, Japan
Seiya Satoh & Ryohei Nakano

Authors

Seiya Satoh
View author publications
You can also search for this author in PubMed Google Scholar
Ryohei Nakano
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ryohei Nakano.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Satoh, S., Nakano, R. Fast and Stable Learning Utilizing Singular Regions of Multilayer Perceptron. Neural Process Lett 38, 99–115 (2013). https://doi.org/10.1007/s11063-013-9283-z

Download citation

Published: 03 February 2013
Issue Date: October 2013
DOI: https://doi.org/10.1007/s11063-013-9283-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fast and Stable Learning Utilizing Singular Regions of Multilayer Perceptron

Abstract

Access this article

Similar content being viewed by others

Scientific Machine Learning Through Physics–Informed Neural Networks: Where we are and What’s Next

Development and Application of Artificial Neural Network

Fundamentals of Artificial Neural Networks and Deep Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fast and Stable Learning Utilizing Singular Regions of Multilayer Perceptron

Abstract

Access this article

Similar content being viewed by others

Scientific Machine Learning Through Physics–Informed Neural Networks: Where we are and What’s Next

Development and Application of Artificial Neural Network

Fundamentals of Artificial Neural Networks and Deep Learning

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation