Skip to main content
Log in

Fast and Stable Learning Utilizing Singular Regions of Multilayer Perceptron

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

In the parameter space of MLP(J), multilayer perceptron with J hidden units, there exist flat areas called singular regions created by applying reducibility mappings to the optimal solution of MLP(\(J-1\)). Since such singular regions cause serious stagnation of learning, a learning method to avoid singular regions has been desired. However, such avoiding does not guarantee the quality of the final solutions. This paper proposes a new learning method which does not avoid but makes good use of singular regions to stably and successively find excellent solutions commensurate with MLP(J). The proposed method worked well in our experiments using artificial and real data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Amari S (1998) Natural gradient works efficiently in learning. Neural Comput 10(2):251–276

    Article  MathSciNet  Google Scholar 

  2. Amari S, Park H, Fukumizu K (2000) Adaptive method of realizing natural gradient learning for multilayer perceptrons. Neural Comput 12(6):1399–1409

    Article  Google Scholar 

  3. Cousseau F, Oseki T, Amari S (2008) Dynamics of learning in multilayer perceptrons near singularities. IEEE Trans Neural Networks 19(8):1313–1328

    Article  Google Scholar 

  4. Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York

    MATH  Google Scholar 

  5. Fukumizu K, Amari S (2000) Local minima and plateaus in hierarchical structure of multilayer perceptrons. Neural Netw 1(3):317–327

    Article  Google Scholar 

  6. Hamey LGC (1998) XOR has no local minima: a case study in neural network error surface. Neural Netw 11(4):669–681

    Article  Google Scholar 

  7. Hecht-Nielsen H (1990) Neurocomputing. Addison–Wesley Publishing Company, Reading

    Google Scholar 

  8. Minnett RCJ, Smith AT, Hecht-Nielsen R (2011) Neural network tomography: network replication from output surface geometry. Neural Netw 24(5):484–492

    Article  Google Scholar 

  9. Luenberger DG (1984) Linear and nonlinear programming. Addison–Wesley Publishing Company, Reading

  10. Nakano R, Saito K (2002) Discovering polynomials to fit multivariate data having numeric and nominal variables. LNAI 2281:482–493

    Google Scholar 

  11. Nakano R, Satoh S, Ohwaki T (2011) Learning method utilizing singular region of multilayer perceptron. In: Proceedings of the 3rd International Conference on Neural Computation Theory and Applications, Paris, pp. 106–111

  12. Saito K, Nakano R (1997) Partial BFGS update and efficient step-length calculation for three-layer neural networks. Neural Comput 9(1):239–257

    Article  Google Scholar 

  13. Sussmann HJ (1992) Uniqueness of the weights for minimal feedforward nets with a given input-output map. Neural Netw 5(4):589–593

    Article  Google Scholar 

  14. Wan W (2006) Implementing online natural gradient learning: problems and solutions. IEEE Trans Neural Netw 17(2):317–329

    Article  Google Scholar 

  15. Watanabe S (2008) A formula of equations of states in singular learning machines. In: Proceedings of the International Joint Conference on Neural Networks 2008, Hong Kong, pp. 2099–2106

  16. Watanabe S (2009) Algebraic geometry and statistical learning theory. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

Download references

Acknowledgments

This work was supported by Grants-in-Aid for Scientific Research (C) 22500212 and Chubu University Grant 24IS27A.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ryohei Nakano.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Satoh, S., Nakano, R. Fast and Stable Learning Utilizing Singular Regions of Multilayer Perceptron. Neural Process Lett 38, 99–115 (2013). https://doi.org/10.1007/s11063-013-9283-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-013-9283-z

Keywords

Navigation