Abstract
The natural gradient learning algorithm, which originated in information geometry, is known to be a good solution to the slow convergence of gradient descent learning methods. Whereas natural gradient learning is inspired by the geometric structure of the space of learning systems, other approaches accelerate learning by exploiting second order information of the error surface. Although these second order methods have not been as successful as natural gradient learning, their results demonstrate that second order information of the error surface is useful in the learning process. In this paper, we combine the two approaches to obtain a more efficient learning algorithm. At each learning step, we compute a search direction by means of the natural gradient; when this search direction is applied in the parameter update, second order information of the error surface is used to determine an efficient learning rate. Through a simple experiment on a real-world problem, we confirm that the proposed algorithm converges faster than the pure natural gradient learning algorithm.
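To make the combination concrete, the sketch below implements one plausible reading of the update described in the abstract: the search direction is the natural gradient d = F^{-1} grad E, and the learning rate is chosen from second order information by minimizing the local quadratic model of the error along d, which gives eta = (grad E . d) / (d' H d). This is a minimal NumPy illustration under stated assumptions, not the authors' exact scheme; the names natural_gradient_step, grad_fn, fisher_fn, hessian_fn, and the fallback rate are introduced here for illustration.

import numpy as np

def natural_gradient_step(theta, grad_fn, fisher_fn, hessian_fn, fallback_rate=0.01):
    # Euclidean gradient of the error at the current parameters.
    g = grad_fn(theta)
    # Natural gradient direction d = F^{-1} g, solved without forming F^{-1}.
    d = np.linalg.solve(fisher_fn(theta), g)
    # Second order information: curvature of the error surface along d.
    curvature = d @ hessian_fn(theta) @ d
    # Minimizing the quadratic model E(theta - eta*d) in eta gives
    # eta = (g . d) / (d' H d); fall back to a small fixed rate when the
    # curvature along d is not positive (assumed safeguard, not from the paper).
    eta = (g @ d) / curvature if curvature > 0 else fallback_rate
    return theta - eta * d

# Toy usage on a quadratic error E(theta) = 0.5 * theta' A theta, whose
# Hessian is A; purely for illustration the Fisher matrix is also taken
# to be A, in which case the step reduces to a Newton step.
A = np.array([[3.0, 1.0], [1.0, 2.0]])
theta = np.array([1.0, -1.0])
for _ in range(3):
    theta = natural_gradient_step(theta, lambda t: A @ t, lambda t: A, lambda t: A)
print(theta)  # approaches the minimizer at the origin

On this toy quadratic the directional line search recovers the exact step size, so the iterate reaches the minimum in one update; on a real multilayer perceptron the Fisher matrix and Hessian differ and would be estimated from data.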
Cite this paper
Park, H., Fukumizu, K., Amari, S., Lee, Y. (2000). An Efficient Learning Algorithm Using Natural Gradient and Second Order Information of Error Surface. In: Mizoguchi, R., Slaney, J. (eds) PRICAI 2000: Topics in Artificial Intelligence. Lecture Notes in Computer Science, vol 1886. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44533-1_23