Abstract
A backpropagation learning algorithm for feedforward neural networks with an adaptive learning rate is derived. The algorithm is based upon minimising the instantaneous output error and does not include any simplifications encountered in the corresponding Least Mean Square (LMS) algorithms for linear adaptive filters. The backpropagation algorithm with an adaptive learning rate, which is derived based upon the Taylor series expansion of the instantaneous output error, is shown to exhibit behaviour similar to that of the Normalised LMS (NLMS) algorithm. Indeed, the derived optimal adaptive learning rate of a neural network trained by backpropagation degenerates to the learning rate of the NLMS for a linear activation function of a neuron. By continuity, the optimal adaptive learning rate for neural networks imposes additional stabilisation effects upon the traditional backpropagation learning algorithm.
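The derivation the abstract summarises can be sketched for a single neuron; the notation below (input $\mathbf{x}(k)$, weights $\mathbf{w}(k)$, activation $\Phi$) is our own, not taken from the paper. With output $y(k) = \Phi(\mathbf{x}^T(k)\mathbf{w}(k))$ and instantaneous error $e(k) = d(k) - y(k)$, a first-order Taylor expansion of the a posteriori error about the current weights gives

$$\bar{e}(k) \approx e(k)\left[1 - \eta(k)\,\left(\Phi'\big(\mathbf{x}^T(k)\mathbf{w}(k)\big)\right)^2 \|\mathbf{x}(k)\|_2^2\right],$$

and setting $\bar{e}(k) = 0$ yields the optimal adaptive learning rate

$$\eta_{\mathrm{opt}}(k) = \frac{1}{\left(\Phi'\big(\mathbf{x}^T(k)\mathbf{w}(k)\big)\right)^2\,\|\mathbf{x}(k)\|_2^2}.$$

For a linear activation, $\Phi' \equiv 1$ and $\eta_{\mathrm{opt}}(k) = 1/\|\mathbf{x}(k)\|_2^2$, which is the NLMS step size, consistent with the degeneracy the abstract states.

A minimal numerical sketch of the resulting update for a single tanh neuron follows; the function names and the regularisation term eps are our own assumptions, not the paper's.

import numpy as np

def tanh_prime(v):
    # Derivative of tanh: Phi'(v) = 1 - tanh(v)^2
    return 1.0 - np.tanh(v) ** 2

def adaptive_lr_step(w, x, d, eps=1e-8):
    """One gradient step for a single tanh neuron using the
    Taylor-series-derived adaptive learning rate (sketch)."""
    net = x @ w
    e = d - np.tanh(net)                  # instantaneous output error
    g = tanh_prime(net)                   # activation derivative at the operating point
    eta = 1.0 / (g ** 2 * (x @ x) + eps)  # adaptive learning rate; eps guards division by zero
    return w + eta * e * g * x            # gradient-descent weight update

# With a linear activation (g = 1) the step degenerates to NLMS:
#   w <- w + e * x / (||x||^2 + eps)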
Cite this article
Mandic, D.P., Chambers, J.A. Towards the Optimal Learning Rate for Backpropagation. Neural Processing Letters 11, 1–5 (2000). https://doi.org/10.1023/A:1009686825582