Convergence of gradient method for a fully recurrent neural network

Xu, Dongpo; Li, Zhengxue; Wu, Wei

doi:10.1007/s00500-009-0398-0

Original Paper
Published: 17 February 2009

Volume 14, pages 245–250, (2010)
Soft Computing

Dongpo Xu¹,
Zhengxue Li¹ &
Wei Wu¹

Abstract

Recurrent neural networks have been successfully used for the analysis and prediction of temporal sequences. This paper is concerned with the convergence of a gradient-descent learning algorithm for training a fully recurrent neural network. In the literature, stochastic process theory has been used to establish probabilistic convergence results for the on-line gradient training algorithm, under the assumption that a very large number of (in theory, infinitely many) training samples of the temporal sequences are available. In this paper, we consider the case in which only a limited number of training samples are available, so that a stochastic treatment of the problem is no longer appropriate. Instead, we use an off-line gradient training algorithm for the fully recurrent neural network and prove corresponding deterministic convergence results. The error function is also guaranteed to decrease monotonically over the iterations. A numerical example is given to support the theoretical findings.
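As a rough illustration of the setting described in the abstract, the following sketch trains a tiny fully recurrent network by off-line (batch) gradient descent on a fixed, finite set of samples and records the error after every iteration. It is not the authors' algorithm or their numerical example: the network size, the tanh activation, the choice of unit 0 as output, the data, the learning rate `eta`, and all function names are assumptions made here, and a central finite-difference gradient stands in for the analytic gradient.

```python
import math

def forward(params, inputs, n):
    """Run a fully recurrent tanh network and return the output-unit sequence."""
    W = [params[i * n:(i + 1) * n] for i in range(n)]  # recurrent weights (n x n)
    V = params[n * n:n * n + n]                        # input weights (length n)
    h = [0.0] * n                                      # initial state (assumed zero)
    outputs = []
    for x in inputs:
        h = [math.tanh(sum(W[i][j] * h[j] for j in range(n)) + V[i] * x)
             for i in range(n)]
        outputs.append(h[0])                           # unit 0 is the assumed output unit
    return outputs

def error(params, inputs, targets, n):
    """Sum-of-squares error over the whole (finite) training sequence."""
    ys = forward(params, inputs, n)
    return 0.5 * sum((y - d) ** 2 for y, d in zip(ys, targets))

def gradient(params, inputs, targets, n, eps=1e-6):
    """Central finite-difference gradient (a stand-in for the analytic gradient)."""
    grad = []
    for k in range(len(params)):
        up = params[:k] + [params[k] + eps] + params[k + 1:]
        down = params[:k] + [params[k] - eps] + params[k + 1:]
        grad.append((error(up, inputs, targets, n)
                     - error(down, inputs, targets, n)) / (2 * eps))
    return grad

def train_offline(params, inputs, targets, n, eta=0.01, epochs=200):
    """Batch gradient descent: one weight update per pass over all samples."""
    history = [error(params, inputs, targets, n)]
    for _ in range(epochs):
        g = gradient(params, inputs, targets, n)
        params = [p - eta * gk for p, gk in zip(params, g)]
        history.append(error(params, inputs, targets, n))
    return params, history

# Hypothetical toy data: a short input sequence and desired outputs.
inputs = [0.5, -0.3, 0.8, 0.1, -0.6]
targets = [0.2, 0.1, 0.4, 0.3, -0.1]
n = 2                                                  # number of recurrent units (assumed)
initial = [0.1, -0.2, 0.3, 0.05, 0.2, -0.1]            # 4 recurrent + 2 input weights

trained, history = train_offline(initial, inputs, targets, n)
is_monotone = all(b <= a for a, b in zip(history, history[1:]))
print("initial error:", history[0])
print("final error:", history[-1])
print("error non-increasing at every step:", is_monotone)
```

With a sufficiently small fixed learning rate, the recorded error sequence in this toy setup does not increase from one iteration to the next, which is the kind of monotonicity the abstract refers to. The precise conditions under which this is guaranteed are those of the paper, not of this sketch.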

Acknowledgments

This work is partly supported by the National Natural Science Foundation of China (10471017).

Author information

Authors and Affiliations

Department of Applied Mathematics, Dalian University of Technology, Dalian, 116024, People’s Republic of China
Dongpo Xu, Zhengxue Li & Wei Wu

Corresponding author

Correspondence to Zhengxue Li.

About this article

Cite this article

Xu, D., Li, Z. & Wu, W. Convergence of gradient method for a fully recurrent neural network. Soft Comput 14, 245–250 (2010). https://doi.org/10.1007/s00500-009-0398-0

Published: 17 February 2009
Issue Date: February 2010
DOI: https://doi.org/10.1007/s00500-009-0398-0
