Abstract
We study the behaviour at zero of the derivatives of the cost function used when training non-linear neural networks. It is shown that a fair number of first-, second- and higher-order derivatives vanish at zero, supporting the belief that 0 is a peculiar and potentially harmful location in weight space. These calculations are related to practical and theoretical aspects of neural network training.
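As a concrete illustration of the abstract's claim, the sketch below evaluates the first derivatives of a squared-error cost for a one-hidden-layer tanh network with all parameters set to zero. The architecture, cost, and data are illustrative assumptions, not a restatement of the paper's exact setting; the point is only that the derivatives with respect to both weight layers and the hidden biases vanish at the origin, while the output-bias derivative generally does not.

```python
import numpy as np

# Assumed setting (for illustration only): one hidden tanh layer,
# squared-error cost, all parameters initialised to zero.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 3))          # 20 inputs, 3 features
t = rng.normal(size=20)               # 20 scalar targets
H = 5                                 # number of hidden units

W = np.zeros((H, 3))                  # input -> hidden weights
b = np.zeros(H)                       # hidden biases
v = np.zeros(H)                       # hidden -> output weights
c = 0.0                               # output bias

# Forward pass at the zero point.
h = np.tanh(X @ W.T + b)              # hidden activations: all zero, since tanh(0) = 0
err = (h @ v + c) - t                 # residuals: here simply -t

# Analytic first derivatives of E = 0.5 * sum(err**2).
grad_v = h.T @ err                                    # zero, because the hidden activations vanish
grad_c = err.sum()                                    # generally non-zero
grad_W = ((err[:, None] * v) * (1 - h ** 2)).T @ X    # zero, because v = 0
grad_b = ((err[:, None] * v) * (1 - h ** 2)).sum(0)   # zero, because v = 0

print(np.allclose(grad_v, 0), np.allclose(grad_W, 0), np.allclose(grad_b, 0))  # True True True
print(grad_c)  # the output-bias derivative is the one component that need not vanish
```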
Cite this article
Goutte, C. Behaviour in 0 of the Neural Networks Training Cost. Neural Processing Letters 8, 107–116 (1998). https://doi.org/10.1023/A:1009684310458