Treating Artificial Neural Net Training as a Nonsmooth Global Optimization Problem

Griewank, Andreas; Rojas, Ángel

doi:10.1007/978-3-030-37599-7_64

Treating Artificial Neural Net Training as a Nonsmooth Global Optimization Problem

Conference paper
First Online: 03 January 2020

1773 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11943))

Abstract

We attack the classical neural network training problem by successive piecewise linearization, applying three different methods for the global optimization of the local piecewise linear models. The methods are compared to each other and steepest descent as well as stochastic gradient on the regression problem for the Griewank function.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Arora, S., Cohen, N., Golowich, N., Hu, W.: A convergence analysis of gradient descent for deep linear neural networks. CoRR, abs/1810.02281 (2018)
Google Scholar
Bagirov, A., Karmitsa, N., Mäkelä, M.: Introduction to Nonsmooth Optimization: Theory, Practice and Software. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08114-4
Book MATH Google Scholar
Bölcskei, H., Grohs, P., Kutyniok, G., Petersen, P.: Optimal approximation with sparsely connected deep neural networks. ArXiv:abs/1705.01714 (2019)
Bottou, L., Curtis, F.E., Nocedal, J.: Optimization methods for large-scale machine learning. SIAM Rev. 60, 223–311 (2018)
Article MathSciNet Google Scholar
Fourer, R., Kernighan, B.W.: AMPL: a modeling language for mathematical programming (2003)
Google Scholar
Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. J. Mach. Learn. Res. 15, 315–323 (2011)
Google Scholar
Griewank, A.: On stable piecewise linearization and generalized algorithmic differentiation. Optim. Methods Softw. 28(6), 1139–1178 (2013)
Article MathSciNet Google Scholar
Griewank, A., Walther, A.: First and second order optimality conditions for piecewise smooth objective functions. Optim. Methods Softw. 31(5), 904–930 (2016)
Article MathSciNet Google Scholar
Griewank, A.: Generalized descent of global optimization. J. Optim. Theory Appl. 34, 11–39 (1981)
Article MathSciNet Google Scholar
Griewank, A., Walther, A.: Finite convergence of an active signature method to local minima of piecewise linear functions. Optim. Methods Softw. 34, 1035–1055 (2019)
Article MathSciNet Google Scholar
Gupte, A., Ahmed, S., Cheon, M., Dey, S.: Solving mixed integer bilinear problems using MILP formulations. SIAM J. Optim. 23(2), 721–744 (2013)
Article MathSciNet Google Scholar
Wright, S.J.: Coordinate descent algorithms. Math. Program. 151, 3–34 (2015)
Article MathSciNet Google Scholar
Kakade, S.M., Lee, J.D.: Provably correct automatic sub-differentiation for qualified programs. ArXiv:abs/1809.08530 (2018)
Kärkkäinen, T., Heikkola, E.: Robust formulations for training multilayer perceptrons. Neural Comput. 16, 837–862 (2004)
Article Google Scholar
Scholtes, S.: Introduction to Piecewise Differentiable Equations. Springer, New York (2012). https://doi.org/10.1007/978-1-4614-4340-7
Book MATH Google Scholar
Yarotsky, D.: Error bounds for approximations with deep ReLU networks. Neural Netw. Off. J. Int. Neural Netw. Soc. 94, 103–114 (2017)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Research Center on Mathematical Modelling (MODEMAT), Escuela Politécnica Nacional, Quito, Ecuador
Andreas Griewank
School of Mathematical and Computational Sciences, Yachay Tech, Urcuquí, Ecuador
Ángel Rojas

Authors

Andreas Griewank
View author publications
You can also search for this author in PubMed Google Scholar
Ángel Rojas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andreas Griewank .

Editor information

Editors and Affiliations

University of Cambridge, Cambridge, UK
Giuseppe Nicosia
University of Florida, Gainesville, FL, USA
Panos Pardalos
Harvard University, Cambridge, MA, USA
Renato Umeton
Università di Catania, Catania, Catania, Italy
Giovanni Giuffrida
Almawave, Rome, Roma, Italy
Vincenzo Sciacca

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Griewank, A., Rojas, Á. (2019). Treating Artificial Neural Net Training as a Nonsmooth Global Optimization Problem. In: Nicosia, G., Pardalos, P., Umeton, R., Giuffrida, G., Sciacca, V. (eds) Machine Learning, Optimization, and Data Science. LOD 2019. Lecture Notes in Computer Science(), vol 11943. Springer, Cham. https://doi.org/10.1007/978-3-030-37599-7_64

Download citation

DOI: https://doi.org/10.1007/978-3-030-37599-7_64
Published: 03 January 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37598-0
Online ISBN: 978-3-030-37599-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics