Abstract
This paper deals with steplength selection in stochastic gradient methods for large-scale optimization problems arising in machine learning. We introduce an adaptive steplength selection derived by tailoring a limited memory steplength rule, recently developed in the deterministic context, to the stochastic gradient approach. The proposed rule provides values within an interval whose bounds need to be prefixed by the user. A suitable choice of the interval bounds makes it possible to perform similarly to the standard stochastic gradient method equipped with the best-tuned fixed steplength. Since the setting of the bounds only slightly affects the performance, the new rule makes parameter tuning less expensive than the search for the optimal prefixed steplength in the standard stochastic gradient method. We evaluate the behaviour of the proposed steplength selection in training binary classifiers on well-known data sets, using different loss functions.
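To make the mechanism concrete, the sketch below (a minimal illustration, not the authors' method) runs stochastic gradient descent with a steplength that is proposed by a generic adaptive rule and then clipped to the user-prefixed interval [alpha_min, alpha_max]. The function names, the synthetic data, and the simple decaying rule that stands in for the paper's limited memory rule are all assumptions for illustration.

```python
import numpy as np

# Minimal sketch (not the authors' implementation): SGD in which each
# steplength is produced by some adaptive rule and then projected onto a
# user-prefixed interval [alpha_min, alpha_max]. The names below
# (sgd_bounded_steplength, steplength_rule, grad_batch) are illustrative.
def sgd_bounded_steplength(w0, batches, grad_batch, steplength_rule,
                           alpha_min, alpha_max, n_epochs=5):
    w = np.asarray(w0, dtype=float).copy()
    k = 0
    for _ in range(n_epochs):
        for batch in batches:
            g = grad_batch(w, batch)                       # mini-batch gradient
            alpha = steplength_rule(k, w, g)               # adaptive proposal
            alpha = min(max(alpha, alpha_min), alpha_max)  # clip to prefixed bounds
            w = w - alpha * g
            k += 1
    return w

# Toy usage: binary classification with the logistic loss on synthetic data.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 5))
y = np.sign(X @ rng.standard_normal(5) + 0.1 * rng.standard_normal(200))

def grad_batch(w, idx):
    # Gradient of the average logistic loss log(1 + exp(-y_i x_i^T w)) on a batch.
    margins = y[idx] * (X[idx] @ w)
    coeff = -y[idx] / (1.0 + np.exp(margins))
    return coeff @ X[idx] / len(idx)

# A simple decaying rule stands in here for the limited memory rule of the paper.
decaying_rule = lambda k, w, g: 1.0 / (1.0 + 0.01 * k)

batches = np.array_split(rng.permutation(200), 20)
w_opt = sgd_bounded_steplength(np.zeros(5), batches, grad_batch,
                               decaying_rule, alpha_min=1e-3, alpha_max=1.0)
```

The clipping step is the point of interest: whatever the adaptive rule proposes, the iterate only ever moves with a steplength inside the prefixed interval, which is what makes the method's performance weakly sensitive to the exact bounds.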
Cite this paper
Franchini, G., Ruggiero, V., Zanni, L.: On the Steplength Selection in Stochastic Gradient Methods. In: Sergeyev, Y., Kvasov, D. (eds.) Numerical Computations: Theory and Algorithms. NUMTA 2019. Lecture Notes in Computer Science, vol. 11973. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-39081-5_17