On Low Complexity Acceleration Techniques for Randomized Optimization

Stich, Sebastian Urban

doi:10.1007/978-3-319-10762-2_13

Sebastian Urban Stich¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8672))

Included in the following conference series:

International Conference on Parallel Problem Solving from Nature

2958 Accesses

Abstract

Recently it was shown by Nesterov (2011) that techniques form convex optimization can be used to successfully accelerate simple derivative-free randomized optimization methods. The appeal of those schemes lies in their low complexity, which is only Θ(n) per iteration—compared to Θ(n ²) for algorithms storing second-order information or covariance matrices. From a high-level point of view, those accelerated schemes employ correlations between successive iterates—a concept looking similar to the evolution path used in Covariance Matrix Adaptation Evolution Strategies (CMA-ES). In this contribution, we (i) implement and empirically test a simple accelerated random search scheme (SARP). Our study is the first to provide numerical evidence that SARP can effectively be implemented with adaptive step size control and does not require access to gradient or advanced line search oracles. We (ii) try to empirically verify the supposed analogy between the evolution path and SARP. We propose an algorithm CMA-EP that uses only the evolution path to bias the search. This algorithm can be generalized to a family of low memory schemes, with complexity Θ(mn) per iteration, following a recent approach by Loshchilov (2014). The study shows that the performance of CMA-EP heavily depends on the spectra of the objective function and thus it cannot accelerate as consistently as SARP.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Methods to compare expensive stochastic optimization algorithms with random restarts

Article 14 June 2018

On asymptotic convergence rate of random search

Article Open access 24 November 2023

Natural Gradient Interpretation of Rank-One Update in CMA-ES

References

Polyak, B.: Introduction to Optimization. Optimization Software - Inc. (1987)
Google Scholar
Nesterov, Y.: Introductory Lectures on Convex Optimization. Kluwer (2004)
Google Scholar
Broyden, C.G.: The Convergence of a Class of Double-rank Minimization Algorithms 1. General Considerations. IMA J. of Appl. Math. 6(1), 76–90 (1970)
Article MathSciNet MATH Google Scholar
Fletcher, R.: A new approach to variable metric algorithms. The Computer Journal 13(3), 317–322 (1970)
Article MathSciNet MATH Google Scholar
Goldfarb, D.: A Family of Variable-Metric Methods Derived by Variational Means. Mathematics of Computation 24(109), 23–26 (1970)
Article MathSciNet MATH Google Scholar
Nocedal, J.: Updating Quasi-Newton Matrices with Limited Storage. Mathematics of Computation 35(151), 773–782 (1980)
Article MathSciNet MATH Google Scholar
Liu, D.C., Nocedal, J.: On the limited memory BFGS method for large scale optimization. Mathematical Programming 45(1-3), 503–528 (1989)
Article MathSciNet MATH Google Scholar
Nesterov, Y.: A method of solving a convex programming problem with convergence rate O(1/k ²). Soviet Mathematics Doklady 27(2), 372–376 (1983)
MATH Google Scholar
Nesterov, Y.: Smoothing technique and its applications in semidefinite optimization. Mathematical Programming 110(2), 245–259 (2007)
Article MathSciNet MATH Google Scholar
Tseng, P.: On accelerated proximal gradient methods for convex-concave optimization. Submitted to SIAM Journal on Optimization (2008)
Google Scholar
Schumer, M., Steiglitz, K.: Adaptive step size random search. IEEE Transactions on Automatic Control 13(3), 270–276 (1968)
Article Google Scholar
Rechenberg, I.: Evolutionsstrategie; Optimierung technischer Systeme nach Prinzipien der biologischen Evolution. Frommann-Holzboog (1973)
Google Scholar
Mutseniyeks, V.A., Rastrigin, L.A.: Extremal control of continuous multi-parameter systems by the method of random search. Eng.Cyb. 1, 82–90 (1964)
Google Scholar
Stich, S.U., Müller, C.L., Gärtner, B.: Optimization of convex functions with Random Pursuit. SIAM Journal on Optimization 23(2), 1284–1309 (2013)
Article MathSciNet MATH Google Scholar
Nesterov, Y.: Random Gradient-Free Minimization of Convex Functions. Technical report, ECORE (2011)
Google Scholar
Leventhal, D., Lewis, A.S.: Randomized Hessian estimation and directional search. Optimization 60(3), 329–345 (2011)
Article MathSciNet MATH Google Scholar
Stich, S.U., Gärtner, B., Müller, C.L.: Variable Metric Random Pursuit (2012) (submitted), http://arxiv.org/abs/1210.5114
Hansen, N., Ostermeier, A.: Completely Derandomized Self-Adaption in Evolution Strategies. Evolutionary Computation 9(2), 159–195 (2001)
Article Google Scholar
Hansen, N., Muller, S.D., Koumoutsakos, P.: Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES). Evol. Comput. 11(1), 1–18 (2003)
Article Google Scholar
Knight, J.N., Lunacek, M.: Reducing the Space-time Complexity of the CMA-ES. In: GECCO 2007, pp. 658–665. ACM (2007)
Google Scholar
Loshchilov, I.: A Computationally Efficient Limited Memory CMA-ES for Large Scale Optimization. To appear GECCO (2014), http://arxiv.org/abs/1404.5520
Lee, Y.T., Sidford, A.: Efficient accelerated coordinate descent methods and faster algorithms for solving linear systems. In: FOCS, pp. 147–156. IEEE (2013)
Google Scholar
Ostermeier, A., Gawelczyk, A., Hansen, N.: Step-size adaptation based on non-local use of selection information. In: Davidor, Y., Männer, R., Schwefel, H.-P. (eds.) PPSN 1994. LNCS, vol. 866, pp. 189–198. Springer, Heidelberg (1994)
Chapter Google Scholar
Igel, C., Suttorp, T., Hansen, N.: A Computational Efficient Covariance Matrix Update and a (1+1)-CMA for Evolution Strategies. In: GECCO, pp. 453–460 (2006)
Google Scholar
Sun, Y., Schaul, T., Gomez, F., Schmidhuber, J.: A linear time natural evolution strategy for non-separable functions. In: Proc. 15th Genetic and Evolutionary Computation Conference Companion, pp. 61–62. ACM (2013)
Google Scholar
Stich, S.U.: Supplementary Online Mat (2014), http://arxiv.org/abs/1406.2010
Stich, S.U., Müller, C.L.: On Spectral Invariance of Randomized Hessian and Covariance Matrix Adaptation Schemes. In: Coello, C.A.C., Cutello, V., Deb, K., Forrest, S., Nicosia, G., Pavone, M. (eds.) PPSN 2012, Part I. LNCS, vol. 7491, pp. 448–457. Springer, Heidelberg (2012)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Theoretical Computer Science, ETH Zürich, 8092, Zürich, Switzerland
Sebastian Urban Stich

Authors

Sebastian Urban Stich
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Computer and Engineering Sciences, Cologne University of Applied Sciences, Steinmüllerallee 1, 51643, Gummersbach, Germany
Thomas Bartz-Beielstein
Warwick Business School, University of Warwick, CV8 2SY, Coventry, UK
Jürgen Branke
Department of Intelligent Systems, JožefStefan Institute, Jamova cesta 39, 1000, Ljubljana, Slovenia
Bogdan Filipič
Department of Computer Science and Creative Technologies, University of the West of England, BS16 1QY, Bristol, UK
Jim Smith

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Stich, S.U. (2014). On Low Complexity Acceleration Techniques for Randomized Optimization. In: Bartz-Beielstein, T., Branke, J., Filipič, B., Smith, J. (eds) Parallel Problem Solving from Nature – PPSN XIII. PPSN 2014. Lecture Notes in Computer Science, vol 8672. Springer, Cham. https://doi.org/10.1007/978-3-319-10762-2_13

Download citation

DOI: https://doi.org/10.1007/978-3-319-10762-2_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10761-5
Online ISBN: 978-3-319-10762-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics