Deep Automatic Control of Learning Rates for GANs

Kamiya, Toshiki; Sakaue, Fumihiko; Sato, Jun

doi:10.1007/978-3-031-06381-7_8

Toshiki Kamiya⁸,
Fumihiko Sakaue⁸ &
Jun Sato⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1578))

Included in the following conference series:

International Workshop on Frontiers of Computer Vision

606 Accesses

Abstract

In this paper, we propose a method for automatically controlling the learning rate of Generative Adversarial Networks (GANs) so as to stabilize the training of GANs. In recent years, GAN has been successful in various types of image generation tasks. Since GAN trains Generators and Discriminators adversarially, it is very important to keep the balance of their learning progress. However, it is known that the adjustment of learning rate of GAN is extremely difficult compared to conventional networks. Thus, we in this paper propose a method for predicting the future training progress of GANs from the current state of Generators and Discriminators, and for automatically controlling the learning rate of GANs appropriately. The proposed method has been tested using several different GANs, and the results show the proposed method can control the learning rate of GANs appropriately for a variety of tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Squeeze Criterion GANs: Double Adversarial Learning Method

Generative adversarial networks with adaptive learning strategy for noise-to-image synthesis

Article 17 November 2022

Understanding GANs: fundamentals, variants, training challenges, applications, and open problems

Article 14 May 2024

References

Abadi, M., et al.: Large-scale machine learning on heterogeneous systems (2015). https://www.tensorflow.org/
Bergstra, J., Yamins, D., Cox, D.D.: Making a science of model search: hyperparameter optimization in hundreds of dimensions for vision architectures. In:Proceedings of the 30th International Conference on Machine Learning (ICML 2013), pp. I-115–I-123 (2013)
Google Scholar
Fukushima, K., Miyake, S.: Neocognitron: a new algorithm for pattern recognition tolerant of deformations and shifts in position. Pattern Recogn. 15(6), 455–469 (1982)
Article Google Scholar
Goodfellow, I., et al.: Generative adversarial nets. In: Advances in Neural Information Processing Systems, pp. 2672–2680 (2014)
Google Scholar
Hansen, N., Ostermeier, A.: Adapting arbitrary normal mutation distributions in evolution strategies: the covariance matrix adaptation. In: Proceedings of IEEE International Conference on Evolutionary Computation, pp. 312–317 (1996)
Google Scholar
Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., Hochreiter, S.: GANs trained by a two time-scale update rule converge to a local Nash equilibrium. In: Conference on Neural Information Processing Systems (NIPS) (2017)
Google Scholar
Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1125–1134 (2017)
Google Scholar
LeCun, Y., Cortes, C., Burges, C.J.: MNIST handwritten digit database, yann lecun, corinna cortes and chris burges. http://yann.lecun.com/exdb/mnist/
Mirza, M., Osindero, S.: Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784, pp. 1–7 (2014)
Nelder, J.A., Mead, R.: A simplex method for function minimization. Comput. J. 7, 308–313 (1965)
Article MathSciNet Google Scholar
Preferred Networks, I.: Automatic hyperparameter optimization framework for machine learning. https://www.preferred.jp/en/projects/optuna/
Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V., Radford, A., Chen, X.: Improved techniques for training GANs. In: NIPS 2016 (2016)
Google Scholar
Snoek, J., Larochelle, H., Adams, R.P.: Practical Bayesian optimization of machine learning algorithms. In: NIPS 2012: Proceedings of the 25th International Conference on Neural Information Processing Systems, vol. 2, pp. 2951–2959 (2012)
Google Scholar
Szegedy, C., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)
Google Scholar
Xiao, H., Rasul, K., Vollgraf, R.: Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms (2017)
Google Scholar
Zhu, J., Park, T., Isola, P., Efros, A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the International Conference on Computer Vision, pp. 2223–2232 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Nagoya Institute of Technology, Nagoya, 466-8555, Japan
Toshiki Kamiya, Fumihiko Sakaue & Jun Sato

Authors

Toshiki Kamiya
View author publications
You can also search for this author in PubMed Google Scholar
Fumihiko Sakaue
View author publications
You can also search for this author in PubMed Google Scholar
Jun Sato
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jun Sato .

Editor information

Editors and Affiliations

Aoyama Gakuin University, Kanagawa, Japan
Kazuhiko Sumi
Chosun University, Gwangju, Korea (Republic of)
In Seop Na
Aoyama Gakuin University, Kanagawa, Japan
Naoshi Kaneko

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kamiya, T., Sakaue, F., Sato, J. (2022). Deep Automatic Control of Learning Rates for GANs. In: Sumi, K., Na, I.S., Kaneko, N. (eds) Frontiers of Computer Vision. IW-FCV 2022. Communications in Computer and Information Science, vol 1578. Springer, Cham. https://doi.org/10.1007/978-3-031-06381-7_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-06381-7_8
Published: 17 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06380-0
Online ISBN: 978-3-031-06381-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Deep Automatic Control of Learning Rates for GANs