Abstract
Learning algorithms used in deep learning involve many attributes called hyperparameters; these variables help determine the network structure. The performance of an algorithm depends on these hyperparameter values, which must be set before the algorithm is actually run. This study presents an overview of some commonly used hyperparameters in the context of learning algorithms for training neural networks, along with an analysis of adaptive learning algorithms used to tune learning rates.
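For illustration only (this example is not taken from the paper), the following is a minimal NumPy sketch of one step of the Adam update rule, a representative adaptive learning-rate method of the kind such a study typically covers; the function name, default hyperparameter values, and the toy loss are assumptions made here.

```python
import numpy as np

def adam_update(params, grads, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam step: the effective learning rate is adapted per parameter
    using exponential moving averages of the gradient (m) and its square (v)."""
    m = beta1 * m + (1 - beta1) * grads          # first-moment (mean) estimate
    v = beta2 * v + (1 - beta2) * grads ** 2     # second-moment (uncentered variance) estimate
    m_hat = m / (1 - beta1 ** t)                 # bias correction for step t (t starts at 1)
    v_hat = v / (1 - beta2 ** t)
    params = params - lr * m_hat / (np.sqrt(v_hat) + eps)
    return params, m, v

# Usage on a toy quadratic loss L(w) = ||w||^2 / 2, whose gradient is simply w
w = np.array([1.0, -2.0])
m = np.zeros_like(w)
v = np.zeros_like(w)
w, m, v = adam_update(w, grads=w, m=m, v=v, t=1)
```

Methods such as Adagrad, Adadelta, and RMSprop follow the same pattern of scaling the step size by accumulated gradient statistics, which is what makes the per-parameter learning rate "adaptive."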