Two Hidden Layers are Usually Better than One

Thomas, Alan J.; Petridis, Miltos; Walters, Simon D.; Gheytassi, Saeed Malekshahi; Morgan, Robert E.

doi:10.1007/978-3-319-65172-9_24


Part of the book series: Communications in Computer and Information Science (CCIS, volume 744)

Included in the following conference series:

International Conference on Engineering Applications of Neural Networks


Abstract

This study investigates whether feedforward neural networks with two hidden layers generalise better than those with one. In contrast to the existing literature, a method is proposed which allows these networks to be compared empirically on a hidden-node-by-hidden-node basis. This is applied to ten public domain function approximation datasets. Networks with two hidden layers were found to be better generalisers in nine of the ten cases, although the actual degree of improvement is case dependent. The proposed method can be used to rapidly determine whether it is worth considering two hidden layers for a given problem.
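The abstract does not spell out the comparison procedure, so the following is only a minimal sketch of one plausible reading of "hidden-node-by-hidden-node": for each total number of hidden nodes, the single-hidden-layer network is compared against every two-hidden-layer split of the same node budget. The function names (`topologies`, `n_weights`, `best_by_total`) and the error dictionary they consume are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Tuple

Topology = Tuple[int, ...]  # hidden-layer sizes, e.g. (4,) or (3, 1)


def topologies(total_hidden: int) -> List[Topology]:
    """List the one-layer layout and every two-layer split of a node budget."""
    if total_hidden < 1:
        raise ValueError("total_hidden must be at least 1")
    one_layer = [(total_hidden,)]
    two_layer = [(n1, total_hidden - n1) for n1 in range(1, total_hidden)]
    return one_layer + two_layer


def n_weights(n_in: int, hidden: Topology, n_out: int = 1) -> int:
    """Count weights and biases of a fully connected feedforward network."""
    sizes = (n_in, *hidden, n_out)
    # Each layer has (fan_in + 1 bias) * fan_out trainable parameters.
    return sum((a + 1) * b for a, b in zip(sizes, sizes[1:]))


def best_by_total(
    errors: Dict[Topology, float],
) -> Dict[Tuple[int, int], Tuple[Topology, float]]:
    """Group measured test errors by (total hidden nodes, number of layers)
    and keep the lowest-error topology in each group.

    `errors` is assumed to come from the caller's own training runs;
    this sketch does not train anything.
    """
    best: Dict[Tuple[int, int], Tuple[Topology, float]] = {}
    for topo, err in errors.items():
        key = (sum(topo), len(topo))
        if key not in best or err < best[key][1]:
            best[key] = (topo, err)
    return best
```

For a budget of four hidden nodes, `topologies(4)` yields `(4,)`, `(1, 3)`, `(2, 2)` and `(3, 1)`. Note that equal node counts do not imply equal parameter counts: with 8 inputs and 1 output, `n_weights(8, (4,))` is 41 while `n_weights(8, (2, 2))` is 27, which is worth keeping in mind when interpreting any node-for-node comparison.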


References

1. Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Netw. 2, 359–366 (1989)
2. Hornik, K., Stinchcombe, M., White, H.: Some new results on neural network approximation. Neural Netw. 6, 1069–1072 (1993)
3. Huang, G.-B., Babri, H.A.: Upper bounds on the number of hidden neurons in feedforward networks with arbitrary bounded nonlinear activation functions. IEEE Trans. Neural Netw. 9, 224–229 (1998)
4. Zhang, G.P.: Avoiding pitfalls in neural network research. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 37, 3–16 (2007)
5. Chester, D.L.: Why two hidden layers are better than one. In: Caudill, M. (ed.) International Joint Conference on Neural Networks, vol. 1, pp. 265–268. Lawrence Erlbaum, New Jersey (1990)
6. Brightwell, G., Kenyon, C., Paugam-Moisy, H.: Multilayer neural networks: one or two hidden layers? In: Mozer, M.C., Jordan, M.I., Petsche, T. (eds.) Advances in Neural Information Processing Systems, vol. 9, pp. 148–154. MIT Press, Cambridge (1997)
7. Sontag, E.D.: Feedback stabilization using two-hidden-layer nets. IEEE Trans. Neural Netw. 3, 981–990 (1992)
8. Thomas, A.J., Walters, S.D., Petridis, M., Malekshahi Gheytassi, S., Morgan, R.E.: Accelerated optimal topology search for two-hidden-layer feedforward neural networks. In: Jayne, C., Iliadis, L. (eds.) EANN 2016. CCIS, vol. 629, pp. 253–266. Springer, Cham (2016). doi:10.1007/978-3-319-44188-7_19
9. Thomas, A.J., Walters, S.D., Malekshahi Gheytassi, S., Morgan, R.E., Petridis, M.: On the optimal node ratio between hidden layers: a probabilistic study. Int. J. Mach. Learn. Comput. 6, 241–247 (2016). doi:10.18178/ijmlc.2016.6.5.605
10. Nakama, T.: Comparisons of single- and multiple-hidden-layer neural networks. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds.) Advances in Neural Networks – ISNN 2011 Part 1. LNCS, vol. 6675, pp. 270–279. Springer, Heidelberg (2011)
11. Funahashi, K.-I.: On the approximate realization of continuous mappings by neural networks. Neural Netw. 2, 183–192 (1989)
12. Idler, C.: Pattern recognition and machine learning techniques for algorithmic trading. MA thesis, FernUniversität, Hagen, Germany (2014)
13. Moré, J.J.: The Levenberg-Marquardt algorithm: implementation and theory. In: Watson, G.A. (ed.) Numerical Analysis. LNM, vol. 630, pp. 105–116. Springer, Heidelberg (1978). doi:10.1007/BFb0067700
14. Beale, M.H., Hagan, M.T., Demuth, H.B.: Neural Network Toolbox User’s Guide. https://www.mathworks.com/help/pdf_doc/nnet/nnet_ug.pdf
15. UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/
16. Bilkent University Function Approximation Repository. http://funapp.cs.bilkent.edu.tr/DataSets/
17. Regression Datasets. http://www.dcc.fc.up.pt/~ltorgo/Regression/DataSets.html
18. Yeh, I.-C.: Modeling of strength of high performance concrete using artificial neural networks. Cem. Concr. Res. 28, 1797–1808 (1998)


Acknowledgements

We thank Prof. Martin T. Hagan of Oklahoma State University for kindly donating the Engine dataset used in this paper to Matlab. Thanks also to Prof. I-Cheng Yeh for permission to use his Concrete Compressive Strength dataset [18], as well as the other donors of the various datasets used in this study.

Author information

Authors and Affiliations

School of Computing Engineering and Mathematics, University of Brighton, Brighton, UK
Alan J. Thomas, Simon D. Walters, Saeed Malekshahi Gheytassi & Robert E. Morgan
Faculty of Science and Technology, Middlesex University, London, UK
Miltos Petridis


Corresponding author

Correspondence to Alan J. Thomas.

Editor information

Editors and Affiliations

Politecnico di Milano, Milan, Italy
Giacomo Boracchi
Democritus University of Thrace, University Campus, Xanthi, Greece
Lazaros Iliadis
School of Computing Science and Digital Media, Robert Gordon University, Aberdeen, United Kingdom
Chrisina Jayne
University of Ioannina, Ioannina, Greece
Aristidis Likas

Appendix A – Full Results: Average Node for Node Comparisons

See Figs. 4, 5 and 6.


Cite this paper

Thomas, A.J., Petridis, M., Walters, S.D., Gheytassi, S.M., Morgan, R.E. (2017). Two Hidden Layers are Usually Better than One. In: Boracchi, G., Iliadis, L., Jayne, C., Likas, A. (eds) Engineering Applications of Neural Networks. EANN 2017. Communications in Computer and Information Science, vol 744. Springer, Cham. https://doi.org/10.1007/978-3-319-65172-9_24


DOI: https://doi.org/10.1007/978-3-319-65172-9_24
Published: 02 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65171-2
Online ISBN: 978-3-319-65172-9
eBook Packages: Computer Science, Computer Science (R0)
