Abstract
This study investigates whether feedforward neural networks with two hidden layers generalise better than those with one. In contrast to the existing literature, a method is proposed which allows these networks to be compared empirically on a hidden-node-by-hidden-node basis. This is applied to ten public domain function approximation datasets. Networks with two hidden layers were found to be better generalisers in nine of the ten cases, although the actual degree of improvement is case dependent. The proposed method can be used to rapidly determine whether it is worth considering two hidden layers for a given problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Netw. 2, 359–366 (1989)
Hornik, K., Stinchcombe, M., White, H.: Some new results on neural network approximation. Neural Netw. 6, 1069–1072 (1993)
Huang, G.-B., Babri, H.A.: Upper bounds on the number of hidden neurons in feedforward networks with arbitrary bounded nonlinear activation functions. IEEE Trans. Neural Netw. 9, 224–229 (1998)
Zhang, G.P.: Avoiding pitfalls in neural network research. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 37, 3–16 (2007)
Chester, D.L.: Why two hidden layers are better than one. In: Caudhill, M. (ed.) International Joint Conference on Neural Networks, vol. 1, pp. 265–268. Laurence Erlbaum, New Jersey (1990)
Brightwell, G., Kenyon, C., Paugam-Moisy, H.: Multilayer neural networks: one or two hidden layers? In: Mozer, M.C., Jordan, M.I., Petsche, T. (eds.) Advances in Neural Information Processing Systems, vol. 9, pp. 148–154. MIT Press, Cambridge (1997)
Sontag, E.D.: Feedback stabilization using two-hidden-layer nets. IEEE Trans. Neural Netw. 3, 981–990 (1992)
Thomas, A.J., Walters, S.D., Petridis, M., Malekshahi Gheytassi, S., Morgan, R.E.: Accelerated optimal topology search for two-hidden-layer feedforward neural networks. In: Jayne, C., Iliadis, L. (eds.) EANN 2016. CCIS, vol. 629, pp. 253–266. Springer, Cham (2016). doi:10.1007/978-3-319-44188-7_19
Thomas, A.J., Walters, S.D., Malekshahi Gheytassi, S., Morgan, R.E., Petridis, M.: On the optimal node ratio between hidden layers: a probabilistic study. Int. J. Mach. Learn. Comput. 6, 241–247 (2016). doi:10.18178/ijmlc.2016.6.5.605
Nakama, T.: Comparisons of single- and multiple-hidden-layer neural networks. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds.) Advances in Neural Networks – ISNN 2011 Part 1. LNCS, vol. 6675, pp. 270–279. Springer, Heidelberg (2011)
Funahashi, K.-I.: On the approximate realization of continuous mappings by neural networks. Neural Netw. 2, 183–192 (1989)
Idler, C.: Pattern recognition and machine learning techniques for algorithmic trading. MA thesis, FernUniversität, Hagen, Germany (2014)
Moré, J.J.: The Levenberg-Marquardt algorithm: implementation and theory. In: Watson, G.A. (ed.) Numerical Analysis. LNM, vol. 630, pp. 105–116. Springer, Heidelberg (1978). doi:10.1007/BFb0067700
Beale, M.H., Hagan, M.T., Demuth, H.B.: Neural Network Toolbox User’s guide. https://www.mathworks.com/help/pdf_doc/nnet/nnet_ug.pdf
UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/
Bilkent University Function Approximation Repository. http://funapp.cs.bilkent.edu.tr/DataSets/
Regression Datasets. http://www.dcc.fc.up.pt/~ltorgo/Regression/DataSets.html
Yeh, I.-C.: Modeling of strength of high performance concrete using artificial neural networks. Cem. Concr. Res. 28, 1797–1808 (1998)
Acknowledgements
We thank Prof. Martin T. Hagan of Oklahoma State University for kindly donating the Engine dataset used in this paper to Matlab. Thanks also to Prof. I-Cheng Yeh for permission to use his Concrete Compressive Strength dataset [18], as well as the other donors of the various datasets used in this study.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix A – Full Results: Average Node for Node Comparisons
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Thomas, A.J., Petridis, M., Walters, S.D., Gheytassi, S.M., Morgan, R.E. (2017). Two Hidden Layers are Usually Better than One. In: Boracchi, G., Iliadis, L., Jayne, C., Likas, A. (eds) Engineering Applications of Neural Networks. EANN 2017. Communications in Computer and Information Science, vol 744. Springer, Cham. https://doi.org/10.1007/978-3-319-65172-9_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-65172-9_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65171-2
Online ISBN: 978-3-319-65172-9
eBook Packages: Computer ScienceComputer Science (R0)