Deep and Wide Neural Networks Covariance Estimation

Arratia, Argimiro; Cabaña, Alejandra; León, José Rafael

doi:10.1007/978-3-030-61609-0_16


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 12396)

Included in the following conference series:

International Conference on Artificial Neural Networks


Abstract

It has recently been shown that a deep neural network with i.i.d. random parameters is equivalent to a Gaussian process in the limit of infinite network width. The Gaussian process associated with the neural network is fully described by a recursive covariance kernel that is determined by the architecture of the network and expressed in terms of an expectation. We give a numerically workable analytic expression of this recursive covariance based on Hermite polynomials, and we give explicit forms of it for neural networks whose activation function is the Heaviside, ReLU, or sigmoid function.
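
To make the idea of a recursive covariance concrete, here is a minimal sketch for the ReLU case only. It is not taken from the chapter, whose exact formulas are not shown on this page. It assumes the commonly used recursion K_{l+1}(x, x') = sigma_b2 + sigma_w2 * E[relu(u) relu(v)], where (u, v) is a zero-mean Gaussian pair with covariance given by K_l, and it evaluates the expectation in two ways: with the known arc-cosine closed form, and with a truncated Hermite series sum_n a_n^2 rho^n / n!, where a_n = E[relu(Z) He_n(Z)] and rho is the correlation of (u, v). All function names, the default values sigma_w2 = 2 and sigma_b2 = 0, and the scaling of the input-layer kernel by the input dimension are illustrative assumptions.

```python
import math

import numpy as np
from numpy.polynomial.hermite_e import hermeval


def relu_expectation_closed_form(k11, k12, k22):
    """E[relu(u) relu(v)] for zero-mean Gaussian (u, v) with
    Var(u) = k11, Var(v) = k22, Cov(u, v) = k12 (arc-cosine formula)."""
    norm = math.sqrt(k11 * k22)
    rho = max(-1.0, min(1.0, k12 / norm))  # clip rounding error
    theta = math.acos(rho)
    return norm * (math.sin(theta) + (math.pi - theta) * rho) / (2.0 * math.pi)


def relu_hermite_coefficient(n):
    """a_n = E[relu(Z) He_n(Z)] for Z ~ N(0, 1), with He_n the
    probabilists' Hermite polynomial. For n >= 2, integration by
    parts gives a_n = He_{n-2}(0) / sqrt(2 * pi)."""
    if n == 0:
        return 1.0 / math.sqrt(2.0 * math.pi)
    if n == 1:
        return 0.5
    basis = np.zeros(n - 1)
    basis[-1] = 1.0  # coefficient vector that selects He_{n-2}
    return float(hermeval(0.0, basis)) / math.sqrt(2.0 * math.pi)


def relu_expectation_hermite(k11, k12, k22, n_terms=40):
    """Same expectation via the truncated series sum_n a_n^2 rho^n / n!.
    Converges quickly only when |rho| is clearly below 1."""
    norm = math.sqrt(k11 * k22)
    rho = k12 / norm
    series = sum(
        relu_hermite_coefficient(n) ** 2 * rho ** n / math.factorial(n)
        for n in range(n_terms)
    )
    return norm * series  # ReLU is positively homogeneous, so scale by norm


def relu_recursive_kernel(x1, x2, depth, sigma_w2=2.0, sigma_b2=0.0,
                          expectation=relu_expectation_closed_form):
    """Cross-covariance K_depth(x1, x2) under the assumed recursion.
    Inputs must be nonzero vectors (or sigma_b2 > 0) to avoid 0/0."""
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    d = x1.size
    # Input-layer kernel (assumed normalisation by input dimension).
    k11 = sigma_b2 + sigma_w2 * float(x1 @ x1) / d
    k12 = sigma_b2 + sigma_w2 * float(x1 @ x2) / d
    k22 = sigma_b2 + sigma_w2 * float(x2 @ x2) / d
    for _ in range(depth):
        cross = expectation(k11, k12, k22)  # uses the previous layer's values
        k12 = sigma_b2 + sigma_w2 * cross
        k11 = sigma_b2 + sigma_w2 * k11 / 2.0  # E[relu(u)^2] = Var(u) / 2
        k22 = sigma_b2 + sigma_w2 * k22 / 2.0
    return k12


if __name__ == "__main__":
    a, b = [1.0, 0.5], [0.2, 1.0]
    print(relu_recursive_kernel(a, b, depth=3))
    print(relu_recursive_kernel(a, b, depth=3,
                                expectation=relu_expectation_hermite))
```

The two printed values should agree closely, because the series is applied only to the cross term, where the correlation stays below 1. The diagonal terms use the exact identity E[relu(u)^2] = Var(u) / 2, since the series converges slowly at correlation 1.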

A. Arratia—Supported by grant TIN2017-89244-R from MINECO (Ministerio de Economía, Industria y Competitividad) and the recognition 2017SGR-856 (MACDA) from AGAUR (Generalitat de Catalunya).

A. Cabaña—Supported by grant RTI2018-096072-B-I00D Ministerio de Ciencia, Innovación y Universidades.



Author information

Authors and Affiliations

Computer Science, Polytechnical University of Catalunya, Barcelona, Spain
Argimiro Arratia
Department of Mathematics, Universidad Autónoma de Barcelona, Barcelona, Spain
Alejandra Cabaña
IMERL, Universidad de La República, Montevideo, Uruguay
José Rafael León


Corresponding author

Correspondence to Argimiro Arratia.

Editor information

Editors and Affiliations

Department of Applied Informatics, Comenius University in Bratislava, Bratislava, Slovakia
Igor Farkaš
Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs. Lyngby, Denmark
Paolo Masulli
Department of Informatics, University of Hamburg, Hamburg, Germany
Stefan Wermter


About this paper

Cite this paper

Arratia, A., Cabaña, A., León, J.R. (2020). Deep and Wide Neural Networks Covariance Estimation. In: Farkaš, I., Masulli, P., Wermter, S. (eds) Artificial Neural Networks and Machine Learning – ICANN 2020. ICANN 2020. Lecture Notes in Computer Science, vol 12396. Springer, Cham. https://doi.org/10.1007/978-3-030-61609-0_16


DOI: https://doi.org/10.1007/978-3-030-61609-0_16
Published: 14 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61608-3
Online ISBN: 978-3-030-61609-0
eBook Packages: Computer Science, Computer Science (R0)
