A BP-Based Algorithm for Performing Bayesian Inference in Large Perceptron-Type Networks

Kabashima, Yoshiyuki; Uda, Shinsuke

doi:10.1007/978-3-540-30215-5_36

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3244)

Included in the following conference series:

International Conference on Algorithmic Learning Theory

Abstract

Although the Bayesian approach provides optimal performance for many inference problems, its computational cost is sometimes impractical. We develop a practical algorithm, based on belief propagation (BP), for approximating Bayesian inference in large single-layer feed-forward networks (perceptrons). Direct application of BP to this inference problem remains computationally difficult. However, by introducing methods and concepts from statistical mechanics that are related to the central limit theorem and the law of large numbers, the proposed BP-based algorithm achieves nearly optimal performance on a practical time scale for ideal large networks. To demonstrate the practical significance of the proposed algorithm, we also present an application to a problem that arises in a mobile communications system.

Author information

Authors and Affiliations

Tokyo Institute of Technology, Yokohama, 2268502, Japan
Yoshiyuki Kabashima & Shinsuke Uda

Editor information

Editors and Affiliations

David R. Cheriton School of Computer Science, University of Waterloo
Shoham Ben-David
Department of Computer & Information Sciences, University of Delaware, 103 Smith Hall, Newark, DE 19716
John Case
Dept. of Information Technology and Electronics, Ishinomaki Senshu University
Akira Maruoka

Cite this paper

Kabashima, Y., Uda, S. (2004). A BP-Based Algorithm for Performing Bayesian Inference in Large Perceptron-Type Networks. In: Ben-David, S., Case, J., Maruoka, A. (eds) Algorithmic Learning Theory. ALT 2004. Lecture Notes in Computer Science, vol 3244. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30215-5_36

DOI: https://doi.org/10.1007/978-3-540-30215-5_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23356-5
Online ISBN: 978-3-540-30215-5
eBook Packages: Springer Book Archive
