The Bayesian Search Game

Toussaint, Marc

doi:10.1007/978-3-642-33206-7_7

Marc Toussaint⁸

Part of the book series: Natural Computing Series ((NCS))

1689 Accesses
8 Citations

Abstract

The aim of this chapter is to draw links between (1) No Free Lunch (NFL) theorems which, interpreted inversely, lay the foundation of how to design search heuristics that exploit prior knowledge about the function, (2) partially observable Markov decision processes (POMDP) and their approach to the problem of sequentially and optimally choosing search points, and (3) the use of Gaussian processes as a representation of belief, i.e., knowledge about the problem. On the one hand, this joint discussion of NFL, POMDPs and Gaussian processes will give a broader view on the problem of search heuristics. On the other hand this will naturally introduce us to efficient global optimization algorithms that are well known in operations research and geology (Gutmann, J Glob Optim 19:201–227, 2001; Jones et al., J Glob Optim 13:455–492, 1998; Jones, J Glob Optim 21:345–383, 2001) and which, in our view, naturally arise from a discussion of NFL and POMDPs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://userpage.fu-berlin.de/mtoussai/07-bsg/
2.
On true subsets ⊂ X, but not all subsets ⊆ X. This weaker condition ensures that also the ⇐ holds; see proof for details.
3.
Alternatives to represent agent policies are, for instance, finite state controllers [11].
4.
http://userpage.fu-berlin.de/mtoussai/07-bsg/

References

A. Auger, O. Teytaud, Continuous lunches are free plus the design of optimal optimization algorithms. Algorithmica 57(1), 121–146 (2010)
Article MATH MathSciNet Google Scholar
H. Gutmann, A radial basis function method for global optimization. J. Glob. Optim. 19, 201–227 (2001)
Article MATH MathSciNet Google Scholar
N. Hansen, A. Ostermeier, Completely derandomized self-adaption in evolutionary strategies. Evol. Comput. 9, 159–195 (2001)
Article Google Scholar
M. Hutter, Towards a universal theory of artificial intelligence based on algorithmic probability and sequential decision theory. arXiv: cs.AI/0012011 (2000)
Google Scholar
C. Igel, M. Toussaint, A no-free-lunch theorem for non-uniform distributions of target functions. J. Math. Model. Algorithms 3, 313–322 (2004)
Article MATH MathSciNet Google Scholar
D. Jones, M. Schonlau, W. Welch, Efficient global optimization of expensive black-box functions. J. Glob. Optim. 13, 455–492 (1998)
Article MATH MathSciNet Google Scholar
D.R. Jones, A taxonomy of global optimization methods based on response surfaces. J. Glob. Optim. 21, 345–383 (2001)
Article MATH Google Scholar
M. Pelikan, D.E. Goldberg, F. Lobo, A survey of optimization by building and using probabilistic models. Technical Report IlliGAL-99018, Illinois Genetic Algorithms Laboratory, 1999
Google Scholar
J. Pineau, G. Gordon, S. Thrun, Anytime point-based approximations for large POMDPs. J. Artif. Intell. Res. 27, 335–380 (2006)
MATH Google Scholar
J. Poland, Explicit local models: towards optimal optimization algorithms. Technical Report No. IDSIA-09-04, 2004
Google Scholar
P. Poupart, C. Boutilier, Bounded finite state controllers, in Advances in Neural Information Processing Systems 16 (NIPS 2003), Vancouver, vol. 16 (MIT Press, 2004)
Google Scholar
P. Poupart, N. Vlassis, J. Hoey, K. Regan, An analytic solution to discrete Bayesian reinforcement learning, in Proceeding of the 23rd International Conference on Machine Learning (ICML 2006), Pittsburgh, 2006, pp. 697–704
Google Scholar
C.E. Rasmussen, C. Williams, Gaussian Processes for Machine Learning (MIT Press, Cambridge, 2006)
MATH Google Scholar
M. Toussaint, Compact representations as a search strategy: compression EDAs. Theor. Comput. Sci. 361, 57–71 (2006)
Article MATH MathSciNet Google Scholar
H. Ulmer, F. Streichert, A. Zell, Optimization by Gaussian processes assisted evolution strategies, in International Conference on Operations Research (OR 2003) (Springer, Heidelberg, 2003) pp. 435–442
Google Scholar
D.H. Wolpert, W.G. Macready, No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1(1), 67–82 (1997)
Article Google Scholar

Download references

Acknowledgements

This research was supported by the German Research Foundation (DFG), Emmy Noether fellowship TO 409/1-3.

Author information

Authors and Affiliations

Machine Learning & Robotics Lab, Free University of Berlin, Arnimallee 7, 14195, Berlin, Germany
Marc Toussaint

Authors

Marc Toussaint
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marc Toussaint .

Editor information

Editors and Affiliations

VisualDNA, London, United Kingdom
Yossi Borenstein
University of Birmingham School of Computer Science, Birmingham, United Kingdom
Alberto Moraglio

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Toussaint, M. (2014). The Bayesian Search Game. In: Borenstein, Y., Moraglio, A. (eds) Theory and Principled Methods for the Design of Metaheuristics. Natural Computing Series. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33206-7_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-33206-7_7
Published: 12 November 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33205-0
Online ISBN: 978-3-642-33206-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics