Computational Experiments with the RAVE Heuristic

Tom, David; Müller, Martin

doi:10.1007/978-3-642-17928-0_7

David Tom¹⁸ &
Martin Müller¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6515))

Included in the following conference series:

International Conference on Computers and Games

1462 Accesses

Abstract

The Monte-Carlo tree search algorithm Upper Confidence bounds applied to Trees (UCT) has become extremely popular in computer games research. The Rapid Action Value Estimation (RAVE) heuristic is a strong estimator that often improves the performance of UCT-based algorithms. However, there are situations where RAVE misleads the search whereas pure UCT search can find the correct solution. Two games, the simple abstract game Sum of Switches (SOS) and the game of Go, are used to study the behavior of the RAVE heuristic. In SOS, RAVE updates are manipulated to mimic game situations where RAVE misleads the search. Such false RAVE updates are used to create RAVE overestimates and underestimates. A study of the distributions of mean and RAVE values reveals great differences between Go and SOS. While the RAVE-max update rule is able to correct extreme cases of RAVE underestimation, it is not effective in closer to practical settings and in Go.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Adapting Improved Upper Confidence Bounds for Monte-Carlo Tree Search

Game Solvers

References

Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Chapter Google Scholar
Gelly, S., Wang, Y., Munos, R., Teytaud, O.: Modification of UCT with patterns in Monte-Carlo Go, Technical Report RR-6062, INRIA, France (2006)
Google Scholar
Finnsson, H., Björnsson, Y.: Simulation-based approach to General Game Playing. In: Fox, D., Gomes, C. (eds.) AAAI, pp. 259–264. AAAI Press, Menlo Park (2008)
Google Scholar
Arneson, B., Hayward, R., Henderson, P.: Wolve 2008 wins Hex Tournament. ICGA Journal 32(1), 49–53 (2009)
Article Google Scholar
Lorentz, R.J.: Amazons discover monte-carlo. In: van den Herik, H.J., Xu, X., Ma, Z., Winands, M.H.M. (eds.) CG 2008. LNCS, vol. 5131, pp. 13–24. Springer, Heidelberg (2008)
Chapter Google Scholar
Winands, M., Björnsson, Y.: Evaluation function based Monte-Carlo LOA. In: [15], pp. 33–44
Google Scholar
Brügmann, B.: Monte Carlo Go (March 1993) (unpublished manuscript), http://www.cgl.ucsf.edu/go/Programs/Gobble.html
Gelly, S., Silver, D.: Combining online and offline knowledge in UCT. In: Ghahramani, Z. (ed.) ICML. ACM International Conference Proceeding Series, vol. 227, pp. 273–280. ACM, New York (2007)
Chapter Google Scholar
Tom, D., Müller, M.: A study of UCT and its enhancements in an artificial game. In: [15], pp. 55–64
Google Scholar
Teytaud, F., Teytaud, O.: Creating an Upper-Confidence-Tree program for Havannah. In: [15], pp. 65–74
Google Scholar
Enzenberger, M., Müller, M.: Fuego (2008), http://fuego.sf.net/ (Retrieved December 22, 2008)
Silver, D.: Reinforcement Learning and Simulation-Based Search. PhD thesis, University of Alberta (2009)
Google Scholar
Tom, D.: Investigating UCT and RAVE: Steps Towards a More Robust Method. Master’s thesis, University of Alberta, Department of Computing Science (2010)
Google Scholar
Enzenberger, M., Müller, M., Arneson, B., Segal, R.: Fuego – an open-source framework for board games and Go engine based on Monte-Carlo tree search. Submitted to IEEE Transactions on Computational Intelligence and AI in Games (2010)
Google Scholar
van den Herik, H.J., Spronck, P. (eds.): ACG 2009. LNCS, vol. 6048. Springer, Heidelberg (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing Science, University of Alberta, Edmonton, Canada, T6G 2E8
David Tom & Martin Müller

Authors

David Tom
View author publications
You can also search for this author in PubMed Google Scholar
Martin Müller
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Tilburg Center for Cognition and Communication (TiCC), Tilburg University, P.O. Box 90153, 5000LE, Tilburg, The Netherlands
H. Jaap van den Herik & Aske Plaat &
Japan Advanced Institute of Science and Technology, Research Unit for Computers and Games, 1-1, Asahidai, 923-1292, Nomi, Ishikawa, Japan
Hiroyuki Iida

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tom, D., Müller, M. (2011). Computational Experiments with the RAVE Heuristic. In: van den Herik, H.J., Iida, H., Plaat, A. (eds) Computers and Games. CG 2010. Lecture Notes in Computer Science, vol 6515. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17928-0_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-17928-0_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17927-3
Online ISBN: 978-3-642-17928-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Computational Experiments with the RAVE Heuristic

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Adapting Improved Upper Confidence Bounds for Monte-Carlo Tree Search

Game Solvers

Game Solvers

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Computational Experiments with the RAVE Heuristic

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Adapting Improved Upper Confidence Bounds for Monte-Carlo Tree Search

Game Solvers

Game Solvers

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation