Single-Player Monte-Carlo Tree Search

Schadd, Maarten P. D.; Winands, Mark H. M.; van den Herik, H. Jaap; Chaslot, Guillaume M. J. -B.; Uiterwijk, Jos W. H. M.

doi:10.1007/978-3-540-87608-3_1

Maarten P. D. Schadd¹,
Mark H. M. Winands¹,
H. Jaap van den Herik¹,
Guillaume M. J. -B. Chaslot¹ &
…
Jos W. H. M. Uiterwijk¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5131))

Included in the following conference series:

International Conference on Computers and Games

2450 Accesses
43 Citations

Abstract

Classical methods such as A* and IDA* are a popular and successful choice for one-player games. However, they fail without an accurate admissible evaluation function. In this paper we investigate whether Monte-Carlo Tree Search (MCTS) is an interesting alternative for one-player games where A* and IDA* methods do not perform well. Therefore, we propose a new MCTS variant, called Single-Player Monte-Carlo Tree Search (SP-MCTS). The selection and backpropagation strategy in SP-MCTS are different from standard MCTS. Moreover, SP-MCTS makes use of a straightforward Meta-Search extension. We tested the method on the puzzle SameGame. It turned out that our SP-MCTS program gained the highest score so far on the standardized test set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 74.99; Price excludes VAT (USA)

Softcover Book: USD 99.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Biedl, T.C., Demaine, E.D., Demaine, M.L., Fleischer, R., Jacobsen, L., Munro, I.: The Complexity of Clickomania. In: Nowakowski, R.J. (ed.) More Games of No Chance, Proc. MSRI Workshop on Combinatorial Games, pp. 389–404. MSRI Publ., Berkeley. Cambridge University Press, Cambridge (2002)
Google Scholar
Billings, D.: Personal Communication. University of Alberta, Canada (2007)
Google Scholar
Bouzy, B., Cazanave, T.: Computer Go: An AI-Oriented Survey. Artificial Intelligence 132(1), 39–103 (2001)
Article MATH MathSciNet Google Scholar
Bouzy, B., Helmstetter, B.: Monte-Carlo Go Developments. In: van den Herik, H.J., Iida, H., Heinz, E.A. (eds.) Proceedings of the 10th Advances in Computer Games Conference (ACG-10), The Netherlands, pp. 159–174. Kluwer Academic, Dordrecht (2003)
Google Scholar
Brügmann, B.: Monte Carlo Go. Technical report, Physics Department, Syracuse University (1993)
Google Scholar
Cazenave, T., Borsboom, J.: Golois Wins Phantom Go Tournament. ICGA Journal 30(3), 165–166 (2007)
Google Scholar
Chaslot, G.M.J.-B., Winands, M.H.M., Uiterwijk, J.W.H.M., van den Herik, H.J., Bouzy, B.: Progressive strategies for Monte-Carlo Tree Search. New Mathematics and Natural Computation 4(3), 343–357 (2008)
Article MathSciNet Google Scholar
Chaslot, G.M.J.B., de Jong, S., Saito, J.-T., Uiterwijk, J.W.H.M.: Monte-Carlo Tree Search in Production Management Problems. In: Schobbens, P.Y., Vanhoof, W., Schwanen, G. (eds.) Proceedings of the 18th BeNeLux Conference on Artificial Intelligence, Namur, Belgium, pp. 91–98 (2006)
Google Scholar
Chaslot, G.M.J.B., Saito, J.-T., Bouzy, B., Uiterwijk, J.W.H.M., van den Herik, H.J.: Monte-Carlo Strategies for Computer Go. In: Schobbens, P.Y., Vanhoof, W., Schwanen, G. (eds.) Proceedings of the 18th BeNeLux Conference on Artificial Intelligence, Namur, Belgium, pp. 83–91 (2006)
Google Scholar
Coulom, R.: Efficient selectivity and backup operators in Monte-Carlo tree search. In: van den Herik, H.J., Ciancarini, P., Donkers, H.H.L.M(J.) (eds.) CG 2006. LNCS, vol. 4630, pp. 72–83. Springer, Heidelberg (2007)
Chapter Google Scholar
Culberson, J.C., Schaeffer, J.: Pattern databases. Computational Intelligence 14(3), 318–334 (1998)
Article MathSciNet Google Scholar
Felner, A., Zahavi, U., Schaeffer, J., Holte, R.C.: Dual Lookups in Pattern Databases. In: IJCAI, Edinburgh, Scotland, pp. 103–108 (2005)
Google Scholar
Gomes, C.P., Selman, B., McAloon, K., Tretkoff, C.: Randomization in Backtrack Search: Exploiting Heavy-Tailed Profiles for Solving Hard Scheduling Problems. In: AIPS, Pittsburg, PA, pp. 208–213 (1998)
Google Scholar
Hart, P.E., Nilsson, N.J., Raphael, B.: A formal basis for the heuristic determination of minimum cost paths. IEEE Transactions on Systems Science and Cybernatics 4(2), 100–107 (1968)
Article Google Scholar
Junghanns, A.: Pushing the Limits: New Developments in Single Agent Search. PhD thesis, University of Alberta, Alberta, Canada (1999)
Google Scholar
Kendall, G., Parkes, A., Spoerer, K.: A Survey of NP-Complete Puzzles. ICGA Journal 31(1), 13–34 (2008)
Google Scholar
Kocsis, L., Szepesvári, C.: Bandit based Monte-Carlo Planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Google Scholar
Kocsis, L., Szepesvári, C., Willemson, J.: Improved Monte-Carlo Search (2006), http://zaphod.aml.sztaki.hu/papers/cg06-ext.pdf
Korf, R.E.: Depth-first iterative deepening: An optimal admissable tree search. Artificial Intelligence 27(1), 97–109 (1985)
Article MATH MathSciNet Google Scholar
Moribe, K.: Chain shot! Gekkan ASCII, (November 1985) (in Japanese)
Google Scholar
PDA Game Guide.com. Pocket PC Jawbreaker Game. The Ultimate Guide to PDA Games, Retrieved 7.1.2008 (2008), http://www.pdagameguide.com/jawbreaker-game.html
Sadikov, A., Bratko, I.: Solving 20 × 20 Puzzles. In: van den Herik, H.J., Uiterwijk, J.W.H.M., Winands, M.H.M., Schadd, M.P.D. (eds.) Proceedings of the Computer Games Workshop 2007 (CGW 2007), The Netherlands, pp. 157–164. Universiteit Maastricht, Maastricht (2007)
Google Scholar
Tesauro, G., Galperin, G.R.: On-line policy improvement using Monte Carlo search. In: Mozer, M.C., Jordan, M.I., Petsche, T. (eds.) Advances in Neural Information Processing Systems, vol. 9, pp. 1068–1074. MIT Press, Cambridge (1997)
Google Scholar
University of Alberta GAMES Group. GAMES Group News (Archives) (2002), http://www.cs.ualberta.ca/~games/archives.html
Vempaty, N.R., Kumar, V., Korf, R.E.: Depth-first versus best-first search. In: AAAI, Anaheim, California, USA, pp. 434–440. MIT Press, Cambridge (1991)
Google Scholar

Download references

Author information

Authors and Affiliations

Games and AI Group, MICC, Faculty of Humanities and Sciences, Universiteit Maastricht, Maastricht, The Netherlands
Maarten P. D. Schadd, Mark H. M. Winands, H. Jaap van den Herik, Guillaume M. J. -B. Chaslot & Jos W. H. M. Uiterwijk

Authors

Maarten P. D. Schadd
View author publications
You can also search for this author in PubMed Google Scholar
Mark H. M. Winands
View author publications
You can also search for this author in PubMed Google Scholar
H. Jaap van den Herik
View author publications
You can also search for this author in PubMed Google Scholar
Guillaume M. J. -B. Chaslot
View author publications
You can also search for this author in PubMed Google Scholar
Jos W. H. M. Uiterwijk
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

H. Jaap van den Herik Xinhe Xu Zongmin Ma Mark H. M. Winands

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schadd, M.P.D., Winands, M.H.M., van den Herik, H.J., Chaslot, G.M.J.B., Uiterwijk, J.W.H.M. (2008). Single-Player Monte-Carlo Tree Search. In: van den Herik, H.J., Xu, X., Ma, Z., Winands, M.H.M. (eds) Computers and Games. CG 2008. Lecture Notes in Computer Science, vol 5131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87608-3_1

Download citation

DOI: https://doi.org/10.1007/978-3-540-87608-3_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87607-6
Online ISBN: 978-3-540-87608-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics