Pruning Playouts in Monte-Carlo Tree Search for the Game of Havannah

Duguépéroux, Joris; Mazyad, Ahmad; Teytaud, Fabien; Dehos, Julien

doi:10.1007/978-3-319-50935-8_5

Joris Duguépéroux¹⁶,
Ahmad Mazyad¹⁶,
Fabien Teytaud¹⁶ &
…
Julien Dehos¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10068))

Included in the following conference series:

International Conference on Computers and Games

1116 Accesses
3 Citations

Abstract

Monte-Carlo Tree Search (MCTS) is a popular technique for playing multi-player games. In this paper, we propose a new method to bias the playout policy of MCTS. The idea is to prune the decisions which seem “bad” (according to the previous iterations of the algorithm) before computing each playout. Thus, the method evaluates the estimated “good” moves more precisely. We have tested our improvement for the game of Havannah and compared it to several classic improvements. Our method outperforms the classic version of MCTS (with the RAVE improvement) and the different playout policies of MCTS that we have experimented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Enhancing Playout Policy Adaptation for General Game Playing

Monte Carlo Game Solver

Monte Carlo Tree Search: a review of recent modifications and applications

Article Open access 19 July 2022

References

Arneson, B., Hayward, R., Henderson, P.: Monte-Carlo tree search in hex. IEEE Trans. Comput. Intell. AI Games 2(4), 251–258 (2010)
Article Google Scholar
Baier, H., Drake, P.: The power of forgetting: improving the last-good-reply policy in Monte-Carlo go. IEEE Trans. Comput. Intell. AI Games 2(4), 303–309 (2010)
Article Google Scholar
Bertsimas, D., Griffith, J., Gupta, V., Kochenderfer, M.J., Mišić, V., Moss, R.: A comparison of Monte-Carlo tree search and mathematical optimization for large scale dynamic resource allocation (2014). arXiv:1405.5498
Browne, C., Powley, E., Whitehouse, D., Lucas, S., Cowling, P., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., Colton, S.: A survey of Monte-Carlo tree search methods. IEEE Trans. Comput. Intell. AI Games 4(1), 1–43 (2012)
Article Google Scholar
Cazenave, T.: Monte-Carlo kakuro. In: Herik, H.J., Spronck, P. (eds.) ACG 2009. LNCS, vol. 6048, pp. 45–54. Springer, Heidelberg (2010). doi:10.1007/978-3-642-12993-3_5
Chapter Google Scholar
Chaslot, G., Saito, J., Bouzy, B., Uiterwijk, J., Herik, H.: Monte-Carlo strategies for computer go. In: Proceedings of the 18th BeNeLux Conference on Artificial Intelligence, pp. 83–91, Namur, Belgium (2006)
Google Scholar
Coulom, R.: Efficient selectivity and backup operators in Monte-Carlo tree search. In: Herik, H.J., Ciancarini, P., Donkers, H.H.L.M.J. (eds.) CG 2006. LNCS, vol. 4630, pp. 72–83. Springer, Heidelberg (2007). doi:10.1007/978-3-540-75538-8_7
Chapter Google Scholar
Drake, P.: The last-good-reply policy for Monte-Carlo go. Int. Comput. Games Assoc. J. 32(4), 221–227 (2009)
MathSciNet Google Scholar
Edelkamp, S., Tang, Z.: Monte-Carlo tree search for the multiple sequence alignment problem. In: Eighth Annual Symposium on Combinatorial Search (2015)
Google Scholar
Ewalds, T.: Playing and Solving Havannah. Master’s thesis, University of Alberta (2012)
Google Scholar
Finnsson, H., Björnsson, Y.: Simulation-based approach to general game playing. In: Proceedings of the 23rd National Conference on Artificial Intelligence, AAAI 2008, vol. 1, pp. 259–264. AAAI Press (2008)
Google Scholar
Gelly, S., Silver, D.: Combining online and offline knowledge in UCT. In: Proceedings of the 24th International Conference on Machine Learning, pp. 273–280. ACM (2007)
Google Scholar
Gelly, S., Silver, D.: Monte-Carlo tree search and rapid action value estimation in computer go. Artif. Intell. 175(11), 1856–1875 (2011)
Article MathSciNet Google Scholar
Guo, X., Singh, S., Lee, H., Lewis, R.L., Wang, X.: Deep learning for real-time atari game play using offline Monte-Carlo tree search planning. In: Advances in Neural Information Processing Systems, pp. 3338–3346 (2014)
Google Scholar
Heinrich, J., Silver, D.: Self-play Monte-Carlo tree search in computer poker. In: Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence (2014)
Google Scholar
Herik, H.J., Kuipers, J., Vermaseren, J.A.M., Plaat, A.: Investigations with Monte Carlo tree search for finding better multivariate horner schemes. In: Filipe, J., Fred, A. (eds.) ICAART 2013. CCIS, vol. 449, pp. 3–20. Springer, Heidelberg (2014). doi:10.1007/978-3-662-44440-5_1
Google Scholar
Hoock, J., Lee, C., Rimmel, A., Teytaud, F., Wang, M., Teytaud, O.: Intelligent agents for the game of go. IEEE Comput. Intell. Mag. 5(4), 28–42 (2010)
Google Scholar
Kocsis, L., Szepesvári, C.: Bandit based Monte-Carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006). doi:10.1007/11871842_29
Chapter Google Scholar
Lanctot, M., Saffidine, A., Veness, J., Archibald, C., Winands, M.: Monte Carlo*-minimax search. In: Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, pp. 580–586. AAAI Press (2013)
Google Scholar
Lorentz, R.: Improving Monte-Carlo tree search in Havannah. In: Computers and Games 2010, pp. 105–115 (2010)
Google Scholar
Lorentz, R.J.: Amazons discover Monte-Carlo. In: Herik, H.J., Xu, X., Ma, Z., Winands, M.H.M. (eds.) CG 2008. LNCS, vol. 5131, pp. 13–24. Springer, Heidelberg (2008). doi:10.1007/978-3-540-87608-3_2
Chapter Google Scholar
Mazyad, A., Teytaud, F., Fonlupt, C.: Monte-Carlo Tree Search for the “mr jack” board game. J. Soft Comput. Artif. Intell. Appl. (IJSCAI) 4(1) (2015)
Google Scholar
Powley, E.J., Whitehouse, D., Cowling, P.I.: Bandits all the way down: UCB1 as a simulation policy in Monte-Carlo tree search. In: CIG, pp. 81–88. IEEE (2013)
Google Scholar
Rimmel, A., Teytaud, F.: Multiple overlapping tiles for contextual Monte Carlo tree search. In: Chio, C., Cagnoni, S., Cotta, C., Ebner, M., Ekárt, A., Esparcia-Alcazar, A.I., Goh, C.-K., Merelo, J.J., Neri, F., Preuß, M., Togelius, J., Yannakakis, G.N. (eds.) EvoApplications 2010. LNCS, vol. 6024, pp. 201–210. Springer, Heidelberg (2010). doi:10.1007/978-3-642-12239-2_21
Chapter Google Scholar
Rimmel, A., Teytaud, F., Teytaud, O.: Biasing Monte-Carlo simulations through RAVE values. In: Herik, H.J., Iida, H., Plaat, A. (eds.) CG 2010. LNCS, vol. 6515, pp. 59–68. Springer, Heidelberg (2011). doi:10.1007/978-3-642-17928-0_6
Chapter Google Scholar
Schmittberger, R.: New Rules for Classic Games. Wiley, New York (1992)
Google Scholar
Stankiewicz, J.A., Winands, M.H.M., Uiterwijk, J.W.H.M.: Monte-Carlo tree search enhancements for havannah. In: Herik, H.J., Plaat, A. (eds.) ACG 2011. LNCS, vol. 7168, pp. 60–71. Springer, Heidelberg (2012). doi:10.1007/978-3-642-31866-5_6
Chapter Google Scholar
Tak, M.J., Winands, M.H., Björnsson, Y.: N-grams and the last-good-reply policy applied in general game playing. IEEE Trans. Comput. Intell. AI Games 4(2), 73–83 (2012)
Article Google Scholar
Taralla, D.: Learning Artificial Intelligence in Large-Scale Video Games. Ph.D. thesis, University of Liège (2015)
Google Scholar
Teytaud, F., Teytaud, O.: Creating an upper-confidence-tree program for havannah. In: Herik, H.J., Spronck, P. (eds.) ACG 2009. LNCS, vol. 6048, pp. 65–74. Springer, Heidelberg (2010). doi:10.1007/978-3-642-12993-3_7
Chapter Google Scholar
Wilisowski, Ł., Dreżewski, R.: The application of co-evolutionary genetic programming and TD(1) reinforcement learning in large-scale strategy game VCMI. In: Jezic, G., Howlett, R.J., Jain, L.C. (eds.) Agent and Multi-Agent Systems: Technologies and Applications. SIST, vol. 38, pp. 81–93. Springer, Heidelberg (2015). doi:10.1007/978-3-319-19728-9_7
Chapter Google Scholar

Download references

Acknowledgements

Experiments presented in this paper were carried out using the CALCULCO computing platform, supported by SCOSI/ULCO (Service Commun du Système d’Information de l’Université du Littoral Côte d’Opale).

Author information

Authors and Affiliations

LISIC, ULCO, Université du Littoral Côte d’Opale, Calais, France
Joris Duguépéroux, Ahmad Mazyad, Fabien Teytaud & Julien Dehos

Authors

Joris Duguépéroux
View author publications
You can also search for this author in PubMed Google Scholar
Ahmad Mazyad
View author publications
You can also search for this author in PubMed Google Scholar
Fabien Teytaud
View author publications
You can also search for this author in PubMed Google Scholar
Julien Dehos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Julien Dehos .

Editor information

Editors and Affiliations

Leiden Institute of Advanced Computer Science (LIACS), Leiden University, Leiden, Zuid-Holland, The Netherlands
Aske Plaat
Leiden Institute of Advanced Computer Science (LIACS), Leiden University, Leiden, Zuid-Holland, The Netherlands
Walter Kosters
Leiden Institute of Advanced Computer Science (LIACS), Leiden University, Leiden, Zuid-Holland, The Netherlands
Jaap van den Herik

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Duguépéroux, J., Mazyad, A., Teytaud, F., Dehos, J. (2016). Pruning Playouts in Monte-Carlo Tree Search for the Game of Havannah. In: Plaat, A., Kosters, W., van den Herik, J. (eds) Computers and Games. CG 2016. Lecture Notes in Computer Science(), vol 10068. Springer, Cham. https://doi.org/10.1007/978-3-319-50935-8_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-50935-8_5
Published: 10 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50934-1
Online ISBN: 978-3-319-50935-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Pruning Playouts in Monte-Carlo Tree Search for the Game of Havannah

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Enhancing Playout Policy Adaptation for General Game Playing

Monte Carlo Game Solver

Monte Carlo Tree Search: a review of recent modifications and applications

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Pruning Playouts in Monte-Carlo Tree Search for the Game of Havannah

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Enhancing Playout Policy Adaptation for General Game Playing

Monte Carlo Game Solver

Monte Carlo Tree Search: a review of recent modifications and applications

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation