Abstract
Monte Carlo Tree Search/Upper Confidence bounds applied to Trees (MCTS/UCT) is a popular and powerful search technique applicable to many domains, most frequently to searching game trees. Even though the algorithm has been widely researched, there is still room for its improvement, especially when combined with metaheuristics or machine learning methods. In this paper, we revise and experimentally evaluate the idea of enhancing MCTS/UCT with game-specific heuristics that guide the playout (simulation) phase. MCTS/UCT with the proposed guiding mechanism is tested on two popular board games: Othello and Hex. The enhanced method clearly defeats the well-known Alpha-beta pruning algorithm in both games, and for the more complex game (Othello) is highly competitive to the vanilla MCTS/UCT formulation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arneson, B., Hayward, R., Henderson, P.: Mohex wins hex tournament. In: ICGA, vol. 32, pp. 114–116, September 2013. https://doi.org/10.3233/ICG-2009-32218
Barratt, J., Pan, C.: Playing go without game tree search using convolutional neural networks. ArXiv abs/1907.04658 (2019)
Björnsson, Y., Finnsson, H.: Cadiaplayer: a simulation-based general game player. Comput. Intell. AI Games IEEE Trans. 1, 4–15 (2009). https://doi.org/10.1109/TCIAIG.2009.2018702
Enzenberger, M., Müller, M., Arneson, B., Segal, R.: Fuego-an open-source framework for board games and go engine based on monte carlo tree search. IEEE Trans. Comput. Intell. AI Games 2(4), 259–270 (2010). https://doi.org/10.1109/TCIAIG.2010.2083662
Gelly, S., Silver, D.: Combining online and offline knowledge in UCT. In: Proceedings of the 24th International Conference on Machine Learning, pp. 273–280 (2007). https://doi.org/10.1145/1273496.1273531
Gelly, S., Silver, D.: Monte-carlo tree search and rapid action value estimation in computer go. Artif. Intell. 175(11), 1856–1875 (2011). https://doi.org/10.1016/j.artint.2011.03.007
Karwowski, J., Mańdziuk, J.: A new approach to security games. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2015, Part II. LNCS (LNAI), vol. 9120, pp. 402–411. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19369-4_36
Karwowski, J., Mańdziuk, J.: A monte carlo tree search approach to finding efficient patrolling schemes on graphs. Eur. J. Oper. Res. 277(1), 255–268 (2019)
Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006). https://doi.org/10.1007/11871842_29
Maarup, T.: Everything you always wanted to know about hexbut were afraid to Ask. Ph.D. thesis (2005). www.maarup.net/thomas/hex/hex3.pdf
Segler, M.H., Preuss, M., Waller, M.P.: Planning chemical syntheses with deep neural networks and symbolic AI. Nature 555(7698), 604–610 (2018)
Silver, D., et al.: Mastering the game of Go without human knowledge. Nature 550(7676), 354–359 (2017)
Świechowski, M., Park, H.S., Mańdziuk, J., Kim, K.J.: Recent advances in general game playing. The Scientific World Journal 2015, Article ID: 986262 (2015)
Świechowski, M., Godlewski, K., Sawicki, B., Mańdziuk, J.: Monte carlo tree search: a review of recent modifications and applications. Artif. Intell. Rev. 56, 2497–2562 (2023). https://doi.org/10.1007/s10462-022-10228-y
Świechowski, M., Mańdziuk, J.: Self-adaptation of playing strategies in general game playing. IEEE Trans. Comput. Intell. AI Games 6(4), 367–381 (2014). https://doi.org/10.1109/TCIAIG.2013.2275163
Walȩdzik, K., Mańdziuk, J.: Applying hybrid monte carlo tree search methods to risk-aware project scheduling problem. Inf. Sci. 460–461, 450–468 (2018)
Łapa, K., Cpałka, K., Kisiel-Dorohinicki, M., Paszkowski, J., Dȩbski, M., Le, V.H.: Multi-population-based algorithm with an exchange of training plans based on population evaluation. J. Artif. Intell. Soft Comput. Rese. 12(4), 239–253 (2022). https://doi.org/10.2478/jaiscr-2022-0016
Łapa, K., Cpałka, K., Laskowski, Ł, Cader, A., Zeng, Z.: Evolutionary algorithm with a configurable search mechanism. J. Artif. Intell. Soft Comput. Res. 10(3), 151–171 (2020). https://doi.org/10.2478/jaiscr-2020-0011
Acknowledgements.
This research was carried out with the support of the Laboratory of Bioinformatics and Computational Genomics and the HPC Center of the Faculty of Mathematics and Information Science, Warsaw University of Technology under computational grant number A-22-03.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mańdziuk, J., Walczak, P. (2023). Monte Carlo Tree Search with Metaheuristics. In: Rutkowski, L., Scherer, R., Korytkowski, M., Pedrycz, W., Tadeusiewicz, R., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2023. Lecture Notes in Computer Science(), vol 14126. Springer, Cham. https://doi.org/10.1007/978-3-031-42508-0_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-42508-0_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-42507-3
Online ISBN: 978-3-031-42508-0
eBook Packages: Computer ScienceComputer Science (R0)