Solving Sokoban Game with a Heuristic for Avoiding Dead-End States

Ignatenko, Oleksii; Pravosud, Ruslan

doi:10.1007/978-3-031-48325-7_4

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1980))

Included in the following conference series:

International Conference on Information and Communication Technologies in Education, Research, and Industrial Applications

178 Accesses

Abstract

This paper focuses on applying reinforcement learning methods to solve the game Sokoban. This game is a popular puzzle, relatively easy for humans to solve. However, it poses a significant challenge for computer algorithms due to the irreversible nature of certain moves. To predict which actions will lead to such undesirable states is often difficult for a learning agent – a common problem in tasks requiring planning. We propose using a Monte-Carlo tree search (MCTS) algorithm and a heuristic convolution neural network (CNN) specially trained to separate undesirable, neutral, and desired game states to address this issue. We experimented with different heuristic variations of algorithms and compared them against each other. We have implemented MCTS in two different setups: one with a CNN trained using data obtained during the solving process and one without such training. We also varied the number of rollouts for each move in MCTS and compared the results. The paper’s research question was how to improve the performance of learning agents in tasks that require planning to avoid unwanted states.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Racanière, S., et al.: Imagination-augmented agents for deep reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 30 (2017). https://arxiv.org/abs/1707.06203
Ge, V.: Solving planning problems with deep reinforcement learning and tree search (2018). https://hdl.handle.net/2142/101086
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)
MATH Google Scholar
Plaat, A., Kosters, W., Preuss, M.: Deep model-based reinforcement learning for high-dimensional problems, a survey. arXiv preprint arXiv:2008.05598 (2020)
Shoham, Y., Elidan, G.: Solving Sokoban with forward-backward reinforcement learning. In: Proceedings of the International Symposium on Combinatorial Search, vol. 12, no. 1 (2021)
Google Scholar
Feng, D., Gomes, C.P., Selman, B.: A novel automated curriculum strategy to solve hard Sokoban planning instances. In: Advances in Neural Information Processing Systems, vol. 33, pp. 3141–3152 (2020). https://arxiv.org/abs/2110.00898
Gym Sokoban. https://github.com/mpSchrader/gym-sokoban. Accessed 30 Apr 2023
Kissmann, P., Edelkamp, S.: Improving cost-optimal domain-independent symbolic planning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 25, no. 1 (2011)
Google Scholar
https://github.com/Pronod/Sokoban

Download references

Acknowledgments

We would like to thank the Armed Forces of Ukraine for providing security, that made this work possible.

Author information

Authors and Affiliations

Kyiv Academic University, Kyiv, Ukraine
Ruslan Pravosud
Ukraine Catholic University, Lviv, Ukraine
Oleksii Ignatenko

Authors

Oleksii Ignatenko
View author publications
You can also search for this author in PubMed Google Scholar
Ruslan Pravosud
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Oleksii Ignatenko .

Editor information

Editors and Affiliations

University of Huddersfield, Huddersfield, UK
Grigoris Antoniou
Ukrainian Catholic University, Lviv, Ukraine
Vadim Ermolayev
Kherson State University, Kherson, Ukraine
Vitaliy Kobets
Odessa National Polytechnic University, Odesa, Ukraine
Vira Liubchenko
University of Klagenfurt, Klagenfurt, Austria
Heinrich C. Mayr
Kherson State University, Kherson, Ukraine
Aleksander Spivakovsky
University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
Vitaliy Yakovyna
V. N. Karazin Kharkiv National University, Kharkiv, Ukraine
Grygoriy Zholtkevych

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ignatenko, O., Pravosud, R. (2023). Solving Sokoban Game with a Heuristic for Avoiding Dead-End States. In: Antoniou, G., et al. Information and Communication Technologies in Education, Research, and Industrial Applications. ICTERI 2023. Communications in Computer and Information Science, vol 1980. Springer, Cham. https://doi.org/10.1007/978-3-031-48325-7_4

Download citation

DOI: https://doi.org/10.1007/978-3-031-48325-7_4
Published: 01 December 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-48324-0
Online ISBN: 978-3-031-48325-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Solving Sokoban Game with a Heuristic for Avoiding Dead-End States