Abstract
The Game of Go is one of the biggest challenge in the field of Computer Game. The large board makes Go very complex and hard to evaluate. In this paper, we propose a method that reduce the complexity of Go by learning and extracting patterns from game records. This method is more efficient and stronger than the baseline method we have chosen. Our method has two major components: a) a pattern learning method based on K-means, it will learn and extract patterns from game records, b) a perceptron which learns the win rates of Go situations. We build an agent to evaluate the performance of our method, and get at least 20% of performance improvement or 25% of computing power saving in most circumstances.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Slany, W.: The complexity of graph Ramsey games. In: Marsland, T., Frank, I. (eds.) CG 2001. LNCS, vol. 2063, pp. 186–203. Springer, Heidelberg (2002)
Hsieh, M.Y., Tsai, S.-C.: On the fairness and complexity of generalized k-in-a-row games. Theoretical Computer Science 385(1), 88–100 (2007)
Allis, V.L.: Searching for solutions in games and artificial intelligence (1994)
Shannon, C.E.: Programming a computer for playing chess. Philosophical Magazine 41(314), 256–275 (1950)
Reisch, S.: Gobang ist PSPACE-vollständig. Acta Informatica 13(1), 59–66 (1980)
Robson, J.M.: The Complexity of Go. In: IFIP Congress, pp. 413–417 (1983)
Campbell, M., Hoane Jr., A.J., Hsu, F.-H.: Deep blue. Artificial Intelligence 134, 157–183 (2002)
van der Werf, E.C.D., Van Den Herik, H.J., Uiterwijk, J.W.H.M.: Solving Go on Small Boards. ICGA Journal 26(2), 92–107 (2003)
Bouzy, B., Cazenave, T.: Computer Go: An AI oriented survey. Artificial Intelligence 132, 39–103 (2001)
Browne, C.B., Powley, E., Whitehouse, D., Lucas, S.M., Cowling, P.I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., Colton, S.: A survey of monte carlo tree search methods. Computational Intelligence and AI in Games 4(1), 1–43 (2012)
Press, W.H.: Numerical recipes, 3rd edn. The art of scientific computing (2007)
Tesauro, G.: TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation 6(2), 215–219 (1994)
Schraudolph, N.N., Dayan, P., Sejnowski, T.J.: Temporal difference learning of position evaluation in the game of Go. In: Advances in Neural Information Processing Systems, p. 817 (1994)
Ekker, R., van der Werf, E.C.D., Schomaker, L.R.B.: Dedicated TD-learning for Stronger Gameplay: Applications to Go (2004)
Ghory, I.: Reinforcement learning in board games. Department of Computer Science, University of Bristol, Tech. Rep. (2004)
Gelly, S., Wang, Y., Munos, R., Teytaud, O., et al.: Modification of UCT with patterns in Monte-Carlo Go (2006)
Gelly, S., Silver, D.: Achieving Master Level Play in 9 x 9 Computer Go. In: AAAI, vol. 8, pp. 1537–1540 (2008)
Coulom, R.: Computing elo ratings of move patterns in the game of go. In: Computer Games Workshop (2007)
Stern, D., Herbrich, R., Graepel, T.: Bayesian pattern ranking for move prediction in the game of Go. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 873–880 (2006)
Graepel, T., Goutrié, M., Krüger, M., Herbrich, R.: Learning on graphs in the game of go. In: Dorffner, G., Bischof, H., Hornik, K. (eds.) ICANN 2001. LNCS, vol. 2130, p. 347. Springer, Heidelberg (2001)
Ralaivola, L., Wu, L., Baldi, P.: SVM and pattern-enriched common fate graphs for the game of Go (2005)
Coates, A., Ng, A.Y.: Learning feature representations with K-means. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) NN: Tricks of the Trade, 2nd edn. LNCS, vol. 7700, pp. 561–580. Springer, Heidelberg (2012)
Coates, A., Ng, A.Y., Lee, H.: An analysis of single-layer networks in unsupervised feature learning. In: International Conference on Artificial Intelligence and Statistics, pp. 215–223 (2011)
Xu, R., Wunsch, D., et al.: Survey of clustering algorithms. Neural Networks 16(3), 645–678 (2005)
Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Liang, Y., Chen, S. (2014). K-means Pattern Learning for Move Evaluation in the Game of Go. In: Pham, DN., Park, SB. (eds) PRICAI 2014: Trends in Artificial Intelligence. PRICAI 2014. Lecture Notes in Computer Science(), vol 8862. Springer, Cham. https://doi.org/10.1007/978-3-319-13560-1_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-13560-1_39
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13559-5
Online ISBN: 978-3-319-13560-1
eBook Packages: Computer ScienceComputer Science (R0)