K-means Pattern Learning for Move Evaluation in the Game of Go

Liang, Yunzhao; Chen, Shuoying

doi:10.1007/978-3-319-13560-1_39

Yunzhao Liang²¹ &
Shuoying Chen²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8862))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

6359 Accesses
1 Citations

Abstract

The Game of Go is one of the biggest challenge in the field of Computer Game. The large board makes Go very complex and hard to evaluate. In this paper, we propose a method that reduce the complexity of Go by learning and extracting patterns from game records. This method is more efficient and stronger than the baseline method we have chosen. Our method has two major components: a) a pattern learning method based on K-means, it will learn and extract patterns from game records, b) a perceptron which learns the win rates of Go situations. We build an agent to evaluate the performance of our method, and get at least 20% of performance improvement or 25% of computing power saving in most circumstances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Slany, W.: The complexity of graph Ramsey games. In: Marsland, T., Frank, I. (eds.) CG 2001. LNCS, vol. 2063, pp. 186–203. Springer, Heidelberg (2002)
Chapter Google Scholar
Hsieh, M.Y., Tsai, S.-C.: On the fairness and complexity of generalized k-in-a-row games. Theoretical Computer Science 385(1), 88–100 (2007)
Article MATH MathSciNet Google Scholar
Allis, V.L.: Searching for solutions in games and artificial intelligence (1994)
Google Scholar
Shannon, C.E.: Programming a computer for playing chess. Philosophical Magazine 41(314), 256–275 (1950)
Article MATH MathSciNet Google Scholar
Reisch, S.: Gobang ist PSPACE-vollständig. Acta Informatica 13(1), 59–66 (1980)
Article MATH MathSciNet Google Scholar
Robson, J.M.: The Complexity of Go. In: IFIP Congress, pp. 413–417 (1983)
Google Scholar
Campbell, M., Hoane Jr., A.J., Hsu, F.-H.: Deep blue. Artificial Intelligence 134, 157–183 (2002)
Article Google Scholar
van der Werf, E.C.D., Van Den Herik, H.J., Uiterwijk, J.W.H.M.: Solving Go on Small Boards. ICGA Journal 26(2), 92–107 (2003)
Google Scholar
Bouzy, B., Cazenave, T.: Computer Go: An AI oriented survey. Artificial Intelligence 132, 39–103 (2001)
Article MATH MathSciNet Google Scholar
Browne, C.B., Powley, E., Whitehouse, D., Lucas, S.M., Cowling, P.I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., Colton, S.: A survey of monte carlo tree search methods. Computational Intelligence and AI in Games 4(1), 1–43 (2012)
Article Google Scholar
Press, W.H.: Numerical recipes, 3rd edn. The art of scientific computing (2007)
Google Scholar
Tesauro, G.: TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation 6(2), 215–219 (1994)
Article Google Scholar
Schraudolph, N.N., Dayan, P., Sejnowski, T.J.: Temporal difference learning of position evaluation in the game of Go. In: Advances in Neural Information Processing Systems, p. 817 (1994)
Google Scholar
Ekker, R., van der Werf, E.C.D., Schomaker, L.R.B.: Dedicated TD-learning for Stronger Gameplay: Applications to Go (2004)
Google Scholar
Ghory, I.: Reinforcement learning in board games. Department of Computer Science, University of Bristol, Tech. Rep. (2004)
Google Scholar
Gelly, S., Wang, Y., Munos, R., Teytaud, O., et al.: Modification of UCT with patterns in Monte-Carlo Go (2006)
Google Scholar
Gelly, S., Silver, D.: Achieving Master Level Play in 9 x 9 Computer Go. In: AAAI, vol. 8, pp. 1537–1540 (2008)
Google Scholar
Coulom, R.: Computing elo ratings of move patterns in the game of go. In: Computer Games Workshop (2007)
Google Scholar
Stern, D., Herbrich, R., Graepel, T.: Bayesian pattern ranking for move prediction in the game of Go. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 873–880 (2006)
Google Scholar
Graepel, T., Goutrié, M., Krüger, M., Herbrich, R.: Learning on graphs in the game of go. In: Dorffner, G., Bischof, H., Hornik, K. (eds.) ICANN 2001. LNCS, vol. 2130, p. 347. Springer, Heidelberg (2001)
Chapter Google Scholar
Ralaivola, L., Wu, L., Baldi, P.: SVM and pattern-enriched common fate graphs for the game of Go (2005)
Google Scholar
Coates, A., Ng, A.Y.: Learning feature representations with K-means. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) NN: Tricks of the Trade, 2nd edn. LNCS, vol. 7700, pp. 561–580. Springer, Heidelberg (2012)
Google Scholar
Coates, A., Ng, A.Y., Lee, H.: An analysis of single-layer networks in unsupervised feature learning. In: International Conference on Artificial Intelligence and Statistics, pp. 215–223 (2011)
Google Scholar
Xu, R., Wunsch, D., et al.: Survey of clustering algorithms. Neural Networks 16(3), 645–678 (2005)
Article Google Scholar
Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Beijing Institute of Technology, China
Yunzhao Liang & Shuoying Chen

Authors

Yunzhao Liang
View author publications
You can also search for this author in PubMed Google Scholar
Shuoying Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

MIMOS Berhad Technology Park Malaysia, 57000, Bukit Jalil, KL, Malaysia
Duc-Nghia Pham
Kyungpook National University, Sankyuk-Dong, Buk-Gu, 702-701, Daegu, Korea
Seong-Bae Park

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liang, Y., Chen, S. (2014). K-means Pattern Learning for Move Evaluation in the Game of Go. In: Pham, DN., Park, SB. (eds) PRICAI 2014: Trends in Artificial Intelligence. PRICAI 2014. Lecture Notes in Computer Science(), vol 8862. Springer, Cham. https://doi.org/10.1007/978-3-319-13560-1_39

Download citation

DOI: https://doi.org/10.1007/978-3-319-13560-1_39
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13559-5
Online ISBN: 978-3-319-13560-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics