Indirect Encoding of Neural Networks for Scalable Go

Gauci, Jason; Stanley, Kenneth O.

doi:10.1007/978-3-642-15844-5_36

Jason Gauci²⁰ &
Kenneth O. Stanley²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6238))

Included in the following conference series:

International Conference on Parallel Problem Solving from Nature

2807 Accesses
8 Citations

Abstract

The game of Go has attracted much attention from the artificial intelligence community. A key feature of Go is that humans begin to learn on a small board, and then incrementally learn advanced strategies on larger boards. While some machine learning methods can also scale the board, they generally only focus on a subset of the board at one time. Neuroevolution algorithms particularly struggle with scalable Go because they are often directly encoded (i.e. a single gene maps to a single connection in the network). Thus this paper applies an indirect encoding to the problem of scalable Go that can evolve a solution to 5×5 Go and then extrapolate that solution to 7×7 Go and continue evolution. The scalable method is demonstrated to learn faster and ultimately discover better strategies than the same method trained on 7×7 Go directly from the start.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Burmeister, J., Wiles, J.: The challenge of Go as a domain for AI research: A comparison between go and chess. In: Proceedings of the Third Australian and New Zealand Conference on Intelligent Information Systems. IEEE Western Australia Section, pp. 181–186 (1995)
Google Scholar
Silver, D., Sutton, R., Müller, M.: Reinforcement learning of local shape in the game of go. In: 20th International Joint Conference on Artificial Intelligence, pp. 1053–1058 (2007)
Google Scholar
Schaul, T., Schmidhuber, J.: Scalable neural networks for board games. In: Proceedings of the International Conference on Artificial Neural Networks (ICANN). Springer, Heidelberg (2008)
Google Scholar
Stanley, K.O., Miikkulainen, R.: Evolving a roving eye for Go. In: Deb, K., et al. (eds.) GECCO 2004. LNCS, vol. 3103, pp. 1226–1238. Springer, Heidelberg (2004)
Chapter Google Scholar
Botermans, J.: The Book of Games: Strategy, Tactics, and History. Sterling Publishing Co. (2008)
Google Scholar
Shotwell, P.: Go! More Than a Game. Turtle Publishing (2003)
Google Scholar
Silver, D., Sutton, R.S., Müller, M.: Sample-based learning and search with permanent and transient memories. In: Proceedings of the 25th International Conference on Machine Learning, pp. 968–975. ACM, New York (2008)
Chapter Google Scholar
Enzenberger, M.: Evaluation in Go by a neural network using soft segmentation. In: Advances in Computer Games: Many Games, Many Challenges: Proceedings of the ICGA/IFIP SG16 10th Advances in Computer Games Conference (ACG 10), Graz, Styria, Austria, November 24-27, p. 97. Kluwer Academic Pub., Dordrecht (2003)
Google Scholar
Schraudolph, N.N., Dayan, P., Sejnowski, T.J.: Temporal difference learning of position evaluation in the game of Go. In: Advances in Neural Information Processing Systems, pp. 817–817 (1994)
Google Scholar
Graves, A., Fernandez, S., Schmidhuber, J.: Multi-dimensional recurrent neural networks. In: de Sá, J.M., Alexandre, L.A., Duch, W., Mandic, D.P. (eds.) ICANN 2007. LNCS, vol. 4668, pp. 549–558. Springer, Heidelberg (2007)
Chapter Google Scholar
Yao, X.: Evolving artificial neural networks. Proceedings of the IEEE 87(9), 1423–1447 (1999)
Article Google Scholar
Stanley, K.O., Miikkulainen, R.: Evolving neural networks through augmenting topologies. Evolutionary Computation 10, 99–127 (2002)
Article Google Scholar
Stanley, K.O., Miikkulainen, R.: Continual coevolution through complexification. In: Genetic and Evolutionary Computation Conference (2002)
Google Scholar
Fogel, D.B.: Blondie24: Playing at the Edge of AI (2002)
Google Scholar
Stanley, K.O., D’Ambrosio, D.B., Gauci, J.: A hypercube-based indirect encoding for evolving large-scale neural networks. Artificial Life 15(2), 185–212 (2009)
Article Google Scholar
Hornby, G.S., Pollack, J.B.: Creating high-level components with a generative representation for body-brain evolution. Artificial Life 8(3) (2002)
Google Scholar
Bongard, J.C.: Evolving modular genetic regulatory networks. In: Proceedings of the 2002 Congress on Evolutionary Computation (2002)
Google Scholar
Stanley, K.O., Miikkulainen, R.: A taxonomy for artificial embryogeny. Artificial Life 9(2), 93–130 (2003)
Article Google Scholar
Stanley, K.O.: Compositional pattern producing networks: A novel abstraction of development. Genetic Programming and Evolvable Machines Special Issue on Developmental Systems 8(2), 131–162 (2007)
Article MathSciNet Google Scholar
Gauci, J., Stanley, K.O.: A case study on the critical role of geometric regularity in machine learning. In: Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (AAAI-2008). AAAI Press, Menlo Park (2008)
Google Scholar
Gauci, J., Stanley, K.O.: Autonomous evolution of topographic regularities in artificial neural networks. Neural Computation 22(7), 1860–1898 (2010)
Article MATH Google Scholar
Enzenberger, M., Müller, M.: Fuego–an open-source framework for board games and go engine based on monte-carlo tree search. Technical report, Technical Report TR09-08, University of Alberta, Edmonton (2009)
Google Scholar
Gelly, S., Silver, D.: Combining online and offline knowledge in uct. In: Ghahramani, Z. (ed.) Proceedings of the International Conference of Machine Learning (ICML 2007), pp. 273–280 (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL, 32816
Jason Gauci & Kenneth O. Stanley

Authors

Jason Gauci
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth O. Stanley
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Electrical Engineering, Automatics, Computer Science and Electronics, Department of ComputerScience, AGH University of Science and Technology, Mickiewicza 30, 30-059, Kraków, Poland
Robert Schaefer
Departamento de Lenguajes y Ciencias de la Computación, ETSI Informática, Universidad de Málaga, Campus Teatinos, 29071, Málaga, Spain
Carlos Cotta
Department of Mathematics and Computer Science, University of Bielsko-Biala, Willowa 2, 43-309, Bielsko-Biala, Poland
Joanna Kołodziej
Fakultät für Informatik, Technische Universität Dortmund, 44221, Dortmund, Germany
Günter Rudolph

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gauci, J., Stanley, K.O. (2010). Indirect Encoding of Neural Networks for Scalable Go. In: Schaefer, R., Cotta, C., Kołodziej, J., Rudolph, G. (eds) Parallel Problem Solving from Nature, PPSN XI. PPSN 2010. Lecture Notes in Computer Science, vol 6238. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15844-5_36

Download citation

DOI: https://doi.org/10.1007/978-3-642-15844-5_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15843-8
Online ISBN: 978-3-642-15844-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics