Shaping fitness function for evolutionary learning of game strategies

Authors:

Marcin Szubert,

Wojciech Jaśkowski,

Paweł Liskowski,

Krzysztof Krawiec

GECCO '13: Proceedings of the 15th annual conference on Genetic and evolutionary computation

Pages 1149 - 1156

https://doi.org/10.1145/2463372.2463513

Published: 06 July 2013

Abstract

In evolutionary learning of game-playing strategies, fitness evaluation is based on playing games with certain opponents. In this paper we investigate how the performance of these opponents and the way they are chosen influence the efficiency of learning. For this purpose we introduce a simple method for shaping the fitness function by sampling the opponents from a biased performance distribution. We compare the shaped function with existing fitness evaluation approaches that sample the opponents from an unbiased performance distribution or from a coevolving population. In an extensive computational experiment we employ these methods to learn Othello strategies and assess both the absolute and relative performance of the elaborated players. The results demonstrate the superiority of the shaping approach, and can be explained by means of performance profiles, an analytical tool that evaluates the evolved strategies using a range of variably skilled opponents.
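
The abstract does not give the sampling procedure, so the following is only a minimal sketch of the general idea, under stated assumptions: each opponent in a pool carries a scalar performance value in [0, 1], and a weighting function biases which opponents are drawn for fitness evaluation. The names `sample_opponents`, `fitness`, `performance_profile`, `toy_play`, and the weighting scheme are hypothetical illustrations, not the authors' implementation, and the toy game stands in for Othello.

```python
import random


def sample_opponents(pool, n, weight_fn=None, rng=random):
    """Draw n opponents (with replacement) from pool.

    pool is a list of (opponent, performance) pairs. With weight_fn=None
    every opponent is equally likely (unbiased sampling). Otherwise
    weight_fn maps a performance value to a non-negative weight, which
    biases the draw toward the performance levels it favours.
    """
    if weight_fn is None:
        weights = [1.0] * len(pool)
    else:
        weights = [weight_fn(perf) for _, perf in pool]
    return rng.choices(pool, weights=weights, k=n)


def fitness(candidate, opponents, play):
    """Mean game outcome of candidate against the sampled opponents."""
    return sum(play(candidate, opp) for opp, _ in opponents) / len(opponents)


def performance_profile(candidate, pool, play, n_bins=5):
    """Average outcome against opponents grouped by performance.

    Assumes performance values lie in [0, 1] and uses equal-width bins.
    A bin with no opponents yields None.
    """
    totals = [0.0] * n_bins
    counts = [0] * n_bins
    for opp, perf in pool:
        b = min(int(perf * n_bins), n_bins - 1)
        totals[b] += play(candidate, opp)
        counts[b] += 1
    return [t / c if c else None for t, c in zip(totals, counts)]


def toy_play(candidate, opponent):
    """Hypothetical stand-in game: the larger number wins (1.0), ties score 0.5."""
    if candidate > opponent:
        return 1.0
    if candidate == opponent:
        return 0.5
    return 0.0
```

With a pool of numeric "strategies" whose performance equals their value, passing a `weight_fn` that is zero below some threshold restricts evaluation to stronger opponents, whereas `weight_fn=None` reproduces the unbiased case. The profile function then shows how a candidate fares against each performance band rather than as a single aggregate score.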

Index Terms

Shaping fitness function for evolutionary learning of game strategies
1. Applied computing
2. Computing methodologies
  1. Artificial intelligence
    1. Search methodologies
      1. Heuristic function construction
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Information & Contributors

Information

Published In

GECCO '13: Proceedings of the 15th annual conference on Genetic and evolutionary computation

July 2013

1672 pages

ISBN:9781450319638

DOI:10.1145/2463372

Editor: Christian Blum, IKERBASQUE and University of the Basque Country UPV/EHU, Spain

General Chair: Enrique Alba, University of Malaga, Spain

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Sponsors

SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

GECCO '13

Sponsor:

SIGEVO

GECCO '13: Genetic and Evolutionary Computation Conference

July 6 - 10, 2013

Amsterdam, The Netherlands

Acceptance Rates

GECCO '13 Paper Acceptance Rate 204 of 570 submissions, 36%;

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Cited By

Johnson C (2021). Solving the Rubik's cube with stepwise deep learning. Expert Systems 38:3. Online publication date: 24-Jan-2021.
https://doi.org/10.1111/exsy.12665
Elfeky E, Elsayed S, Marsh L, Essam D, Cochrane M, Sims B, Sarker R (2021). A Systematic Review of Coevolution in Real-Time Strategy Games. IEEE Access 9 (136647-136665). Online publication date: 2021.
https://doi.org/10.1109/ACCESS.2021.3115768
Krawiec K, Heywood M, Coello Coello C (2020). Solving complex problems with coevolutionary algorithms. Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion (832-858). Online publication date: 8-Jul-2020.
https://dl.acm.org/doi/10.1145/3377929.3389874
Krawiec K, Heywood M, Auger A, Stützle T (2019). Solving complex problems with coevolutionary algorithms. Proceedings of the Genetic and Evolutionary Computation Conference Companion (975-1001). Online publication date: 13-Jul-2019.
https://dl.acm.org/doi/10.1145/3319619.3323384
Johnson C (2019). Stepwise Evolutionary Learning Using Deep Learned Guidance Functions. Artificial Intelligence XXXVI (50-62). Online publication date: 19-Nov-2019.
https://doi.org/10.1007/978-3-030-34885-4_4
Krawiec K, Heywood M, Takadama K (2018). Solving complex problems with coevolutionary algorithms. Proceedings of the Genetic and Evolutionary Computation Conference Companion (880-906). Online publication date: 6-Jul-2018.
https://dl.acm.org/doi/10.1145/3205651.3207888
Liskowski P, Jaśkowski W, Bosman P (2017). Accelerating coevolution with adaptive matrix factorization. Proceedings of the Genetic and Evolutionary Computation Conference (457-464). Online publication date: 1-Jul-2017.
https://dl.acm.org/doi/10.1145/3071178.3071320
Krawiec K, Heywood M (2017). Solving complex problems with coevolutionary algorithms. Proceedings of the Genetic and Evolutionary Computation Conference Companion (782-806). Online publication date: 15-Jul-2017.
https://dl.acm.org/doi/10.1145/3067695.3067705
Krawiec K, Heywood M, Neumann F, Sutton A (2016). Solving Complex Problems with Coevolutionary Algorithms. Proceedings of the 2016 on Genetic and Evolutionary Computation Conference Companion (687-713). Online publication date: 20-Jul-2016.
https://dl.acm.org/doi/10.1145/2908961.2926989
Jaskowski W, Szubert M (2016). Coevolutionary CMA-ES for Knowledge-Free Learning of Game Position Evaluation. IEEE Transactions on Computational Intelligence and AI in Games 8:4 (389-401). Online publication date: Dec-2016.
https://doi.org/10.1109/TCIAIG.2015.2464711