skip to main content
10.1145/3071178.3071320acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Accelerating coevolution with adaptive matrix factorization

Authors Info & Claims
Published:01 July 2017Publication History

ABSTRACT

Among many interaction schemes in coevolutionary settings for interactive domains, the round-robin tournament provides the most precise evaluation of candidate solutions at the expense of computational effort. In order to improve the coevolutionary learning speed, we propose an interaction scheme that computes only a fraction of interactions outcomes between the pairs of coevolving individuals. The missing outcomes in the interaction matrix are predicted using matrix factorization. The algorithm adaptively decides how much of the interaction matrix to compute based on the learning speed statistics. We evaluate our method in the context of coevolutionary covariance matrix adaptation strategy (CoCMAES) for the problem of learning position evaluation in the game of Othello. We show that our adaptive interaction scheme allows to match the state-of-the-art results obtained by the standard round-robin CoCMAES while, at the same time, considerably improves the learning speed.

References

  1. P. J. Angeline and J. B. Pollack. Competitive Environments Evolve Better Solutions for Complex Tasks. In Proceedings of the 5th International Conference on Genetic Algorithms, pages 264--270, San Francisco, CA, USA, 1993. Morgan Kaufmann Publishers Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. W. Berry, M. Browne, A. N. Langville, V. P. Pauca, and R. J. Plemmons. Algorithms and applications for approximate nonnegative matrix factorization. Computational statistics & data analysis, 52(1):155--173, 2007.Google ScholarGoogle Scholar
  3. W. W. Bledsoe and I. Browning. Pattern recognition and reading by machine. In Proc. Eastern Joint Comput. Conf., pages 225--232, 1959. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Buro. Experiments with Multi-ProbCut and a new high-quality evaluation function for Othello. In H. J. van den Herik et al., editor, Games in AI Research, pages 77--96. Univ. Maastricht, 2000.Google ScholarGoogle Scholar
  5. S. Y. Chong, P. Tino, D. C. Ku, and Y. Xin. Improving Generalization Performance in Co-Evolutionary Learning. IEEE Transactions on Evolutionary Computation, 16(1):70--85, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Y. Chong, P. Tino, D. C. Ku, and X. Yao. Improving Generalization Performance in Co-Evolutionary Learning. IEEE Transactions on Evolutionary Computation, 16(1):70--85, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan. A fast and elitist multiobjective genetic algorithm: Nsga-ii. IEEE transactions on evolutionary computation, 6(2):182--197, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. G. Ficici. Solution concepts in coevolutionary algorithms. PhD thesis, Brandeis University, Waltham, MA, USA, 2004. Adviser-Pollack, Jordan B. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. N. Hansen. The CMA evolution strategy: a comparing review. In J. Lozano, P. Larranaga, I. Inza, and E. Bengoetxea, editors, Towards a new evolutionary computation. Advances on estimation of distribution algorithms, pages 75--102. Springer, 2006.Google ScholarGoogle Scholar
  10. W. Jaśkowski. Algorithms for Test-Based Problems. PhD thesis, Institute of Computing Science, Poznan University of Technology, Poznań, Poland, 2011. Adviser: Krzysztof Krawiec.Google ScholarGoogle Scholar
  11. W. Jaśkowski. Systematic n-tuple networks for othello position evaluation. ICGA Journal, 37(2):85--96, June 2014.Google ScholarGoogle ScholarCross RefCross Ref
  12. W. Jaśkowski. Systematic n-tuple networks for position evaluation: Exceeding 90% in the othello league. Technical Report RA-06/2014, arXiv:1406.1509, Institute of Computing Science, Poznan University of Technology, Poznań, Poland, 2014.Google ScholarGoogle Scholar
  13. W. Jaśkowski, K. Krawiec, and B. Wieloch. Evolving strategy for a probabilistic game of imperfect information using genetic programming. Genetic Programming and Evolvable Machines, 9(4):281--294, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. W. Jaśkowski, P. Liskowski, M. Szubert, and K. Krawiec. Performance profile: a multi-criteria performance evaluation method for test-based problems. International Journal of Applied Mathematics and Computer Science, 26(1):215--229, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  15. W. Jaśkowski, P. Liskowski, M. G. Szubert, and K. Krawiec. Improving Coevolution by Random Sampling. In Proceeding of the Fifteenth Annual Conference on Genetic and Evolutionary Computation Conference, GECCO '13, pages 1141--1148, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. W. Jaśkowski and M. Szubert. Coevolutionary CMA-ES for knowledge-free learning of game position evaluation. IEEE Transactions on Computational Intelligence and AI in Games, 8(4):389--401, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  17. W. Jaśkowski, M. Szubert, and P. Liskowski. Multi-criteria comparison of co-evolution and temporal difference learning on othello. In A. I. Espareia-Alcazar and A. M. Mora, editors, EvoApplications 2014, volume 8602 of Lecture Notes in Computer Science, pages 301--312. Springer, 2014.Google ScholarGoogle Scholar
  18. W. Jaśkowski, M. Szubert, P. Liskowski, and K. Krawiec. High-dimensional function approximation for knowledge-free reinforcement learning: a case study in SZ-Tetris. In GECCO'15: Proceedings of the 17th annual conference on Genetic and Evolutionary Computation, pages 567--574, Mardid, Spain, July 2015. ACM, ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. W. Jaśkowski, B. Wieloch, and K. Krawiec. Fitnessless coevolution. In M. Keijzer, editor, GECCO '08: Proceedings of the 10th annual conference on Genetic and evolutionary computation, pages 355--362, Atlanta, GA, USA, Jul 2008. Association for Computing Machinery, Association for Computing Machinery. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Y. Jin, M. Olhofer, and B. Sendhoff. A framework for evolutionary optimization with approximate fitness functions. IEEE Transactions on Evolutionary Computation, 6:481--494, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, 42(8), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. K. Krawiec and P. Liskowski. Automatic derivation of search objectives for test-based genetic programming. In European Conference on Genetic Programming, pages 53--65. Springer, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  23. D. D. Lee and H. S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755):788--791, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  24. P. Liskowski and K. Krawiec. Discovery of implicit objectives by compression of interaction matrix in test-based problems. In Parallel Problem Solving from Nature-PPSN XIII, pages 611--620. Springer, 2014.Google ScholarGoogle Scholar
  25. P. Liskowski and K. Krawiec. Non-negative matrix factorization for unsupervised derivation of search objectives in genetic programming. In Proceedings of the 2016 on Genetic and Evolutionary Computation Conference, pages 749--756. ACM, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. P. Liskowski and K. Krawiec. Online Discovery of Search Objectives for Test-based Problems. Evolutionary Computation, mar 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. P. Liskowski and K. Krawiec. Surrogate fitness via factorization of interaction matrix. In European Conference on Genetic Programming, pages 68--82. Springer, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  28. S. M. Lucas. Learning to play Othello with N-tuple systems. Australian Journal of Intelligent Information Processing Systems, Special Issue on Game Technology, 9(4):01--20, 2007.Google ScholarGoogle Scholar
  29. S. Luke and R. P. Wiegand. Guaranteeing coevolutionary objective measures. In K. A. de Jong, R. Poli, and J. E. Rowe, editors, Foundations of Genetic Algorithms VII, pages 237--251, Torremolinos, Spain, 2002. Morgan Kaufman.Google ScholarGoogle Scholar
  30. E. P. Manning. Using Resource-Limited Nash Memory to Improve an Othello Evaluation Function. IEEE Transactions on Computational Intelligence and AI in Games, 2(1):40--53, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  31. L. Panait and S. Luke. A comparison of two competitive fitness functions. In GECCO '02: Proceedings of the Genetic and Evolutionary Computation Conference, pages 503--511, San Francisco, CA, USA, 2002. Morgan Kaufmann Publishers Inc.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. V. P. Pauca, F. Shahnaz, M. W. Berry, and R. J. Plemmons. Text mining using non-negative matrix factorizations. In Proceedings of the 2004 SIAM International Conference on Data Mining, pages 452--456. SIAM, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  33. E. Popovici, A. Bucci, R. P. Wiegand, and E. D. de Jong. Coevolutionary Principles. In G. Rozenberg, T. Bäck, and J. N. Kok, editors, Handbook of Natural Computing, pages 987--1033. Springer, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  34. C. Reynolds. Competition, coevolution and the game of tag. In R. A. Brooks and P. Maes, editors, Artificial Life IV, Proceedings of the fourth International Workshop on the Synthesis and Simulation of Living Systems, pages 59--69, MIT, Cambridge, MA, USA, 1994. MIT Press.Google ScholarGoogle Scholar
  35. T. Runarsson and S. Lucas. Preference Learning for Move Prediction and Evaluation Function Approximation in Othello. Computational Intelligence and AI in Games, IEEE Transactions on, 6(3):300--313, 2014.Google ScholarGoogle Scholar
  36. I. E. Skoulakis and M. G. Lagoudakis. Efficient Reinforcement Learning in Adversarial Games. In 2012 IEEE 24th International Conference on Tools with Artificial Intelligence, pages 704--711. IEEE, Nov. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. M. Szubert and W. Jaśkowski. Temporal difference learning of n-tuple networks for the game 2048. In Proceedings of the IEEE Conference on Computational Intelligence and Games, pages 1--8. IEEE, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  38. M. Szubert, W. Jaśkowski, and K. Krawiec. On scalability, generalization, and hybridization of coevolutionary learning: a case study for othello. IEEE Transactions on Computational Intelligence and AI in Games, 5(3):214--226, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  39. M. Szubert, W Jaśkowski, P. Liskowski, and K. Krawiec. Shaping Fitness Function for Evolutionary Learning of Game Strategies. In Proceedings of the 15th Annual Conference on Genetic and Evolutionary Computation, GECCO '13, pages 1149--1156, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. M. Thill, P. Koch, and W. Konen. Reinforcement Learning with N-tuples on the Game Connect-4. In Proc. of the 12th International Conference on Parallel Problem Solving from Nature, pages 184--194, Berlin, Heidelberg, 2012. Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. S. van den Dries and M. A. Wiering. Neural-Fitted TD-Leaf Learning for Playing Othello With Structured Neural Networks. IEEE Transactions on Neural Networks and Learning Systems, 23(11):1701--1713, Nov. 2012.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Accelerating coevolution with adaptive matrix factorization

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference
      July 2017
      1427 pages
      ISBN:9781450349208
      DOI:10.1145/3071178

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 July 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      GECCO '17 Paper Acceptance Rate178of462submissions,39%Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader