ABSTRACT
In this paper, we investigate the use of nested evolution in which each step of one evolutionary process involves running a second evolutionary process. We apply this approach to build an evolutionary system for reinforcement learning (RL) problems. Genetic programming based on a descriptive encoding is used to evolve the neural architecture, while an evolution strategy is used to evolve the connection weights. We test this method on a non-Markovian RL problem involving an autonomous foraging agent, finding that the evolved networks significantly outperform a rule-based agent serving as a control. We also demonstrate that nested evolution, partitioning into subpopulations, and crossover operations all act synergistically in improving performance in this context.
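The nested scheme described above can be sketched in miniature. This is an illustrative toy only, not the paper's system: the paper evolves architectures with genetic programming over a descriptive encoding, whereas here an "architecture" is just a hypothetical hidden-unit count, the outer loop uses simple truncation selection, and the inner loop is a basic (1+1)-evolution strategy on the weight vector. The point is the structure: each outer fitness evaluation runs a complete inner evolutionary process.

```python
import random

def inner_es(weights, fitness, generations=20, sigma=0.1):
    """Inner loop: (1+1)-ES on a fixed architecture's weight vector."""
    best, best_fit = list(weights), fitness(weights)
    for _ in range(generations):
        # Mutate every weight with Gaussian noise; keep the child if no worse.
        child = [w + random.gauss(0.0, sigma) for w in best]
        f = fitness(child)
        if f >= best_fit:  # maximizing fitness
            best, best_fit = child, f
    return best, best_fit

def mutate(arch):
    """Toy architecture mutation: perturb the hidden-unit count."""
    return max(1, arch + random.choice([-1, 1]))

def nested_evolution(init_architectures, weight_count, fitness,
                     outer_generations=10):
    """Outer loop over architectures; evaluating each one runs an inner ES."""
    population = list(init_architectures)
    for _ in range(outer_generations):
        scored = []
        for arch in population:
            # Fresh random weights, then an entire inner evolutionary run.
            w0 = [random.gauss(0.0, 1.0) for _ in range(weight_count)]
            _, fit = inner_es(w0, lambda w: fitness(arch, w))
            scored.append((fit, arch))
        # Truncation selection: keep the top half, refill with mutated copies.
        scored.sort(key=lambda t: t[0], reverse=True)
        survivors = [a for _, a in scored[: len(scored) // 2]]
        population = survivors + [mutate(a) for a in survivors]
    return scored[0]  # (best fitness, best architecture)
```

Because every outer evaluation pays for a full inner run, nested evolution trades wall-clock cost for a cleaner separation of concerns: the outer process searches structure while the inner process adapts parameters to whatever structure it is handed.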
REFERENCES
- H. G. Beyer. The Theory of Evolution Strategies. Springer, Berlin, 2001.
- H. G. Beyer and H. P. Schwefel. Evolution strategies: A comprehensive introduction. Natural Computing, 1:3--52, 2002.
- W. H. Hsu and S. M. Gustafson. Genetic programming and multi-agent layered learning by reinforcements. In Proceedings of the Genetic and Evolutionary Computation Conference, pages 764--771, 2002.
- C. Igel. Neuroevolution for reinforcement learning using evolution strategies. In Proceedings of the Congress on Evolutionary Computation (CEC), volume 4, pages 2588--2595, IEEE Press, 2003.
- T. Jansen and I. Wegener. On the local performance of simulated annealing and the (1+1) evolutionary algorithm. In Proceedings of the 8th Annual Conference on Genetic and Evolutionary Computation, pages 469--476, 2006.
- J. Y. Jung and J. A. Reggia. Evolutionary design of neural network architectures using a descriptive encoding language. IEEE Transactions on Evolutionary Computation, 10(6):676--688, Dec. 2006.
- J. Y. Jung and J. A. Reggia. Nested evolution of an autonomous agent using descriptive encoding. In Proceedings of the 10th Annual Conference on Genetic and Evolutionary Computation, pages 285--286, 2008.
- Y. Kassahun and G. Sommer. Efficient reinforcement learning through evolutionary acquisition of neural topologies. In European Symposium on Artificial Neural Networks (ESANN), pages 259--266, d-side, 2005.
- W. N. Martin, J. Lienig, and J. P. Cohoon. Island (migration) models: evolutionary algorithms based on punctuated equilibria. In T. Bäck et al., editors, Handbook of Evolutionary Computation, pages 101--124, Institute of Physics Publishing and Oxford University Press, 1997.
- J. H. Metzen, M. Edgington, Y. Kassahun, and F. Kirchner. Analysis of an evolutionary reinforcement learning method in a multiagent domain. In Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, pages 291--298, 2008.
- J. A. Reggia, S. Goodall, Y. Shkuro, and M. Glezer. The callosal dilemma: Explaining diaschisis in the context of hemispheric rivalry via a neural network model. Neurological Research, 23:465--471, 2001.
- J. A. Reggia, R. Schulz, G. Wilkinson, and J. Uriagereka. Conditions enabling the evolution of inter-agent signaling in an artificial world. Artificial Life, 7(1):3--32, 2001.
- E. Ruppin. Evolutionary autonomous agents: A neuroscience perspective. Nature Reviews Neuroscience, 3(2):132--141, 2002.
- F. Saibene and A. E. Minetti. Biomechanical and physiological aspects of legged locomotion in humans. European Journal of Applied Physiology, 88(4):297--316, 2003.
- N. Siebel and G. Sommer. Evolutionary reinforcement learning of artificial neural networks. International Journal of Hybrid Intelligent Systems, 4(3):171--183, 2007.
- K. O. Stanley and R. Miikkulainen. Evolving neural networks through augmenting topologies. Evolutionary Computation, 10(2):99--127, 2002.
- P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165--188, 2005.
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- I. Szita and A. Lörincz. Learning Tetris using the noisy cross-entropy method. Neural Computation, 18(12):2936--2941, 2006.
- M. E. Taylor, S. Whiteson, and P. Stone. Comparing evolutionary and temporal difference methods in a reinforcement learning domain. In Proceedings of the 8th Annual Conference on Genetic and Evolutionary Computation, pages 1321--1328, 2006.
- S. Whiteson, M. E. Taylor, and P. Stone. Empirical studies in action selection for reinforcement learning. Adaptive Behavior, 15(1):33--50, 2007.
- X. Yao. Evolving artificial neural networks. Proceedings of the IEEE, 87(9):1423--1447, Sept. 1999.
- X. Yao and Y. Liu. Evolutionary artificial neural networks that learn and generalize well. In Proceedings of the 1996 IEEE International Conference on Neural Networks, pages 159--164, 1996.
Index Terms
- Evolving an autonomous agent for non-Markovian reinforcement learning