Evolving Neural Networks for Online Reinforcement Learning

Metzen, Jan Hendrik; Edgington, Mark; Kassahun, Yohannes; Kirchner, Frank

doi:10.1007/978-3-540-87700-4_52


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5199)

Included in the following conference series:

International Conference on Parallel Problem Solving from Nature


Abstract

For many complex Reinforcement Learning problems with large and continuous state spaces, neuroevolution (the evolution of artificial neural networks) has achieved promising results. This is especially true when there is noise in sensor and/or actuator signals. These results have mainly been obtained in offline learning settings, where the training and evaluation phases of the system are separated. In contrast, in online Reinforcement Learning tasks, where the actual performance of the system during its learning phase matters, the results of neuroevolution are significantly impaired by its purely exploratory nature: it does not use (i.e. exploit) its knowledge of the performance of single individuals to improve its performance during learning. In this paper we describe modifications which significantly improve the online performance of the neuroevolutionary method Evolutionary Acquisition of Neural Topologies (EANT) and discuss the results obtained on two benchmark problems.
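
The exploration/exploitation distinction drawn in the abstract can be illustrated with a small sketch. This is not the modification the paper makes to EANT, which this page does not describe. It is a generic epsilon-greedy rule for choosing which individual of a population to evaluate in the next episode. The names `select_individual`, `run_generation`, `true_means`, and `epsilon`, as well as the Gaussian reward model, are assumptions made purely for illustration.

```python
import random


def select_individual(avg_fitness, counts, epsilon, rng):
    """Pick the index of the individual to evaluate in the next episode."""
    # Evaluate every individual at least once before exploiting anything.
    untried = [i for i, c in enumerate(counts) if c == 0]
    if untried:
        return untried[0]
    # Explore: with probability epsilon pick an individual uniformly at random.
    if rng.random() < epsilon:
        return rng.randrange(len(counts))
    # Exploit: otherwise pick the individual with the best average so far.
    return max(range(len(counts)), key=lambda i: avg_fitness[i])


def run_generation(true_means, episodes, epsilon, seed=0):
    """Simulate one generation; return (mean online reward, evaluation counts).

    true_means is a hypothetical stand-in for each individual's expected
    episode reward; each observed reward is that mean plus Gaussian noise.
    """
    rng = random.Random(seed)
    n = len(true_means)
    counts = [0] * n
    avg_fitness = [0.0] * n
    total_reward = 0.0
    for _ in range(episodes):
        i = select_individual(avg_fitness, counts, epsilon, rng)
        reward = rng.gauss(true_means[i], 1.0)
        counts[i] += 1
        # Incremental update of the running average fitness estimate.
        avg_fitness[i] += (reward - avg_fitness[i]) / counts[i]
        total_reward += reward
    return total_reward / episodes, counts


if __name__ == "__main__":
    means = [0.0, 1.0, 2.0, 3.0, 10.0]
    explore_avg, _ = run_generation(means, episodes=2000, epsilon=1.0)
    greedy_avg, _ = run_generation(means, episodes=2000, epsilon=0.1)
    print("purely exploratory:", round(explore_avg, 2))
    print("epsilon-greedy:    ", round(greedy_avg, 2))
```

With `epsilon=1.0` every episode is assigned at random, which corresponds to the purely exploratory behaviour the abstract criticizes. A smaller `epsilon` spends most episodes on the individual that currently looks best, so the reward collected during learning is higher, at the cost of less accurate fitness estimates for the remaining individuals.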



Author information

Authors and Affiliations

Robotics Lab, German Research Center for Artificial Intelligence (DFKI), Robert-Hooke-Str. 5, D-28359, Bremen, Germany
Jan Hendrik Metzen & Frank Kirchner
Robotics Group, University of Bremen, Robert-Hooke-Str. 5, D-28359, Bremen, Germany
Mark Edgington, Yohannes Kassahun & Frank Kirchner


Editor information

Editors and Affiliations

Fakultät für Informatik, Technische Universität Dortmund, 44221, Dortmund, Germany
Günter Rudolph, Thomas Jansen & Nicola Beume
Department of Computing and Electronic Systems, University of Essex, CO4 3SQ, Colchester, Essex, UK
Simon Lucas
Dipartimento di Ingegneria Meccanica, Università degli Studi di Trieste, 34127, Trieste, Italy
Carlo Poloni


About this paper

Cite this paper

Metzen, J.H., Edgington, M., Kassahun, Y., Kirchner, F. (2008). Evolving Neural Networks for Online Reinforcement Learning. In: Rudolph, G., Jansen, T., Beume, N., Lucas, S., Poloni, C. (eds) Parallel Problem Solving from Nature – PPSN X. PPSN 2008. Lecture Notes in Computer Science, vol 5199. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87700-4_52


DOI: https://doi.org/10.1007/978-3-540-87700-4_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87699-1
Online ISBN: 978-3-540-87700-4
eBook Packages: Computer Science, Computer Science (R0)
