Autonomous Shaping via Coevolutionary Selection of Training Experience

Szubert, Marcin; Krawiec, Krzysztof

doi:10.1007/978-3-642-32964-7_22

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7492)

Included in the following conference series:

International Conference on Parallel Problem Solving from Nature

Abstract

To acquire expert skills in a sequential decision-making domain that is too vast to be explored thoroughly, an intelligent agent has to be capable of inducing crucial knowledge from its most representative parts. One way to shape the learning process and guide the learner in the right direction is to select those parts that provide the best training experience. To realize this concept, we propose a shaping method that orchestrates the training by iteratively exposing the learner to subproblems generated autonomously from the original problem. The main novelty of the proposed approach consists in equating the learning process with a search in the space of subproblems and in employing a coevolutionary algorithm to perform this search. Each individual in the population encodes a sequence of subproblems, and it is evaluated by confronting the learner trained on that sequence with the learners shaped in the same way by other individuals. When applied to the game of Othello, temporal difference learning on the best subproblem sequence found yields substantially better players than learning on the entire problem at once.
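The evaluation loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the learner (a short list of floats), the subproblems (indices into that list), the training rule, the `confront` "game", and all parameter values are toy assumptions chosen only to keep the example small and runnable. What it does keep from the abstract is the structure: each individual is a sequence of subproblems, a fresh learner is trained on that sequence, and the individual's fitness comes from confronting its learner with the learners trained by the other individuals.

```python
import random

# Toy stand-ins (assumptions, not from the paper): a learner is a list of
# floats, a subproblem is an index into it, and training on a subproblem
# nudges that coordinate toward a hidden target value.
TARGET = [1.0, -1.0, 0.5, 0.0]
N_SUBPROBLEMS = len(TARGET)


def train(sequence, rate=0.5):
    """Train a fresh learner by exposing it to each subproblem in order."""
    learner = [0.0] * N_SUBPROBLEMS
    for s in sequence:
        learner[s] += rate * (TARGET[s] - learner[s])
    return learner


def error(learner):
    """Squared distance of a learner from the hidden target."""
    return sum((w - t) ** 2 for w, t in zip(learner, TARGET))


def confront(a, b):
    """Toy confrontation: 1.0 if a beats b, 0.0 if it loses, 0.5 on a tie."""
    ea, eb = error(a), error(b)
    return 1.0 if ea < eb else 0.0 if ea > eb else 0.5


def evaluate(population):
    """Fitness of each sequence = round-robin score of the learner it shapes."""
    learners = [train(ind) for ind in population]
    return [
        sum(confront(a, b) for j, b in enumerate(learners) if j != i)
        for i, a in enumerate(learners)
    ]


def mutate(sequence, rng):
    """Replace one randomly chosen subproblem in the sequence."""
    child = list(sequence)
    child[rng.randrange(len(child))] = rng.randrange(N_SUBPROBLEMS)
    return child


def coevolve(pop_size=10, seq_len=6, generations=20, seed=0):
    """Search the space of subproblem sequences; return the best one found."""
    rng = random.Random(seed)
    population = [
        [rng.randrange(N_SUBPROBLEMS) for _ in range(seq_len)]
        for _ in range(pop_size)
    ]
    for _ in range(generations):
        fitness = evaluate(population)
        ranked = [ind for _, ind in sorted(
            zip(fitness, population), key=lambda p: p[0], reverse=True)]
        parents = ranked[: pop_size // 2]  # truncation selection
        offspring = [mutate(rng.choice(parents), rng)
                     for _ in range(pop_size - len(parents))]
        population = parents + offspring
    fitness = evaluate(population)
    best = max(range(pop_size), key=fitness.__getitem__)
    return population[best]


if __name__ == "__main__":
    best_sequence = coevolve()
    print(best_sequence, error(train(best_sequence)))
```

In the setting of the paper, `train` would correspond to temporal difference learning on the encoded subproblems and `confront` to games of Othello between the resulting players; both are replaced here by trivial placeholders, so the sketch shows only how fitness depends on the other members of the population.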

Author information

Authors and Affiliations

Institute of Computing Science, Poznan University of Technology, Poznań, Poland
Marcin Szubert & Krzysztof Krawiec

Editor information

Editors and Affiliations

Departamento de Computación, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Av. IPN No. 2508, Col. San Pedro Zacatenco, 0360, Mexico, D.F., Mexico
Carlos A. Coello Coello
Department of Mathematics and Computer Science, University of Catania, V.le A. Doria 6, 95125, Catania, Italy
Vincenzo Cutello & Mario Pavone
Kanpur Genetic Algorithms Laboratory (KanGAL), Indian Institute of Technology, Kanpur, Kanpur, India
Kalyanmoy Deb
Department of Computer Science, University of New Mexico, USA
Stephanie Forrest
Department of Mathematics and Computer Science, University of Catania, Viale A. Doria 6, 95125, Catania, Italy
Giuseppe Nicosia

About this paper

Cite this paper

Szubert, M., Krawiec, K. (2012). Autonomous Shaping via Coevolutionary Selection of Training Experience. In: Coello, C.A.C., Cutello, V., Deb, K., Forrest, S., Nicosia, G., Pavone, M. (eds) Parallel Problem Solving from Nature - PPSN XII. PPSN 2012. Lecture Notes in Computer Science, vol 7492. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32964-7_22

DOI: https://doi.org/10.1007/978-3-642-32964-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32963-0
Online ISBN: 978-3-642-32964-7
eBook Packages: Computer Science, Computer Science (R0)
