Feed-Forward Learning: Fast Reinforcement Learning of Controllers

Musial, Marek; Lemke, Frank

doi:10.1007/978-3-540-73055-2_30

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4528)

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

Abstract

Reinforcement Learning (RL) approaches are, very often, rendered useless by the statistics of the required sampling process. This paper shows how very fast RL is essentially made possible by abandoning the state feedback during training episodes. The resulting new method, feed-forward learning (FF learning), employs a return estimator for pairs of a state and a feed-forward policy’s parameter vector. FF learning is particularly suitable for the learning of controllers, e.g. for robotics applications, and yields learning rates unprecedented in the RL context.

This paper introduces the method formally and proves a lower bound on its performance. Practical results are provided from applying FF learning to several scenarios based on the collision avoidance behavior of a mobile robot.
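
The idea in the abstract, a return estimator over pairs of a state and a feed-forward policy's parameter vector, can be illustrated with a toy sketch. The following is not the authors' algorithm: the one-dimensional environment, the grid-based running-mean estimator, and every name in it (`run_episode`, `ReturnEstimator`, `best_theta`, `train`, `BINS`) are assumptions made purely for illustration.

```python
import random

BINS = 10  # grid resolution of the toy estimator (an assumption)


def run_episode(state, theta):
    """Toy open-loop episode (hypothetical environment).

    The robot starts `state` units away from a wall and drives a fixed
    distance `theta` without consulting its sensors during the episode.
    The return is the distance covered, or -1.0 if it hits the wall.
    """
    return theta if theta <= state else -1.0


def bin_index(x):
    """Map a value in [0, 1] to a grid cell index."""
    return min(int(x * BINS), BINS - 1)


class ReturnEstimator:
    """Running-mean table approximating the return of (state, theta) pairs."""

    def __init__(self):
        self.mean = [[0.0] * BINS for _ in range(BINS)]
        self.count = [[0] * BINS for _ in range(BINS)]

    def update(self, state, theta, ret):
        i, j = bin_index(state), bin_index(theta)
        self.count[i][j] += 1
        # Incremental mean of the returns observed in this cell.
        self.mean[i][j] += (ret - self.mean[i][j]) / self.count[i][j]

    def best_theta(self, state):
        """Return the centre of the theta cell with the highest estimate."""
        i = bin_index(state)
        visited = [j for j in range(BINS) if self.count[i][j] > 0]
        j = max(visited, key=lambda k: self.mean[i][k])
        return (j + 0.5) / BINS


def train(episodes=5000, seed=0):
    """Sample random (state, theta) pairs and fit the estimator."""
    rng = random.Random(seed)
    estimator = ReturnEstimator()
    for _ in range(episodes):
        state, theta = rng.random(), rng.random()
        estimator.update(state, theta, run_episode(state, theta))
    return estimator
```

After `estimator = train()`, calling `estimator.best_theta(0.85)` selects a parameter for that start state by maximizing the estimated return. In this toy setting the selected drive distance should stay below the distance to the wall and grow with it.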

This work was conducted within the NeuRoBot project, funded by the German Research Foundation (DFG).

Author information

Authors and Affiliations

Real-Time Systems and Robotics (PDV), Technische Universität Berlin
Marek Musial & Frank Lemke

Editor information

José Mira, José R. Álvarez

About this paper

Cite this paper

Musial, M., Lemke, F. (2007). Feed-Forward Learning: Fast Reinforcement Learning of Controllers. In: Mira, J., Álvarez, J.R. (eds) Nature Inspired Problem-Solving Methods in Knowledge Engineering. IWINAC 2007. Lecture Notes in Computer Science, vol 4528. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73055-2_30

DOI: https://doi.org/10.1007/978-3-540-73055-2_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73054-5
Online ISBN: 978-3-540-73055-2
eBook Packages: Computer Science (R0)
