Abstract
This research focuses on vision guided autonomous steering of a wheeled vehicle and tries to implement elementary recognition and learning abilities. While other researchers mainly focussed on using neural networks (e.g. [1], [2], [3], [4]) or explicit modelling of vehicle and environment (e.g. [5],[6],[7]) we established a system which classifies the video information and the vehicle behaviour into patterns and uses a very quick Pattern Matching Algorithm to decide on the required interactions with the environment (i.e. issuance of steering commands ) in order to autonomously steer a vehicle
Within our research, such capabilities of driving by Pattern Matching got successfully implemented but the quality of the driving behaviour is strongly dependant on knowledge on how to react in certain driving situations. Any feedback on the appropriateness of the driving behaviour is delayed and unspecific in relation to single issued steering commands.
Therefore, a further central element in this research is a machine learning algorithm learning by reinforcement based on noisy and delayed rewards. An initial Reinforcement Learning algorithm (e.g. [8], [9]) has been implemented which shows very promising results for creating a system which autonomously steers a car purely based on visual information and even self-improves driving behaviour over time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Pommerleau, D.A.: Efficient Training of Artificial Neural Networks for Autonomous Navigation, Neural Computation 3, 1991
Jochem, T.M., Pomerleau, D.A., Thorpe, C.E.: MANIAC: A Next Generation Neurally Based Autonomous Road Follower, IAS-3, Int. Conference on Intelligent autonomous Systems, February 15–18, 1993, Pittsburgh/PA, USA, F.C.A. Groen, S. Hirose, C.E. Thorpe (eds), IOS Press, Washington, Oxford, Amsterdam, Tokyo, 1993
Jochem, T.M., Pomerleau, D.A., Thorpe, C.E.: Vision Guided Lane Transition, Intelligent Vehicles’ 95 Symposium, September 25–26, 1995, Detroit/MI, USA
Baluja, S., Pomerleau, D.A.: Expectation-based selective attention for visual monitoring and control of a robot vehicle, Robotics and Autonomous System, Vol.22, No.3–4, December, 1997
Dickmanns, E.D., Zapp, A.: Autonomous High Speed Road Vehicle Guidance by Computer Vision, Preprints of the 10th World Congress on Automatic Control, Vol.4, International Federation of Automatic Control, Munich, Germany, July 27–31, 1987
Kuhnert, K.-D.: A Vision System for Real Time Road and Object Recognition for Vehicle Guidance, Proc. Mobile Robots, Oct 30–31, 1986, Cambridge, Massachusetts, Society of Photo-Optical Instrumentation Engineers, SPIE Volume 727
Dickmanns, E.D., Behringer, R., Hildebrandt, T., Maurer, M., Thomanek, F., Schiehlen, J.: The Seeing Passenger Car ‘VaMoRs-P’, Intelligent Vehicles’ 94 Symposium, October 24–26, 1994, Paris, France
Sutton, R., Reinforcement Learning: An introduction, MIT-Press, 1998, Cambridge (USA)
Baird III, L., Reinforcement Learning through Gradient Descent, Dissertation, Carnegie Mellon University, 1999 Pittsburgh, USA
Krödel, M., Kuhnert K.-D.: Towards a Learning Autonomous Driver System, IEEE International Conference on Industrial Electronics, Control and Instrumentation, October 22–28, 2000, Nagoya, Japan
Krödel, M., Kuhnert, K.-D.: Autonomous Driving through Intelligent Image Processing and Machine Learning, Int’l Conference on Computational Intelligence, October 1–3, Dortmund, Germany
Mount, D. M.: ANN Programming Manual, Department of Computer Science and Institute for Advance Computer Studies, University of Maryland, 1998
Langenhangen, J.: Nearly nearest Neighbour search with principal components, Student Thesis, Siegen, 2001
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Krödel, M., Kuhnert, KD. (2002). Reinforcement Learning to Drive a Car by Pattern Matching. In: Van Gool, L. (eds) Pattern Recognition. DAGM 2002. Lecture Notes in Computer Science, vol 2449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45783-6_39
Download citation
DOI: https://doi.org/10.1007/3-540-45783-6_39
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44209-7
Online ISBN: 978-3-540-45783-1
eBook Packages: Springer Book Archive