Abstract
This paper describes the application of a model of operant conditioning to the problem of obstacle avoidance with a wheeled mobile robot. The main characteristic of the applied model is that the robot learns to avoid obstacles through a learning-by-doing cycle without external supervision. A series of ultrasonic sensors act as Conditioned Stimuli (CS), while collisions act as an Unconditioned Stimulus (UCS). By experiencing a series of movements in a cluttered environment, the robot learns to avoid sensor activation patterns that predict collisions, thereby learning to avoid obstacles. Learning generalizes to arbitrary cluttered environments. In this work we describe our initial implementation using a computer simulation.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Grossberg, S. (1971). On the dynamics of operant conditioning. Journal of Theoretical Biology, 33, 225–255.
Grossberg, S. (1982). A psychophysiological theory of reinforcement, drive, motivation and attention. Journal of Theoretical Neurobiology, 1, 286–369.
Grossberg, S., & Levine, D. (1987). Neural dynamics of attentionally modulated Pavlovian conditioning: blocking, interstimulus interval, and secondary reinforcement. Applied Optics, 26, 5015–5030.
Grossberg, S. (1973). Contour enhancement, short-term memory, and constancies in reverberating neural networks. Studies in Applied Mathematics, 52, 217–257.
Grossberg, S. (Ed.). (1982). Studies of Mind and Brain: neural principles of learning, perception, development, cognition and motor control. Reidel, Boston.
Grossberg, S. (Ed.). (1986). The Adaptive Brain I: Cognition, Learning, Reinforcement, and Rhythm. Elsevier/North-Holland, Amsterdam.
Rescorla, R. A., & Wagner, A. R. (1972). A theory of pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In Black, A. H., & Prokasy, W. F. (Eds.), Classical Conditioning II, chap. 3, pp. 64–99. Appleton, New York.
Sutton, R. S., & Barto, A. G. (1981). Toward a modem theory of adaptive networks: Expectation and prediction. Psychological Review, 88, 135–170.
Zalama, E., Gaudiano, P., & López-Coronado, J. (1995). A real-time, unsupervised neural network for the low-level control of a mobile robot in a nonstationary environment. Neural Networks, 8, 103–123.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zalama, E., Gaudiano, P., Coronado, J.L. (1995). Obstacle avoidance by means of an operant conditioning model. In: Mira, J., Sandoval, F. (eds) From Natural to Artificial Neural Computation. IWANN 1995. Lecture Notes in Computer Science, vol 930. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-59497-3_211
Download citation
DOI: https://doi.org/10.1007/3-540-59497-3_211
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-59497-0
Online ISBN: 978-3-540-49288-7
eBook Packages: Springer Book Archive