Skip to main content

Obstacle avoidance by means of an operant conditioning model

  • Learning
  • Conference paper
  • First Online:
From Natural to Artificial Neural Computation (IWANN 1995)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 930))

Included in the following conference series:

  • 812 Accesses

Abstract

This paper describes the application of a model of operant conditioning to the problem of obstacle avoidance with a wheeled mobile robot. The main characteristic of the applied model is that the robot learns to avoid obstacles through a learning-by-doing cycle without external supervision. A series of ultrasonic sensors act as Conditioned Stimuli (CS), while collisions act as an Unconditioned Stimulus (UCS). By experiencing a series of movements in a cluttered environment, the robot learns to avoid sensor activation patterns that predict collisions, thereby learning to avoid obstacles. Learning generalizes to arbitrary cluttered environments. In this work we describe our initial implementation using a computer simulation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  • Grossberg, S. (1971). On the dynamics of operant conditioning. Journal of Theoretical Biology, 33, 225–255.

    PubMed  Google Scholar 

  • Grossberg, S. (1982). A psychophysiological theory of reinforcement, drive, motivation and attention. Journal of Theoretical Neurobiology, 1, 286–369.

    Google Scholar 

  • Grossberg, S., & Levine, D. (1987). Neural dynamics of attentionally modulated Pavlovian conditioning: blocking, interstimulus interval, and secondary reinforcement. Applied Optics, 26, 5015–5030.

    Google Scholar 

  • Grossberg, S. (1973). Contour enhancement, short-term memory, and constancies in reverberating neural networks. Studies in Applied Mathematics, 52, 217–257.

    Google Scholar 

  • Grossberg, S. (Ed.). (1982). Studies of Mind and Brain: neural principles of learning, perception, development, cognition and motor control. Reidel, Boston.

    Google Scholar 

  • Grossberg, S. (Ed.). (1986). The Adaptive Brain I: Cognition, Learning, Reinforcement, and Rhythm. Elsevier/North-Holland, Amsterdam.

    Google Scholar 

  • Rescorla, R. A., & Wagner, A. R. (1972). A theory of pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In Black, A. H., & Prokasy, W. F. (Eds.), Classical Conditioning II, chap. 3, pp. 64–99. Appleton, New York.

    Google Scholar 

  • Sutton, R. S., & Barto, A. G. (1981). Toward a modem theory of adaptive networks: Expectation and prediction. Psychological Review, 88, 135–170.

    PubMed  Google Scholar 

  • Zalama, E., Gaudiano, P., & López-Coronado, J. (1995). A real-time, unsupervised neural network for the low-level control of a mobile robot in a nonstationary environment. Neural Networks, 8, 103–123.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

José Mira Francisco Sandoval

Rights and permissions

Reprints and permissions

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zalama, E., Gaudiano, P., Coronado, J.L. (1995). Obstacle avoidance by means of an operant conditioning model. In: Mira, J., Sandoval, F. (eds) From Natural to Artificial Neural Computation. IWANN 1995. Lecture Notes in Computer Science, vol 930. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-59497-3_211

Download citation

  • DOI: https://doi.org/10.1007/3-540-59497-3_211

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-59497-0

  • Online ISBN: 978-3-540-49288-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics