Dynamics of a Classical Conditioning Model

Balkenius, Christian; Morén, Jan

doi:10.1023/A:1008965713435

Dynamics of a Classical Conditioning Model

Published: 28 October 2004

Volume 7, pages 41–56, (1999)
Cite this article

Autonomous Robots Aims and scope Submit manuscript

Christian Balkenius¹ &
Jan Morén¹

437 Accesses
18 Citations
Explore all metrics

Abstract

Classical conditioning is a basic learning mechanism in animals and can be found in almost all organisms. If we want to construct robots with abilities matching those of their biological counterparts, this is one of the learning mechanisms that needs to be implemented first. This article describes a computational model of classical conditioning where the goal of learning is assumed to be the prediction of a temporally discounted reward or punishment based on the current stimulus situation.

The model is well suited for robotic implementation as it models a number of classical conditioning paradigms and learning in the model is guaranteed to converge with arbitrarily complex stimulus sequences. This is an essential feature once the step is taken beyond the simple laboratory experiment with two or three stimuli to the real world where no such limitations exist. It is also demonstrated how the model can be included in a more complex system that includes various forms of sensory pre-processing and how it can handle reinforcement learning, timing of responses and function as an adaptive world model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

References

Balkenius, C. 1995. Natural intelligence in artificial creatures, Lund University Cognitive Studies 37.
Balkenius, C. 1996. Generalization in instrumental learning. In From Animals to Animats 4: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, P. Maes, M. Mataric, J.-A. Meyer, J. Pollack, and S.W. Wilson (Eds.), MIT Press/Bradford Books: Cambridge, MA.
Google Scholar
Balkenius, C. 1998. A neural network model of classical conditioning I: The dynamics of learning, Lund University Cognitive Studies 68.
Balkenius, C. and Morén, J. 1998. Computational models of classical conditioning: A comparative study. In From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior, R. Pfeifer, B. Blumberg, J.-A. Meyer, and S.W. Wilson (Eds.), MIT Press/Bradford Books: Cambridge, MA.
Google Scholar
Balleine, B. 1992. Instrumental performance following a shift in primary motivation depends on incentive learning. Journal of Experimental Psychology: Animal Behavior Processes, 18:236-250.
Article Google Scholar
Balleine, B., Garner, C., Gonzalez, F., and Dickinson, A. 1995. Motivational control of heterogeneous instrumental chains. Journal of Experimental Psychology: Animal Behavior Processes, 21:203-217.
Article Google Scholar
Barto, A.G., Sutton, R.S., and Watkins, C.J.C.H. 1990. Learning and sequential decision making. In Learning and Computational Neuroscience: Foundations of Adaptive Networks, M. Gabriel and J. Moore (Eds.), MIT Press: Cambridge, MA, pp. 539-602.
Google Scholar
Bouton, M.E. 1991. Context and retrieval in extinction and in other examples of interference in simple associative learning. In Current Topics in Animal Learning: Brain, Emotion and Cognition, L. Dachowski and C.F. Flaherty (Eds.), Erlbaum: Hillsdale, NJ.
Google Scholar
Desmond, J.E. 1990. Temporally adaptive responses in neural models: The stimulus trace. In Learning and Computational Neuroscience: Foundations of Adaptive Networks, M. Gabriel and J. Moore (Eds.), MIT Press: Cambridge, MA, pp. 421-456.
Google Scholar
Donahoe, J.W. and Palmer, D.C. 1994. Learning and Complex Behavior, Allyn & Bacon: Boston.
Google Scholar
Gaffan, D. 1992. Amygdala and the memory of reward. In The Amygdala: Neurobiological Aspects of Emotion, Memory, and Mental Dysfunction, J.P. Aggleton (Ed.), Wiley-Liss: New York, pp. 471-484.
Google Scholar
Gallistel, C.R. 1990. The Organization of Learning, MIT Press: Cambridge, MA.
Google Scholar
Gray, J.A. 1975. Elements of a Two-Process Theory of Learning, Academic Press: London.
Google Scholar
Grossberg, S. 1974. Classical and Instrumental Learning by Neural Networks. In Progress in Theoretical Biology, Academic Press: New York, Vol. 3.
Google Scholar
Grossberg, S. 1987. The Adaptive Brain, North-Holland: Amsterdam.
Google Scholar
Hassoun, M.H. 1995. Fundamentals of Artificial Neural Networks, MIT Press: Cambridge, MA.
MATH Google Scholar
Holland, P.C. 1992. Occasion setting in Pavlovian conditioning. In The Psychology of Learning and Motivation, D. Medin (Ed.), Academic Press: San Diego, CA, Vol. 28, pp. 69-125.
Google Scholar
Hull, C.L. 1932. The goal-gradient hypothesis and maze learning. Psychological Review, 39(1):25-43.
Article Google Scholar
Kaelbling, L.P., Littman, M.L., and Moore, A.W. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285.
Google Scholar
Kamin, L.J. 1968. Attention-like processes in classical conditioning. In Miami Symposium on the Prediction of Behavior: Aversive Stimulation, M.R. Jones (Ed.), University of Miami Press: Miami, pp. 9-31.
Google Scholar
Kehoe, E.J. 1982. Conditioning with serial compound stimuli: Theoretical and empirical issues. Experimental Animal Behavior, 1:30-65.
Google Scholar
Kehoe, E.J. 1990. Classical conditioning: Fundamental issues for adaptive network models. In Learning and Computational Neuroscience: Foundations of Adaptive Networks, M. Gabriel and J. Moore (Eds.), MIT Press: Cambridge, MA, pp. 390-420.
Google Scholar
Klopf, A.H. 1988. A neuronal model of classical conditioning. Psychobiology, 16(2):85-125.
Google Scholar
Klopf, A.H., Morgan, J.S., and Weaver, S.E. 1993. A hierarchical network of control systems that learn: Modeling nervous system function during classical and instrumental conditioning. Adaptive Behavior, 1(3):263-319.
Article Google Scholar
Machado, A. 1997. Learning the temporal dynamics of behavior. Psychological Review, 104:241-265.
Article Google Scholar
Mackintosh, N.J. 1975. A theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review, 82:276-298.
Article Google Scholar
Mackintosh, N.J. 1983. Conditioning and Associative Learning, Oxford University Press: Oxford.
Google Scholar
Minsky, M. and Papert, S. 1988. Perceptrons-Expanded Edition, MIT Press: Cambridge, MA.
Google Scholar
Moore, J.W. and Choi, J.-C. 1998. Conditioned stimuli are occasion setters. In Occasion Setting: Associative Learning and Cognition in Animals, N.A. Schmajuk and P.C. Holland (Eds.), American Psychological Association: Washington, D.C.
Google Scholar
Mowrer, O.H. 1973. Learning Theory and Behavior, Wiley: New York.
Google Scholar
Pavlov, I.P. 1927. Conditioned Reflexes, Oxford University Press: Oxford.
Google Scholar
Rescorla, R.A. and Wagner, A.R. 1972. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement. In Classical Conditioning II: Current Research and Theory, A.H. Black and W.F. Prokasy (Eds.), Appleton-Century-Crofts: New York, pp. 64-99.
Google Scholar
Rescorla, R.A. 1985. Inhibition and facilitation. In Information Processing in Animals: Conditioned Inhibition, R.R. Miller and N.E. Spear (Eds.), Erlbaum: Hillsdale, NJ, pp. 299-326.
Google Scholar
Rosenblatt, F. 1959. Principles of Neurodynamics, Spartan Books: New York.
Google Scholar
Schmajuk, N.A. 1990. Role of the hippocampus in temporal and spatial navigation: An adaptive neural network. Behavioral Brain Research, 39:205-229.
Article Google Scholar
Schmajuk, N.A. and DiCarlo, J.J. 1992. Stimulus configuration, classical conditioning, and hippocampal function. Psychological Review, 99:268-305.
Article Google Scholar
Schmajuk, N.A. and Thieme, A.D. 1992. Purposive behavior and cognitive mapping: A neural network model. Biological Cybernetics, 67:165-174.
Article MATH Google Scholar
Schmajuk, N.A., 1997. Animal Learning and Cognition: A Neural Network Approach, Cambridge University Press.
Schmajuk, N.A. and Holland, P.C. (Eds.). 1998. Occasion Setting: Associative Learning and Cognition in Animals, American Psychological Association: Washington, DC.
Google Scholar
Schneiderman, N. 1966. Interstimulus interval function of the nictitating membrane response of the rabbit under delay versus trace conditioning. Journal of Comparative and Physiological Psychology, 62:397-402.
Article Google Scholar
Schneiderman, N., Fuentes, I., and Gormezano, I. 1962. Acquisition and extinction of the classically conditioned eyelid response in the albino rabbit. Science, 136:650-652.
Article Google Scholar
Schneiderman, N. and Gormezano, I. 1964. Conditioning of the nictitating membrane of the rabbit as a function as the CS-US interval. Journal of Comparative and Physiological Psychology, 57:188-195.
Article Google Scholar
Smith, M.C., Coleman, S.R., and Gormezano, I. 1969. Classical conditioning of the rabbits nictitating membrane response at backward, simultaneous, and forward CS-US interval. Journal of Comparative Physiological Psychology, 69:226-231.
Article Google Scholar
Staddon, J.E.R and Higa, J.J. 1996. Multiple time scales in simple habituation. Psychological Review, 103:720-733.
Article Google Scholar
Sutton, R.S. and Barto, A.G. 1990. Time-derivative models of Pavlovian reinforcement. In Learning and Computational Neuroscience: Foundations of Adaptive Networks, M. Gabriel and J. Moore (Eds.), MIT Press: Cambridge, MA, pp. 497-538.
Google Scholar
Sutton, R.S. and Barto, A.G. 1998. Reinforcement Learning: An Introduction, MIT Press: Cambridge, MA.
Google Scholar
Watkins, C.J.C.H. 1992. Q-learning. Machine Learning, 8:279-292.
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Lund University Cognitive Science, Lund, Sweden
Christian Balkenius & Jan Morén

Authors

Christian Balkenius
View author publications
You can also search for this author in PubMed Google Scholar
Jan Morén
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Balkenius, C., Morén, J. Dynamics of a Classical Conditioning Model. Autonomous Robots 7, 41–56 (1999). https://doi.org/10.1023/A:1008965713435

Download citation

Published: 28 October 2004
Issue Date: July 1999
DOI: https://doi.org/10.1023/A:1008965713435

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dynamics of a Classical Conditioning Model

Abstract

Access this article

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation