A Hybrid Architecture for Situated Learning of Reactive Sequential Decision Making

Sun, Ron; Peterson, Todd; Merrill, Edward

doi:10.1023/A:1008332731824

A Hybrid Architecture for Situated Learning of Reactive Sequential Decision Making

Published: July 1999

Volume 11, pages 109–127, (1999)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Ron Sun¹,
Todd Peterson² &
Edward Merrill²

149 Accesses
18 Citations
3 Altmetric
Explore all metrics

Abstract

In developing autonomous agents, one usually emphasizes only (situated) procedural knowledge, ignoring more explicit declarative knowledge. On the other hand, in developing symbolic reasoning models, one usually emphasizes only declarative knowledge, ignoring procedural knowledge. In contrast, we have developed a learning model CLARION, which is a hybrid connectionist model consisting of both localist and distributed representations, based on the two-level approach proposed in [40]. CLARION learns and utilizes both procedural and declarative knowledge, tapping into the synergy of the two types of processes, and enables an agent to learn in situated contexts and generalize resulting knowledge to different scenarios. It unifies connectionist, reinforcement, and symbolic learning in a synergistic way, to perform on-line, bottom-up learning. This summary paper presents one version of the architecture and some results of the experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Integrating Symbolic and Sub-symbolic Reasoning

Intelligent problem-solving as integrated hierarchical reinforcement learning

Article 25 January 2022

Anchoring Knowledge in Interaction: Towards a Harmonic Subsymbolic/Symbolic Framework and Architecture of Computational Cognition

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Sutton and Barto (1981).
A. Schultz, “Using a genetic algorithm to learn strategies for collisionavoidance and local navigation,” in Proc. of 7th International Symp. on Unmanned Untethered Submersible Technology, University of New Hampshire, Durham, pp. 213-225, 1991.
Google Scholar
R. Sun and T. Peterson, “A hybrid learning model of reactive sequential decision making,” in The Working Notes of The IJCAI Workshop on Connectionist-Symbolic Integration, edited by R. Sun and F. Alexandre, 1995.
P. Rosenbloom, J. Laird, and A. Newell, The SOAR Papers: Research on Integrated Intelligence, MIT Press: Cambridge, MA, 1993.
Google Scholar
R. Nosofsky, T. Palmeri, and S. McKinley, “Rule-plus-exception model of classification learning,” Psychological Review, vol. 101,no. 1, pp. 53-79, 1994.
Google Scholar
G. Widmer and M. Kubat, “Learning in the presence of concept drift and hidden context,” Machine Learning, vol. 23,no. 1, 1996.
R. Sutton, “Learning to predict by the methods of temporal difference,” Machine Learning, vol. 3, pp. 9-44, 1988.
Google Scholar
R. Sutton, “Integrated architectures for learning, planning, and reacting based on approximating dynamic programming,” in Proc. of Seventh International Conference on Machine Learning, Morgan Kaufmann: San Mateo, CA, 1990.
Google Scholar
S. Mahadevan and J. Connell, “Automatic programming of behavior-based robot with reinforcement learning,” vol. 55, pp. 311-365, 1992.
Google Scholar
L. Lin, “Self-improving reactive agents based on reinforcement learning, planning, and teaching,” Machine Learning, vol. 8, pp. 293-321, 1992.
Google Scholar
L. Kaelbling, M. Littman, and A. Moore, “Reinforcement learning: A survey,” Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
Google Scholar
T. Dietterich, Hierarchical reinforcement learning with MAXQ value function decomposition, 1997, ftp://www.cs.orst.edu.
J. Holland, N. Nisbitt, P. Thagard, and J. Holyoak, Induction: A Theory of Learning and Development, MIT Press: Cambridge, MA, 1986.
Google Scholar
J. Grefenstette, “The evolution of strategies for multiagent environments,” Adaptive Behavior, vol. 1,no. 1, pp. 65-90, 1992.
Google Scholar
L. Meeden, “An incremental approach to developing intelligent neural network controllers for robots,” Adaptive Behavior, 1995.
R. Michalski, “A theory and methodology of inductive learning,” Artificial Intelligence, vol. 20, pp. 111-161, 1983.
Google Scholar
R. Quinlan, “Inductive learning of decision trees,” Machine Learning, vol. 1, pp. 81-106, 1986.
Google Scholar
R. Quinlan, “Learning logical definition from relations,” Machine Learning, vol. 5, pp. 239-266, 1990.
Google Scholar
T. Mitchell, “Generalization as search,” Artificial Intelligence, vol. 18, pp. 203-226, 1982.
Google Scholar
M. Lebowitz, “Experiments with incremental concept formation: UNIMEM,” Machine Learning, vol. 2, pp. 103-138, 1987.
Google Scholar
D. Fisher, “Knowledge acquisition via incremental conceptual clustering,” Machine Learning, vol. 2, pp. 139-172, 1987.
Google Scholar
P. Utgoff, “Incremental induction of decision trees,” Machine Learning, vol. 4, pp. 161-186, 1989.
Google Scholar
H. Hirsh, “Generalizing version spaces,” Machine Learning, vol. 17, pp. 5-46, 1994.
Google Scholar
P. Clark and T. Niblett, “The CN2 induction algorithm,” Machine Learning, vol. 3, pp. 261-284, 1989.
Google Scholar
R. Sun and T. Peterson, “Some experiments with a hybrid model for learning sequential decision making,” Information Sciences, vol. 14, pp. 83-107, 1998.
Google Scholar
R. Sun and T. Peterson, “Autonomous learning of sequential tasks: Experiments and analyses,” IEEE Transaction on Neural Networks, vol. 9,no. 6, pp. 1217-1234, 1998.
Google Scholar
J. Anderson, “Acquisition of cognitive skill,” Psychological Review, vol. 89, pp. 369-406, 1982.
Google Scholar
J. Anderson, Rules of the Mind, Lawrence Erlbaum Associates: Hillsdale, NJ, 1993.
Google Scholar
J. Anderson and C. Lebiere, The Atomic Components of Thought, Lawrence Erlbaum Associates: Mahwah, NJ, 1998.
Google Scholar
F. Keil, Concepts, Kinds, and Cognitive Development, MIT Press: Cambridge, MA, 1989.
Google Scholar
A. Damasio et al., “Neural regionalization of knowledge access,” Cold Spring Harbor Symp. on Quantitative Biology, CSHL Press, vol. LV, 1990.
R. Sun, Integrating Rules and Connectionism for Robust Commonsense Reasoning, John Wiley and Sons: New York, NY, 1994.
Google Scholar
J. Anderson, The Architecture of Cognition, Harvard University Press: Cambridge, MA, 1983.
Google Scholar
P. Fitts and M. Posner, Human Performance, Brooks/Cole, Monterey, CA, 1967.
Google Scholar
H. Dreyfus and S. Dreyfus, Mind Over Machine, The Free Press: New York, NY, 1987.
Google Scholar
P. Smolensky, “On the proper treatment of connectionism,” Behavioral and Brain Sciences, vol. 11,no. 1, pp. 1-74, 1988.
Google Scholar
W. James, The Principles of Psychology, Dover: New York, 1890.
Google Scholar
R. Sun, “Learning, action, and consciousness: A hybrid approach towards modeling consciousness,” Neural Networks, special issue on consciousness, vol. 10,no. 7, pp. 1317-1331, 1997.
Google Scholar
D. Willingham, M. Nissen, and P. Bullemer, “On the development of procedural knowledge,” Journal of Experimental Psychology: Learning, Memory, and Cognition, vol. 15, pp. 1047-1060, 1989.
Google Scholar
R. Sun, “Robust reasoning: Integrating rule-based and similarity-based reasoning,” Artificial Intelligence, vol. 75,no. 2, pp. 241-296, 1995.
Google Scholar
J. Hendler, “Marker passing and microfeature,” in Proc. 10th IJCAI, Morgan Kaufmann: San Mateo, CA, 1987, pp. 151-154.
Google Scholar
J. Gelfand, D. Handelman, and S. Lane, “Integrating knowledge-based systems and neural networks for robotic skill acquisition,” in Proc. IJCAI, Morgan kaufmann: San Mateo, CA, 1989, pp. 193-198.
Google Scholar
W. Schneider and W. Oliver, “An instructable connectionist/control architecture,” in Architectures for Intelligence, edited by K. VanLehn, Erlbaum: Hillsdale, NJ, 1991.
Google Scholar
R. Sun, “A Connectionist model for commonsense reasoning incorporating rules and similarities,” Knowledge Acquisition, vol. 4, pp. 293-321, 1992.
Google Scholar
M. Erickson and J. Kruschke, Rules and Examplars in Category Learning, manuscript, 1997.
A. Reber, “Implicit learning and tacit knowledge,” Journal of Experimental Psychology: General, vol. 118,no. 3, pp. 219-235, 1989.
Google Scholar
R. Dominowski, How do People Discover Concepts?, manuscript, 1975.
D. Medin, W. Wattenmaker, and R. Michalski, “Constraints and preferences in inductive learning: An experimental study of human and machine performance,” Cognitive Science, vol. 11, pp. 299-339, 1987.
Google Scholar
C. Watkins, “Learning with Delayed Rewards,” Ph.D. Thesis, Cambridge University, Cambridge, UK, 1989.
Google Scholar
D. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific: Belmont, MA, 1996.
Google Scholar
R. Parr and S. Russell, “Reinforcement learning with hierarchies of machines,” Advances in Neural Information Processing Systems, MIT Press: Cambridge, MA, 1997.
Google Scholar
D. Precup, R. Sutton, and S. Singh, “Multi-time models for temporary abstract planning,” Advances in Neural Information Processing Systems 10, MIT Press: Cambridge, MA, 1998.
Google Scholar
T. Tesauro, “Practical issues in temporal difference learning,” Machine Learning, vol. 8, pp. 257-277, 1992.
Google Scholar
J. Boyan and A. Moore, “Generalization in reinforcement learning: Safely approximating the value function,” in Neural Information Processing Systems, edited by J. Tesauro and D. Touretzky, and T. Leen, MIT Press: Cambridge, MA, pp. 369-376, 1995.
Google Scholar
R. Sun, “On variable binding in connectionist networks,” Connection Science, vol. 4,no. 2, pp. 93-124, 1992.
Google Scholar
L.M. Fu, “Rule learning by searching on adapted nets,” in Proc. of AAAI'91, 1991, pp. 590-595.
R.C. Lacher, “Expert networks: Paradigmatic conflict, technological rapprochment,” Minds and Machines, vol 3, pp. 53-71, 1993.
Google Scholar
G. Towell and J. Shavlik, “Extracting refined rules from knowledge-based neural networks,” Machine Learning, vol. 13,no. 1, pp. 71-101, 1993.
Google Scholar
R. Michalski, I. Mozetic, J. Hong, and N. Lavrac, “The multipurpose incremental learning system AQ15,” in Proc. of AAAI-86, Morgan Kaufmann: San Mateo, CA, 1986, pp. 1041-1045.
Google Scholar
A. McCallum, “Learning to use selective attention and short-term memory in sequential tasks,” in Proc. Conference on Simulation of Adaptive Behavior, MIT Press: Cambridge, MA, 1996, pp. 315-324.
Google Scholar
L. Breiman, “Bagging predictors,” Machine Learning, vol. 24,no. 2, pp. 123-140, 1996.
Google Scholar
Y. Freund and R. Schapire, “Experiments with a new boosting algorithm,” in Proc. of ICML'97, Morgan Kaufmann, San Francisco, CA, 1996, pp. 148-156.
Google Scholar
R. Brooks, “Intelligence without representation,” Artificial Intelligence, vol. 47, pp. 139-160, 1991.
Google Scholar
P. Tadepalli and T. Dietterich, “Hierarchical explanation-based reinforcement learning,” in Proc. International Conference on Machine Learning, Morgan Kaufmann: San Francisco, CA, 1997, pp. 358-366.
Google Scholar
D. Gordon, A. Schultz, J. Grefenstette, J. Ballas, and M. Perez, User's Guide to the Navigation and Collision Avoidance Task, Naval Research Lab, Washington, DC, 1994.
Google Scholar
R. Sun, E. Merrill, and T. Peterson, “A bottom-up model of skill learning,” in Proc. of 20th Cognitive Science Society Conference, Lawrence Erlbaum Associates: Mahwah, NJ, pp. 1037-1042, 1998.
Google Scholar
R. Sun, E. Merrill, and T. Peterson, “Skill learning using a bottom-up hybrid model,” in Proc. of The Second European Conference on Cognitive Modeling, Nottingham University Press: Nottingham, UK, pp. 23-29, April 1998.
Google Scholar
E. Smith and D. Medin, Categories and Concepts, Cambridge, MA: Harvard University Press, 1981.
Google Scholar
E. Rosch, “Principles of categorization,” in Cognition and Categorization, edited by E. Rosch and B. Lloyd, Erlbaum: Hillsdale, NJ, 1978.
Google Scholar
C. Giraud-Carrier and T. Martinez, “An integrated framework for learning and reasoning,” Journal of Artificial Intelligence Research, vol. 3, pp. 147-185, 1995.
Google Scholar
D. Waltz, “How to build a robot,” in Proc. of Conf. on Simulation of Adaptive Behaviors, edited by S. Wilson, MIT Press: Cambridge, MA, 1991.
Google Scholar
R. Sun, E. Merrill, and T. Peterson, “A bottom-up model of skill learning,” in Proc. of 20th Cognitive Science Society Conference, Lawrence Erlbaum Associates: Mahwah, NJ, pp. 1037-1042, 1998.
Google Scholar
M. Lebowitz, “Experiments with incremental concept formation: UNIMEM,” Machine Learning, vol. 2, pp. 103-138, 1987.
Google Scholar
D. Fisher, “Knowledge acquisition via incremental conceptual clustering,” Machine Learning, vol. 2, pp. 139-172, 1987.
Google Scholar
R. Stepp and R. Michalski, “Conceptual clustering,” in Machine Learning, II, edited by R. Michalski et al., Morgan Kaufmann: Los Altos, CA Stepp and Michalski (1983).
Google Scholar
W. Shen, “Discovery as autonomous learning from the environment,” Machine Learning, vol. 12, pp. 143-165, 1993.
Google Scholar
E. Wisniewski and D. Medin, “On the interaction of data and theory in concept learning,” Cognitive Science, vol. 18, pp. 221-281, 1994.
Google Scholar
L. Rips, “Similarity, typicality, and categorization,” in Similarity and Analogical Reasoning, edited by S. Vosniadou and A. Ortony, Cambridge University Press: New York, NY, 1989.
Google Scholar
R. Andrews and J. Diederich (Eds.), Proceedings of the NIPS'96 Workshop on Rule Extraction From Trained Artificial Neural Networks, NIPS Foundation, 1996.
R. Miikkulainen and M. Dyer, “Natural language processing with modular PDP networks and distributed lexicons,” Cognitive Science, vol. 15,no. 3, pp. 343-399, 1991.
Google Scholar
D. Gordon and D. Subramanian, “A cognitive model of learning to navigate,” in Proc. of 18th Cognitive Science Conference, Lawrence Erlbaum: Mahwah, NJ, 1997, pp. 271-276.
Google Scholar
T. Johnson, J. Zhang, and H. Wang, “A hybrid learning model of abductive reasoning,” in Connectionist-Symbolic Integration, edited by R. Sun and F. Alexandre, Lawrence Erlbaum Associates: Mahwah, NJ, 1998.
Google Scholar
V. Ajjanagadde and L. Shastri, “From simple association to systematic reasoning,” Tech. Report MS-CIS-90-05, University of Pennsylvania, Philadelphia, PA, 1990.
Google Scholar
J. Barnden, “The right of free association: Relative-position encoding for connectionist data structures,” in Proc. 10th Conference of Cognitive Science Society, Lawrence Erlbaum Associates: Hillsdale, NJ, pp. 503-509, 1988.
Google Scholar
J. Kruschke, “ALCOVE: an examples-based connectionist model of category learning,” Psychological Review, vol. 99, pp. 22-44, 1992.
Google Scholar
D. Touretzky and G. Hinton, “Symbols among neurons,” in Proc. 9th IJCAI, Morgan Kaufmann, 1987, pp. 238-243.
C.L. Giles and M. Gori, “Adaptive processing of sequences and data structures,” Lecture Notes in Artificial Intelligence, Springer Verlag, 1998.
J. Barnden, “The right of free association: Relative-position encoding for connectionist data structures,” in Proc. 10th Conference of Cognitive Science Society, Lawrence Erlbaum Associates: Hillsdale, NJ, pp. 503-509, 1988.
Google Scholar
E. Gat, “Integrating planning and reacting in a heterogeneous architecture,” in Proc. AAAI, Morgan Kaufmann: San Mateo, CA, 1992, pp. 809-815.
Google Scholar
W. Schneider and W. Oliver, “An instructable connectionist/control architecture,” in Architectures for Intelligence, edited by K. VanLehn, Erlbaum: Hillsdale, NJ, 1991.
Google Scholar
R. Maclin and J. Shavlik, “Incorporating advice into agents that learn from reinforcements,” in Proc. of AAAI-94, Morgan Kaufmann: San Meteo, CA, 1994.
Google Scholar
G. Drescher, Made-up Minds, MIT Press: Cambridge, MA, 1991.
Google Scholar
D. Broadbent, P. Fitsgerald, and M. Broadbent, “Implicit and explicit knowledge in the control of complex systems,” British Journal of Psychology, vol. 77, pp. 33-50, 1986.
Google Scholar
G. Logan, “Toward an instance theory of automatization,” Psychological Review, vol. 95,no. 4, pp. 492-527, 1988.
Google Scholar
W. Stanley, R. Mathews, R. Buss, and S. Kotler-Cope, “Insight without awareness: On the interaction of verbalization, instruction and practice in a simulated process control task,” Quarterly Journal of Experimental Psychology, vol. 41A,no. 3, pp. 553-577, 1989.
Google Scholar
D. Gordon and D. Subramanian, “A multistrategy learning scheme for agent knowledge acquisition,” Informatica, vol. 17, pp. 331-346, 1993.
Google Scholar
E. Thorndike, Animal Intelligence, Hafner: Darien, Connecticutt, 1911.
Google Scholar
K. VanLehn, “Cognitive skill acquisition,” in Annual Review of Psychology, edited by J. Spence, J. Darly, and D. Foss, Annual Reviews, Palo Alto, CA, vol. 47, pp. 513-539, 1996.
Google Scholar
S. Whitehead and D. Ballard, “Learning to perceive and act by trial and error,” Artificial Intelligence, vol. 7, pp. 45-83, 1991.
Google Scholar
J. Schlimmer, “Incremental adjustment of representations for learning,” in Proc.of 4th Workshop on Machine Learning, Morgan Kaufmann: San Mateo, CA, pp. 502-507, 1987.
Google Scholar
P. Agre and D. Chapman, “What are plans for?,” in Designing Autonomous Agents, edited by P. Maes, Elsevier: New York, 1990.
Google Scholar
A. Barto, R. Sutton, and C. Watkins, “Learning and sequential decision-making,” in Learning and Computational Neuroscience, edited by M. Gabriel and J. Moors, MIT Press: Cambridge, MA.
J. LeDoux, “Brain mechanisms of emotion and emotional learning,” Current Opinion in Neurobiology, vol. 2,no. 2, pp. 191-197, 1992
Google Scholar
R. Shiffrin and W. Schneider, “Controlled and automatic human information processing II,” Psychological Review, vol. 84, pp. 127-190, 1977.
Google Scholar
R. Stepp and R. Michalski, “Conceptual clustering,” in Machine Learning, II, edited by R. Michalski et al., Morgan Kaufmann: Los Altos, CA, 1986.
Google Scholar
R. Sun and L. Bookman (Eds.), Computational Architectures Integrating Neural and Symbolic Processes, Kluwer Academic Publishers: Norwell, MA, 1994.
Google Scholar
D. Touretzky and G. Hinton, “Symbols among neurons,” in Proc. 9th IJCAI, Morgan Kaufmann, 1987, pp. 238-243.
K. VanLehn, “Rule acquisiton events in the discovery of problem-solving strategies,” Cognitive Science, vol. 15, pp. 1-47, 1991.
Google Scholar
S. Whitehead and L. Lin, “Reinforcement learning of non-Markov decision processes,” Artificial Intelligence, vol. 73,nos. 1–2, pp. 271-306, 1995.
Google Scholar

Download references

Author information

Authors and Affiliations

NEC Research Institute and University of Alabama, 4 Independence Way, Princeton, NJ, 08540
Ron Sun
The University of Alabama, Tuscaloosa, AL, 35487
Todd Peterson & Edward Merrill

Authors

Ron Sun
View author publications
You can also search for this author inPubMed Google Scholar
Todd Peterson
View author publications
You can also search for this author inPubMed Google Scholar
Edward Merrill
View author publications
You can also search for this author inPubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sun, R., Peterson, T. & Merrill, E. A Hybrid Architecture for Situated Learning of Reactive Sequential Decision Making. Applied Intelligence 11, 109–127 (1999). https://doi.org/10.1023/A:1008332731824

Download citation

Issue Date: July 1999
DOI: https://doi.org/10.1023/A:1008332731824

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Hybrid Architecture for Situated Learning of Reactive Sequential Decision Making

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Integrating Symbolic and Sub-symbolic Reasoning

Intelligent problem-solving as integrated hierarchical reinforcement learning

Anchoring Knowledge in Interaction: Towards a Harmonic Subsymbolic/Symbolic Framework and Architecture of Computational Cognition

Explore related subjects

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Subscribe and save

Buy Now