Skip to main content

A comparison between two architectures for searching and learning in maze problems

  • Conference paper
  • First Online:
Evolutionary Computing (AISB EC 1994)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 865))

Included in the following conference series:

Abstract

We present two architectures, each designed to search 2-Dimensional mazes in order to locate a “goal” position, both of which perform on-line learning as the search proceeds. The first architecture is a form of Adaptive Heuristic Critic which uses a Genetic Algorithm to determine the Action Policy and a Radial Basis Function Neural Network to store the acquired knowledge of the Critic. The second is a stimulus-response Classifier System (CS) which uses a Genetic Algorithm, applied “Michigan” style, for rule generation and the “Bucket Brigade” algorithm for rule reinforcement. Experiments conducted using agents based upon each architectural model lead us to a comparison of performance, and some observations on the nature and relative levels of abstraction in the acquired knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Barto A. G., Sutton R. S., Watkins C. J. C. H., 1989, ‘Learning and Sequential Decision Making', COINS Technical Report 89–95

    Google Scholar 

  • Booker L. B., 1982, ‘Intelligent Behaviour as an Adaptation to the Task Environment’ PhD dissertation, University of Michigan

    Google Scholar 

  • Booker L. B., Goldberg D. E., Holland J. H., 1989, ‘Classifier Systems and Genetic Algorithms', Artificial Intelligence 40, pp.235–282

    Google Scholar 

  • Goldberg, D. E., 1989, ‘Genetic Algorithms in Search, Optimization and Machine Learning', Addison Wesley

    Google Scholar 

  • Lin L., PhD thesis, 1993, ‘Reinforcement Learning for Robots using Neural Networks', School of Comp. Science, Carnegie Mellon University Pittsburgh, USA

    Google Scholar 

  • Parodi A., Bonelli P., 1993, ‘A New Approach to Fuzzy Classifier Systems', Proceedings of the 5th International Conference on Genetic Algorithms, pp.223–230

    Google Scholar 

  • Pipe A. G. 1, Fogarty T. C., Winfield A., 1994, ‘A Hybrid Architecture for Learning Continuous Environmental Models in Maze Problems', to appear in Procs. of 3rd International Conference on Simulation of Adaptive Behaviour, Brighton

    Google Scholar 

  • Pipe A. G. 2, Fogarty T. C., Winfield A., 1994, ‘Hybrid Adaptive Heuristic Architectures for Learning in Mazes with Continuous Search Spaces', to appear in Procs. of 3rd Parallel Problem Solving from Nature, Jerusalem

    Google Scholar 

  • Poggio T., Girosi F., 1989, ‘A Theory of Networks for Approximation and Learning', MIT Cambridge, MA, AI lab. Memo 1140

    Google Scholar 

  • Roberts G., 1989, ‘A rational reconstruction of Wilson's Animat and Holland's CS-1', Procs. of 3rd Int. Conf. on Genetic Algorithms, pp. 317–321, Ed. Schaffer J. D., Morgan Kaufmann

    Google Scholar 

  • Roberts G., 1991, ‘Classifier Systems for Situated Autonomous Learning', PhD thesis, Edinburgh University

    Google Scholar 

  • Roberts G., 1993, ‘Dynamic Planning for Classifier Systems', Proceedings of the 5th International Conference on Genetic Algorithms, pp.231–237

    Google Scholar 

  • Sanner R. M., Slotine J. E., 1991, ‘Gaussian Networks for Direct Adaptive Control', Nonlinear Systems Lab., MIT, Cambridge, USA, Tech. Rep. NSL-910503

    Google Scholar 

  • Sutton R. S., 1984, PhD thesis ‘Temporal Credit Assignment in Reinforcement Learning', University of Massachusetts, Dept. of computer and Info. Science

    Google Scholar 

  • Sutton R. S., 1991, ‘Reinforcement Learning Architectures for Animats', From Animals to Animats, pp288–296, Editors Meyer, J., Wilson, S., MIT Press

    Google Scholar 

  • Thrun S. B., 1992, ‘The Role of Exploration in Learning', Handbook of Intelligent Control: Neural, Fuzzy, & Adaptive Approaches, Van Nostrand Reinhold, Ed. White D., Sofge D.

    Google Scholar 

  • Werbos P. J., 1992, ‘Approximate Dynamic Programming for Real-Time Control and Neural Modelling', Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, Van Nostrand Reinhold, Ed. White D., Sofge D.

    Google Scholar 

  • Wilson S. W., 1985, ‘Knowledge growth in an artificial animal', Procs. of Int. Conf. on Genetic Algorithms and their Applications, pp. 16–23, Editor Grefenstette J. J.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Terence C. Fogarty

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pipe, A.G., Carse, B. (1994). A comparison between two architectures for searching and learning in maze problems. In: Fogarty, T.C. (eds) Evolutionary Computing. AISB EC 1994. Lecture Notes in Computer Science, vol 865. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58483-8_18

Download citation

  • DOI: https://doi.org/10.1007/3-540-58483-8_18

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-58483-4

  • Online ISBN: 978-3-540-48999-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics