Hierarchical Learning of Navigational Behaviors in an Autonomous Robot using a Predictive Sparse Distributed Memory

Rao, Rajesh P.N.; Fuentes, Olac

doi:10.1023/A:1008810406347

Hierarchical Learning of Navigational Behaviors in an Autonomous Robot using a Predictive Sparse Distributed Memory

Published: July 1998

Volume 5, pages 297–316, (1998)
Cite this article

Autonomous Robots Aims and scope Submit manuscript

Rajesh P.N. Rao¹ &
Olac Fuentes²

180 Accesses
6 Citations
Explore all metrics

Abstract

We describe a general framework for learning perception-based navigational behaviors in autonomous mobile robots. A hierarchical behavior-based decomposition of the control architecture is used to facilitate efficient modular learning. Lower level reactive behaviors such as collision detection and obstacle avoidance are learned using a stochastic hill-climbing method while higher level goal-directed navigation is achieved using a self-organizing sparse distributed memory. The memory is initially trained by teleoperating the robot on a small number of paths within a given domain of interest. During training, the vectors in the sensory space as well as the motor space are continually adapted using a form of competitive learning to yield basis vectors that efficiently span the sensorimotor space. After training, the robot navigates from arbitrary locations to a desired goal location using motor output vectors computed by a saliency-based weighted averaging scheme. The pervasive problem of perceptual aliasing in finite-order Markovian environments is handled by allowing both current as well as the set of immediately preceding perceptual inputs to predict the motor output vector for the current time instant. We describe experimental and simulation results obtained using a mobile robot equipped with bump sensors, photosensors and infrared receivers, navigating within an enclosed obstacle-ridden arena. The results indicate that the method performs successfully in a number of navigational tasks exhibiting varying degrees of perceptual aliasing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Sequence-Based Neuronal Model for Mobile Robot Localization

An integrated model of autonomous topological spatial cognition

Article 14 November 2015

Efficient Visual Navigation with Bio-inspired Route Learning Algorithms

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Albus, J. 1971. A theory of cerebellar functions. Math. Bio., 10:25- 61.
Google Scholar
Asada, M., Uchibe, E., Noda, S., Tawaratsumida, S., and Hosoda, K. 1994. Coordination of multiple behaviors acquired by a vision-based reinforcement learning. In Proceedings of the 1994 IEEE/RSJ International Conference on Intelligent Robots an Systems, Munich, Germany, pp. 917-924.
Beer, R. and Gallagher, J. 1992. Evolving dynamical neural networks for adaptive behavior. Adaptive Behavior, 1:91-122.
Google Scholar
Brooks, R.A. 1986. A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation, 2(1):14-22.
Google Scholar
Broomhead, D.S. and Lowe, D. 1988. Multivariable functional interpolation and adaptive networks. Complex Systems, 2:321-355.
Google Scholar
Chrisman, L. 1992. Reinforcement learning with perceptual aliasing. In Proceedings of the Eleventh National Conference on Artificial Intelligence.
Cliff, D., Husbands, P., and Harvey, I. 1992. Evolving visually guided robots. In From Animals to Animats 2: Proceedings of the Second International Conference on the Simulation of Adaptive Behavior, J.A. Meyer, H. Roitblat, and S. Wilson (Eds.), MIT Press: Cambridge, MA, pp. 374-383.
Google Scholar
Connell, J.H. 1990. Minimalist Mobile Robotics: A Colony-Style Architecture for an Artificial Creature, Academic Press: Boston, MA.
Google Scholar
Dayan, P. and Hinton, G. 1993. Feudal reinforcement learning. In Advances in Neural Information Processing Systems 5, S. Hanson, J. Cowan, and C. Giles (Eds.), Morgan Kaufmann: San Mateo, CA, pp. 271-278.
Google Scholar
de Bourcier, P. 1993. Animate navigation using visual landmarks. Cognitive Science Research Papers 277, University of Sussex at Brighton.
Elfes, A. 1987. Sonar-based real-world mapping and navigation. IEEE Journal of Robotics and Automation, 3:249-265.
Google Scholar
Fuentes, O., Rao, R.P.N., and Wie, M.V. 1995. Hierarchical learning of reactive behaviors in an autonomous mobile robot. In Proceedings of the IEEE International Conf. on Systems, Man, and Cybernetics.
Fuentes, O. and Nelson, R.C. 1997. Learning dextrous manipulation skills using the evolution strategy. In Proceedings of the 1997 IEEE International Conference on Robotics and Automation, Albuquerque, New Mexico.
Hackbusch, W. 1985. Multi-grid Methods and Applications, Springer-Verlag: Berlin.
Google Scholar
Hertz, J., Krogh, A., and Palmer, R. 1991. Introduction to the Theory of Neural Computation, Addison-Wesley Publishing Company: Redwood City, CA.
Google Scholar
Hinton, G., McClelland, J., and Rumelhart, D. 1986. Distributed representations. In Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol. 1. MIT Press: Cambridge, MA.
Google Scholar
Joglekar, U.D. 1989. Learning to read aloud: A neural network approach using sparse distributed memory. Technical Report 89.27, Research Institute for Advanced Computer Science, NASA Ames Research Center.
Jordan, M. 1986. Attractor dynamics and parallelism in a connectionist sequential machine. In Proceedings of the 1986 Cognitive Science Conference.
Kaelbling, L. 1993a. Hierarchical reinforcement learning: Preliminary results. In Proceedings of the Tenth International Conference on Machine Learning, Morgan Kaufmann: San Mateo, CA, pp. 167-173.
Google Scholar
Kaelbling, L.P. 1993b. Learning in Embedded Systems, MIT Press: Cambridge, MA.
Google Scholar
Kaelbling, L.P., Littman, M.L., and Moore, A.W. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285.
Google Scholar
Kanerva, P. 1988. Sparse Distributed Memory, Bradford Books: Cambridge, MA.
Google Scholar
Kanerva, P. 1993. Sparse distributed memory and related models. In Associative Neural Memories, M.H. Hassoun (Ed.), Oxford University Press: New York, pp. 50-76.
Google Scholar
Krose, B.J.A. and Eecen, M. 1994. A self-organizing representation of sensor space for mobile robot navigation. In IEEE/RSJ/GI International Conference on Intelligent Robots an Systems, pp. 9-14.
Kuipers, B.J. and Byun, Y.-T. 1988. A robust, qualitiative approach to a spatial learning mobile robot. In SPIE Cambridge Symposium on Optical and Optoelectronic Engineering: Advances in Intelligent Robotics Systems.
Lewis, M.A., Fagg, A.H., and Solidum, A. 1992. Genetic programming approach to the construction of a neural network for control of a walking robot. In Proceedings of the 1992 IEEE International Conference on Robotics and Automation, Nice, France.
Maes, P. and Brooks, R.A. 1990. Learning to coordinate behaviors. In Proceedings of AAAI-90, pp. 796-802.
Mahadevan, S. and Connell, J. 1991. Scaling reinforcement learning to robotics by exploiting the subsumption architecture. In Proceedings of the Eighth International Workshop on Machine Learning.
Marr, D. 1969. A theory of cerebellar cortex. J. Physiol. (London), 202:437-470.
Google Scholar
Mataric, M. 1992. Integration of representation into goal-driven behavior-based robot. IEEE Transactions on Robotics and Automation, 8(3):304-312.
Google Scholar
McCallum, R.A. 1996. Hidden state and reinforcement learning with instance-based state identification. IEEE Trans. on Systems, Man and Cybernetics, 26(3):464-473.
Google Scholar
McIlwain, J.T. 1991. Distributed spatial coding in the superior colliculus: A review. Visual Neuroscience, 6:3-13.
Google Scholar
Nehmzow, U. and Smithers, T. 1991. Mapbuilding using self-organizing networks in “Really Useful Robots”. In From Animals to Animats 1: Proceedings of the First International Conference on Simulation of Adaptive Behavior, J.-A. Meyer and S.W. Wilson (Eds.), MIT Press: Cambridge, MA, pp. 152-159.
Google Scholar
Nelson, R.C. 1991. Visual homing using an associative memory. Biological Cybernetics, 65:281-291.
Google Scholar
Nolfi, S. 1997. Using emergent modularity to develop control systems for mobile robots. Adaptive Behavior, 5:343-363.
Google Scholar
Nowlan, S. 1990. Maximum likelihood competitive learning. In Advances in Neural Information Processing Systems 2, D. Touretzky (Ed.), Morgan Kaufmann: San Mateo, CA, pp. 574-582.
Google Scholar
Pierce, D. and Kuipers, B. 1991. Learning hill-climbing functions as a strategy for generating behaviors in a mobile robot. In From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior, pp. 327-336.
Poggio, T. and Girosi, F. 1990. Networks for approximation and learning. In Proc. IEEE, vol. 78, pp. 1481-1497.
Google Scholar
Pomerleau, D. 1989. ALVINN:Anautonomous land vehicle in a neural network. In Advances in Neural Information Processing Systems, D. Touretzky (Ed.), vol. 1, Morgan Kaufmann: San Mateo, pp. 305-313.
Google Scholar
Pomerleau, D. 1991. Efficient training of artificial neural networks for autonomous navigation. Neural Computation, 3(1):88-97.
Google Scholar
Prager, R.W. and Fallside, F. 1989. The modified Kanerva model for automatic speech recognition. Computer Speech and Language, 3(1):61-81.
Google Scholar
Rao, R.P.N. and Ballard, D.H. 1995a. An active vision architecture based on iconic representations. Artificial Intelligence (Special Issue on Vision), 78:461-505.
Google Scholar
Rao, R.P.N. and Ballard, D.H. 1995b. Learning saccadic eye movements using multiscale spatial filters. In Advances in Neural Information Processing Systems 7, G. Tesauro, D.S. Touretzky, and T.K. Leen (Eds.), MIT Press: Cambridge, MA, pp. 893-900.
Google Scholar
Rao, R.P.N. and Ballard, D.H. 1995c. Natural basis functions and topographic memory for face recognition. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 10-17.
Rao, R.P.N. and Ballard, D.H. 1995d. Object indexing using an iconic sparse distributed memory. In Proceedings of the International Conference on Computer Vision (ICCV), pp. 24-31.
Rao, R.P.N. and Fuentes, O. 1995. Perceptual homing by an autonomous mobile robot using sparse self-organizing sensory-motor maps. In Proceedings of World Congress on Neural Networks (WCNN), pp. II380-II383.
Rao, R.P.N. and Fuentes, O. 1996. Learning navigational behaviors using a predictive sparse distributed memory. In From Animals to Animats 4: Proceedings of the Fourth Int. Conf. on Simulation of Adaptive Behavior (SAB), pp. 382-390.
Rao, R.P.N. and Ballard, D.H. 1997. Dynamic model of visual recognition predicts neural response properties in the visual cortex. Neural Computation, 9(4):721-763.
Google Scholar
Rogers, D. 1990. Predicting weather using a genetic memory: A combination of Kanerva's sparse distributed memory and Holland's genetic algorithms. In Advances in Neural Information Processing Systems 2, D.S. Touretzky (Ed.), Morgan Kaufmann, pp. 455-464.
Singh, S. 1992. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8:323-339.
Google Scholar
Tani, J. and Fukumura, N. 1994. Learning goal-directed sensory-based navigation of a mobile robot. Neural Networks, 7(3):553-563.
Google Scholar
Whitehead, S. and Ballard, D. 1991. Learning to perceive and act by trial and error. Machine Learning, 7(1):45-83.
Google Scholar
Wixson, L. 1991. Scaling reinforcement learning techniques via modularity. In Proceedings of the Eighth International Workshop on Machine Learning, Morgan Kaufmann, pp. 368-372.
Yair, E., Zeger, K., and Gersho, A. 1992. Competitive learning and soft competition for vector quantizer design. IEEE Trans. Signal Processing, 40(2):294-309.
Google Scholar

Download references

Author information

Authors and Affiliations

Computational Neurobiology Laboratory, The Salk Institute, Sloan Center for Theoretical Neurobiology and, 10010 N. Torrey Pines Road, La Jolla, CA, 92037, USA
Rajesh P.N. Rao
Centro de Investigación en Computación, Instituto Politecnico Nacional, Mexico, D.F. 07738, Mexico
Olac Fuentes

Authors

Rajesh P.N. Rao
View author publications
You can also search for this author inPubMed Google Scholar
Olac Fuentes
View author publications
You can also search for this author inPubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rao, R.P., Fuentes, O. Hierarchical Learning of Navigational Behaviors in an Autonomous Robot using a Predictive Sparse Distributed Memory. Autonomous Robots 5, 297–316 (1998). https://doi.org/10.1023/A:1008810406347

Download citation

Issue Date: July 1998
DOI: https://doi.org/10.1023/A:1008810406347

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hierarchical Learning of Navigational Behaviors in an Autonomous Robot using a Predictive Sparse Distributed Memory

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Sequence-Based Neuronal Model for Mobile Robot Localization

An integrated model of autonomous topological spatial cognition

Efficient Visual Navigation with Bio-inspired Route Learning Algorithms

Explore related subjects

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Subscribe and save

Buy Now