Abstract
This article addresses the issue of visual landmark recognition in autonomous robot navigation along known routes, by intuitively exploiting the functions of the human visual system and its navigational ability. A feedforward–feedbackward architecture has been developed for recognising visual landmarks in real time. It integrates theoretical concepts from the pre-attentive and attentive stages of the human visual system, the selective attention adaptive resonance theory (SAART) neural network and its derivatives, and computational approaches to object recognition in computer vision. The architecture mimics the pre-attentive and attentive stages in the context of object recognition, embedding a neural network processing paradigm into a computational template-matching approach from computer vision. Real-time landmark recognition is achieved by mimicking the pre-attentive stage: a selective attention mechanism allocates computational resources only to the regions of interest, which addresses the limits of current computer processing power. The recognition of visual landmarks in both clean and cluttered backgrounds is implemented in the attentive stage through a memory feedback modulation (MFM) mechanism, which enables knowledge from the memory to interact with, and enhance the efficiency of, earlier stages in the architecture. The architecture also incorporates top-down and bottom-up facilitatory and inhibitory pathways between the memory and the earlier stages, enabling it to recognise a 2D landmark that is partially occluded by adjacent features in the surroundings. The results show that the architecture is able to recognise objects in cluttered backgrounds using real images of both indoor and outdoor scenes. Furthermore, the architecture's application in autonomous robot navigation has been demonstrated through a number of real-time trials in both indoor and outdoor environments.
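The two-stage idea described above can be illustrated with a minimal sketch. It is not the SAART-based architecture reported in the article: the edge-energy saliency measure, the normalised cross-correlation score, and every function and parameter name (`select_roi`, `recognise`, `stride`, `search`) are assumptions made for illustration. The pre-attentive step picks one region of interest cheaply, and the attentive step matches a stored template only near that region, with the input gated by the template's own support as a crude stand-in for memory feedback modulation.

```python
import numpy as np


def edge_energy(img):
    """Bottom-up saliency proxy (assumed): gradient magnitude per pixel."""
    gy, gx = np.gradient(img.astype(float))
    return np.hypot(gx, gy)


def select_roi(img, win, stride=4):
    """Pre-attentive sketch: top-left corner of the window with most edge energy."""
    energy = edge_energy(img)
    h, w = win
    best, best_pos = -1.0, (0, 0)
    for r in range(0, img.shape[0] - h + 1, stride):
        for c in range(0, img.shape[1] - w + 1, stride):
            s = energy[r:r + h, c:c + w].sum()
            if s > best:
                best, best_pos = s, (r, c)
    return best_pos


def ncc(a, b):
    """Normalised cross-correlation of two equal-sized patches (0.0 if flat)."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float((a * b).sum() / denom) if denom > 0 else 0.0


def recognise(img, template, search=4):
    """Attentive sketch: match the template only in a small area around the ROI.

    The stored template's support gates the input patch (assumed stand-in
    for top-down memory feedback), so clutter outside the shape is ignored.
    """
    h, w = template.shape
    mask = template > 0
    r0, c0 = select_roi(img, (h, w))
    best_score, best_pos = -1.0, (r0, c0)
    for r in range(max(0, r0 - search), min(img.shape[0] - h, r0 + search) + 1):
        for c in range(max(0, c0 - search), min(img.shape[1] - w, c0 + search) + 1):
            score = ncc(img[r:r + h, c:c + w] * mask, template)
            if score > best_score:
                best_score, best_pos = score, (r, c)
    return best_pos, best_score
```

Calling `recognise(image, template)` returns the best-matching position and its score. Restricting the search to the neighbourhood of one region of interest is what keeps the cost low; a practical system would also need scale and rotation handling, and a gate less easily satisfied by uniformly bright regions, none of which this toy version attempts.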

















Cite this article
Do, Q., Jain, L. Application of neural processing paradigm in visual landmark recognition and autonomous robot navigation. Neural Comput & Applic 19, 237–254 (2010). https://doi.org/10.1007/s00521-009-0294-7
DOI: https://doi.org/10.1007/s00521-009-0294-7