Skip to main content
Log in

Application of neural processing paradigm in visual landmark recognition and autonomous robot navigation

  • KES 2008
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

This article addresses the issue of visual landmark recognition in autonomous robot navigation along known routes, by intuitively exploiting the functions of the human visual system and its navigational ability. A feedforward–feedbackward architecture has been developed for recognising visual landmarks in real time. It integrates the theoretical concepts from the pre-attentive and attentive stages in the human visual system, the selective attention adaptive resonance theory neural network and its derivatives, and computational approaches towards object recognition in computer vision. The architecture mimics the pre-attentive and attentive stages in the context of object recognition, embedding neural network processing paradigm into a computational template-matching approach in computer vision. The real-time landmark recognition capability is achieved by mimicking the pre-attentive stage, where it models a selective attention mechanism for optimal computational resource allocation, focusing only on the regions of interest to address the computational restrictive nature of current computer processing power. Similarly, the recognition of visual landmarks in both clean and cluttered backgrounds is implemented in the attentive stage by developing a memory feedback modulation (MFM) mechanism that enables knowledge from the memory to interact and enhance the efficiency of earlier stages in the architecture. Furthermore, it also incorporates both top-down and bottom-up facilitatory and inhibition pathways between the memory and the earlier stages to enable the architecture to recognise a 2D landmark, which is partially occluded by adjacent features in the surroundings. The results show that the architecture is able to recognise objects in cluttered backgrounds using real-images in both indoor and outdoor scenes. Furthermore, the architecture application in autonomous robot navigation has been demonstrated through a number of real-time trials in both indoor and outdoor environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

References

  1. Magnard M, Malpeli JG (1991) Paths of information flow through visual cortex. Science 251:1249–1251

    Google Scholar 

  2. Sherwood L (1997) Human physiology: from cells to systems, 3rd edn. Wadsworth, Washington

  3. Thomps RF (2000) The brain: a neuroscience primer, 3rd edn. Worth Publishers, New York

    Google Scholar 

  4. Tarr MJ, Pinker S (1990) When does human object recognition use a viewer-centered reference frame. Psychol Sci 1:253–256

    Article  Google Scholar 

  5. Mashor MY, Osman MK, Arshad MR (2004) 3D object recognition using 2D moments and HMLP network. International conference on computer graphics, imaging and visualization, 2004 (CGIV 2004), pp 126–130

  6. Wei W, Zhang Q, Wang M (2001) A method of vehicle classification using models and neural networks. IEEE 53rd vehicular technology conference, pp 3022–3026

  7. Chandrasekaran V, Palaniswami M, Caelli T (1991) An object recognition system using self-organising neural networks. IEEE international joint conference on neural networks, pp 2582–2587

  8. Fukushima K (1975) Cognition: a self-organizing multilayered neural network. Biol Cybern 20: 121–136

    Google Scholar 

  9. Hopfield JJ (1982) Neural networks and physical systems with emergent collective computational abilities. Presented at National Academy of Science of the USA, pp 2554–2558

  10. Wen-Jing L, Lee T (2001) Hopfield network for affine invariant object recognition. International joint conference on neural networks (IJCNN'01), vol 1, pp 588–593

  11. Wen-Jing L, Tong L (2002) Invariant feature matching by Hopfield-type neural network, 2743 pp

  12. Carpenter GA, Grossberg S (1989) Search mechanisms for adaptive resonance theory (ART) architectures. Presented at international joint conference on neural networks, pp 201–205

  13. Grossberg S, Wyse L (1991) Invariant recognition of cluttered scenes by a self-organizing ART architecture: figure–ground separation. Presented at international joint conference on neural networks, IJCNN-91-Seattle, pp 633–638

  14. Carpenter GA, Grossberg S, Reynolds JH (1991) ARTMAP: supervised real-time learning and classification of nonstationary data by a self-organizing neural network. Neural Netw 4:565–588

    Article  Google Scholar 

  15. Carpenter GA, Grossberg S, Rosen D (1991) ART 2-A: an adaptive resonance algorithm for rapid category learning and recognition. Presented at international joint conference on neural networks, IJCNN-91-Seattle, pp 151–156

  16. Antony M, Bartlett PL (1999) Neural network learning: theoretical foundations. Cambridge University Press, London

  17. Salam FMA, Bai S (1991) A new feedback neural network with supervised learning. IEEE Trans Neural Netw 2:170–173

    Article  Google Scholar 

  18. Salam FMA, Bai S (1990) A feedback neural network with supervised learning. International joint conference on neural networks (IJCNN’90), vol 3, pp 263–268

  19. Healy MJ (1991) A logical architecture for supervised learning. International joint conference on neural networks (IJCNN-91), vol 2, pp 968–976

  20. Maeda Y, Yotsumoto Y, Kanata Y (1995) Unsupervised learning of neural networks for separation of unknown data. IEEE IECON 21st international conference on industrial electronics, control, and instrumentation, vol 2, pp 956–961

  21. Shibata K, Okabe Y (1994) Unsupervised learning method to extract object locations from local visual signals. IEEE international conference on neural networks, IEEE world congress on computational intelligence, pp 1556–1559

  22. Grossberg S (1976) Adaptive pattern classification and universal recoding, II: feedback, expectation, olfaction, and illusions. Biol Cybern 23:187–202

    Article  MATH  MathSciNet  Google Scholar 

  23. Grossberg S (1980) How does a brain build a cognitive code? Psychol Rev 87:1–51

    Article  Google Scholar 

  24. Grossberg S (1972) Neural expectation: cerebellar and retinal analogs of cells fired by learnable or unlearned pattern classes. Kybernetik 10:49–57

    Article  Google Scholar 

  25. Carpenter GA, Grossberg S (1987) A massively parallel architecture for a self-organizing neural pattern recognition machine. Comput Vis Graph Image Process 37:54–115

    Article  Google Scholar 

  26. Carpenter GA, Grossberg S (1987) ART 2: self-organization of stable category recognition codes for analog input patterns. Appl Opt 26:4919–4930

    Article  Google Scholar 

  27. Carpenter GA, Grossberg S (1990) ART 3: hierarchical search using chemical transmitters in self-organising pattern recognition architectures. Neural Netw 3:129–152

    Article  Google Scholar 

  28. Lozo P (1997) Neural theory and model of selective visual attention and 2D shape recognition in visual clutter. Department of Electrical and Electronic Engineering, University of Adelaide, Adelaide

  29. Lozo P, Lim C-C (1996) Neural circuit for object recognition in complex and cluttered visual images. Presented at The Australian and New Zealand conference on intelligent information systems, pp 254–257

  30. Lozo P, Westmacott J, Do QV, Jain L, Wu L (2004) Selective attention adaptive resonance theory and object recognition. In: Fulcher J, Jain LC (eds) Studies in fuzziness and soft computing, applied intelligent systems. Springer, Berlin, pp 301–320

  31. Lozo P (1995) Selective attention adaptive resonance theory (SAART) neural network for neuro-engineering of robust ATR systems. Presented at IEEE international conference on neural networks, pp 2461–2466

  32. Maunsell JHR, Ferrera VP (1994) Attentional mechanisms in visual cortex. In: Gazzaniga MS (ed) The cognitive neurosciences. MIT Press, Cambridge, pp 451–461

  33. Moran J, Desimone R (1985) Selective attention gates visual processing in the extrastriate cortex. Science 229:782–784

    Article  Google Scholar 

  34. Desimone R, Duncan J (1995) Neural mechanisms of selective visual attention. Annu Rev Neurosci 18:193–222

    Article  Google Scholar 

  35. Desimone R (1996) Neural mechanisms for visual memory and their role in attention. Presented at proceedings of the National Academy of Sciences, USA, pp 13494–13499

  36. Desimone R, Wessinger M, Thomas L, Schneider W (1990) Attentional control of visual perception: cortical, and subcortical mechanisms. Cold Spring Harb Symp Quant Biol 55:963–971

    Google Scholar 

  37. Ullman S (1996) High-level vision: object recognition and visual cognition. MIT Press, Cambridge

  38. Sillito AM, Jones HE, Gerstein GL, West DC (1994) Feature-linked synchronization of thalamic relay cell firing induced by feedback from the visual cortex. Nature 369:479–482

    Article  Google Scholar 

  39. Mignard M, Malpeli JG (1991) Paths of information flow through visual cortex. Science 251:1249–1251

    Article  Google Scholar 

  40. Chong EW-S (2001) A neural framework for visual scene analysis with selective attention. Department of Electrical and Electronic Engineering, University of Adelaide, Adelaide

  41. Westmacott J (2000) An artificial neural network for robust shape recognition in real time. School of Electrical and Information Engineering, University of South Australia

  42. Westmacott J, Lozo P, Jain L (1999) Distortion invariant selective attention adaptive resonance theory neural network. Presented at third international conference on knowledge-based intelligent information engineering systems, IEEE Press, USA, pp 13–16

  43. Brue DC, Green PR, Georgeson MA (1982) Visual perception: physiology, psychology and ecology, 3rd edn. Pyschology Press, Hove, Sussex

  44. Maunsell JHR, Newsome WT (1987) Visual processing in monkey extrastriate cortex. Annu Rev Neurosci 10:363–401

    Article  Google Scholar 

  45. Lozo P (1996) Neural circuit for match mismatch, familiarity/novelty and synchronization detection in SAART neural networks. Presented at the fourth international symposium on signal processing and its applications, ISSPA’96, pp 549–552

  46. Lozo P, Nandagopal N (1996) Selective transfer of spatial patterns by presynaptic facilitation in a shunting competitive neural layer. Presented at the Australian and New Zealand conference on intelligent information systems, pp 178–181

  47. Lozo P (1996) Neural circuit for self-regulated attentional learning in selective attention adaptive resonance theory (SAART) neural networks. Presented at the fourth international symposium on signal processing and its applications, ISSPA-96, pp 545–548

  48. Chong EW, Lim C-C, Lozo P (1999) Neural model of visual selective attention for automatic translation invariant object recognition in cluttered images. Presented at third international conference on knowledge-based intelligent information engineering systems, pp 373–376

  49. Chong EW, Lim C-C, Atsikbasis N, Lozo P (1997) Design of a 2-D neural motion detection filter. Presented at IEEE region 10th annual conference on speech and image technologies for computing and telecommunications, pp 667–670

  50. Dev A, Krose B, Groen F (1997) Navigation of a mobile robot on the temporal development of the optic flow. Presented at the 1997 IEEE/RSJ international conference on intelligent robots and systems, pp 558–563

  51. Cheng G, Zelinsky A (1998) Goal-oriented behaviour-based visual navigation. Presented at IEEE international conference on robotics and automation, pp 3431–3436

  52. Thompson S, Zelinsky A (2003) Accurate vision based position tracking between places in a topological map. Presented at IEEE international symposium on computational intelligence in robotics and automation, pp 491–496

  53. Colios CI, Trahanias PE (2000) Landmark identification based on projective and permutation invariant vectors. Presented at the 15th international conference on pattern recognition, pp 128–131

  54. Pope AR (1994) Model-based object recognition: a survey of recent research. Department of Computer Science, The University of British Columbia Technical Report 94-04

  55. Weiss I, Ray M (2001) Model-based recognition of 3D objects from single images. IEEE Trans Pattern Anal Mach Intell 23: 116–128

    Google Scholar 

  56. Wu Q, Liu Z, Xiong Z, Wang Y, Chen T, Castleman KR (2002) On optimal subspaces for appearance-based object recognition. International conference on image processing, pp 885–888

  57. Luck SJ, Fan S, Hillyard SA (1993) Attention-related modulation of sensory-evoked brain activity in a visual search task. J Cogn Neurosci 5:188–195

    Article  Google Scholar 

  58. Do QV, Lozo P, Jain L (2004) A fast visual search and recognition mechanism for real-time robotic applications. Presented at the 17th Australian joint conference on artificial intelligence, Cairns, Australia, pp 937–342

  59. Weichselgartner E, Sperling G (1987) Dynamics of automatic and controlled visual attention. Science 238:778–780

    Article  Google Scholar 

  60. Wu J, Zhang X (2001) A PCA classifier and its application in vehicle detection. International joint conference on neural networks (IJCNN'01), vol 1, pp 600–604

  61. Martinez AM, Kak AC (2001) PCA versus LDA. IEEE Trans Pattern Anal Mach Intell 23: 228–233

    Google Scholar 

  62. Wolf L, Bileschi S (2005) Combining variable selection with dimensionality reduction. IEEE computer society conference on computer vision and pattern recognition (CVPR 2005), pp 801–806

  63. Do QV, Lozo P, Jain LC (2004) Autonomous robot navigation using SAART for visual landmark recognition. Presented at the 2nd international conference on artificial intelligence in science and technology, Tasmania, Australia, pp 64–69

  64. Do QV, Lozo P, Jain LC (2005) Autonomous robot navigation using SAART for visual landmark recognition. Presented at the 2nd international conference on artificial intelligence in science and technology, Tasmania, Australia, pp 64–69

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Quoc Do.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Do, Q., Jain, L. Application of neural processing paradigm in visual landmark recognition and autonomous robot navigation. Neural Comput & Applic 19, 237–254 (2010). https://doi.org/10.1007/s00521-009-0294-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-009-0294-7

Keywords

Navigation