Towards a Computational Model for Object Recognition in IT Cortex

Lowe, David G.

doi:10.1007/3-540-45482-9_3

David G. Lowe⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1811))

Included in the following conference series:

International Workshop on Biologically Motivated Computer Vision

838 Accesses
39 Citations

Abstract

There is considerable evidence that object recognition in primates is based on the detection of local image features of intermediate complexity that are largely invariant to imaging transformations. A computer vision system has been developed that performs object recognition using features with similar properties. Invariance to image translation, scale and rotation is achieved by first selecting stable key points in scale space and performing feature detection only at these locations. The features measure local image gradients in a manner modeled on the response of complex cells in primary visual cortex, and thereby obtain partial invariance to illumination, affine change, and other local distortions. The features are used as input to a nearest-neighbor indexing method and Hough transform that identify candidate object matches. Final verification of each match is achieved by finding a best-fit solution for the unknown model parameters and integrating the features consistent with these parameter values. This verification procedure provides a model for the serial process of attention in human vision that integrates features belonging to a single object. Experimental results show that this approach can achieve rapid and robust object recognition in cluttered partially-occluded images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ballard, D.H., “Generalizing the Hough transform to detect arbitrary patterns,” Pattern Recognition, 13,2 (1981), pp. 111–122.
Article MATH Google Scholar
Beis, Jeff, and David G. Lowe, “Shape indexing using approximate nearestneighbour search in high-dimensional spaces,” Conference on Computer Vision and Pattern Recognition, Puerto Rico (1997), pp. 1000–1006.
Google Scholar
Booth, Michael C.A., and Edmund T. Rolls, “View-invariant representations of familiar objects by neurons in the inferior temporal cortex,” Cerebral Cortex, 8 (1998), pp. 510–523.
Article Google Scholar
Crowley, James L., and Alice C. Parker, “A representation for shape based on peaks and ridges in the difference of low-pass transform,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 6,2 (1984), pp. 156–170.
Article Google Scholar
Edelman, Shimon, Nathan Intrator, and Tomaso Poggio, “Complex cells and object recognition,” Unpublished Manuscript, preprint at http://kybele.psych.cornell.edu/~edelman/abstracts.html#ccells
Ito, Minami, Hiroshi Tamura, Ichiro Fujita, and Keiji Tanaka, “Size and position invariance of neuronal responses in monkey inferotemporal cortex,” Journal of Neurophysiology, 73,1 (1995), pp. 218–226.
Google Scholar
Kobatake, Eucaly, and Keiji Tanaka, “Neuronal selectivities to complex object features in the ventral visual pathway of the macaque cerebral cortex,” Journal of Neurophysiology, 71,3 (1994), pp. 856–867.
Google Scholar
Lindeberg, Tony, “Scale-space theory: A basic tool for analysing structures at different scales”, Journal of Applied Statistics, 21,2 (1994), pp. 224–270.
Google Scholar
Lindeberg, Tony, “Detecting salient blob-like image structures and their scales with a scale-space primal sketch: a method for focus-of-attention,” International Journal of Computer Vision, 11,3 (1993), pp. 283–318.
Article Google Scholar
Logothetis, Nikos K., Jon Pauls, and Tomaso Poggio, “Shape representation in the inferior temporal cortex of monkeys,” Current Biology, 5,5 (1995), pp. 552–563.
Article Google Scholar
Lowe, David G., “Three-dimensional object recognition from single two dimensional images,” Artificial Intelligence, 31,3 (1987), pp. 355–395.
Article Google Scholar
Lowe, David G., “Fitting parameterized three-dimensional models to images,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 13,5 (1991), pp. 441–450.
Article MathSciNet Google Scholar
Lowe, David G., “Object recognition from local scale-invariant features,” International Conference on Computer Vision, Corfu, Greece (September 1999), pp. 1150–1157.
Google Scholar
Mel, Bartlett W., “SEEMORE: Combining color, shape, and texture histogramming in a neurally-inspired approach to visual object recognition,” Neural Computation, 9,4 (1997), pp. 777–804.
Article Google Scholar
Murase, Hiroshi, and Shree K. Nayar, “Visual learning and recognition of 3-D objects from appearance,” International Journal of Computer Vision, 14,1 (1995), pp. 5–24.
Article Google Scholar
Perrett, David I., and Mike W. Oram, “Visual recognition based on temporal cortex cells: viewer-centered processing of pattern configuration,” Zeitschrift für Naturforschung C, 53c (1998), pp. 518–541.
Google Scholar
Schiele, Bernt, and James L. Crowley, “Recognition without correspondence using multidimensional receptive field histograms,” International Journal of Computer Vision, 36,1 (2000), pp. 31–50.
Article Google Scholar
Schmid, C., and R. Mohr, “Local grayvalue invariants for image retrieval,” IEEE PAMI, 19,5 (1997), pp. 530–534.
Google Scholar
Swain, M., and D. Ballard, “Color indexing,” International Journal of Computer Vision, 7,1 (1991), pp. 11–32.
Article Google Scholar
Tanaka, Keiji, “Neuronal mechanisms of object recognition,” Science, 262 (1993), pp. 685–688.
Article Google Scholar
Tanaka, Keiji, “Mechanisms of visual object recognition: monkey and human studies,” Current Opinion in Neurobiology, 7 (1997), pp. 523–529.
Article Google Scholar
Tovee, Martin J., Edmund T. Rolls, and V.S. Ramachandran, “Rapid visual learning in neurones of the primate temporal visual cortex,” NeuroReport, 7 (1996), pp. 2757–2760.
Article Google Scholar
Treisman, Anne M., and Nancy G. Kanwisher, “Perceiving visually presented objects: recognition, awareness, and modularity,” Current Opinion in Neurobiology, 8 (1998), pp. 218–226.
Article Google Scholar
Viola, Paul, “Complex feature recognition: A Bayesian approach for learning to recognize objects,” MIT AI Memo 1591, Massachusetts Institute of Technology (1996).
Google Scholar
Wolfe, Jeremy M., and Sara C. Bennett, “Preattentive object files: shapeless bundles of basic features,” Vision Research, 37,1 (1997), pp. 25–43.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Dept., Univ. of British Columbia, Vancouver, B.C., V6T 1Z4, Canada
David G. Lowe

Authors

David G. Lowe
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Artificial Vision Research, Korea University, Anam-dong, Seongbuk-ku, Seoul, 136-701, Korea
Seong-Whan Lee
Max-Planck-Institute for Biological Cybernetics, Spemannstr. 38, 72076, Tübingen, Germany
Heinrich H. Bülthoff
Department of Brain and Cognitive Sciences Artificial Intelligence Laboratory, E25-218, Massachusetts Institute of Technology, 45 Carleton Street, Cambridge, MA, 02142, USA
Tomaso Poggio

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lowe, D.G. (2000). Towards a Computational Model for Object Recognition in IT Cortex. In: Lee, SW., Bülthoff, H.H., Poggio, T. (eds) Biologically Motivated Computer Vision. BMCV 2000. Lecture Notes in Computer Science, vol 1811. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45482-9_3

Download citation

DOI: https://doi.org/10.1007/3-540-45482-9_3
Published: 01 February 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67560-0
Online ISBN: 978-3-540-45482-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics