Abstract
There is considerable evidence that object recognition in primates is based on the detection of local image features of intermediate complexity that are largely invariant to imaging transformations. A computer vision system has been developed that performs object recognition using features with similar properties. Invariance to image translation, scale and rotation is achieved by first selecting stable key points in scale space and performing feature detection only at these locations. The features measure local image gradients in a manner modeled on the response of complex cells in primary visual cortex, and thereby obtain partial invariance to illumination, affine change, and other local distortions. The features are used as input to a nearest-neighbor indexing method and Hough transform that identify candidate object matches. Final verification of each match is achieved by finding a best-fit solution for the unknown model parameters and integrating the features consistent with these parameter values. This verification procedure provides a model for the serial process of attention in human vision that integrates features belonging to a single object. Experimental results show that this approach can achieve rapid and robust object recognition in cluttered partially-occluded images.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ballard, D.H., “Generalizing the Hough transform to detect arbitrary patterns,” Pattern Recognition, 13,2 (1981), pp. 111–122.
Beis, Jeff, and David G. Lowe, “Shape indexing using approximate nearestneighbour search in high-dimensional spaces,” Conference on Computer Vision and Pattern Recognition, Puerto Rico (1997), pp. 1000–1006.
Booth, Michael C.A., and Edmund T. Rolls, “View-invariant representations of familiar objects by neurons in the inferior temporal cortex,” Cerebral Cortex, 8 (1998), pp. 510–523.
Crowley, James L., and Alice C. Parker, “A representation for shape based on peaks and ridges in the difference of low-pass transform,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 6,2 (1984), pp. 156–170.
Edelman, Shimon, Nathan Intrator, and Tomaso Poggio, “Complex cells and object recognition,” Unpublished Manuscript, preprint at http://kybele.psych.cornell.edu/~edelman/abstracts.html#ccells
Ito, Minami, Hiroshi Tamura, Ichiro Fujita, and Keiji Tanaka, “Size and position invariance of neuronal responses in monkey inferotemporal cortex,” Journal of Neurophysiology, 73,1 (1995), pp. 218–226.
Kobatake, Eucaly, and Keiji Tanaka, “Neuronal selectivities to complex object features in the ventral visual pathway of the macaque cerebral cortex,” Journal of Neurophysiology, 71,3 (1994), pp. 856–867.
Lindeberg, Tony, “Scale-space theory: A basic tool for analysing structures at different scales”, Journal of Applied Statistics, 21,2 (1994), pp. 224–270.
Lindeberg, Tony, “Detecting salient blob-like image structures and their scales with a scale-space primal sketch: a method for focus-of-attention,” International Journal of Computer Vision, 11,3 (1993), pp. 283–318.
Logothetis, Nikos K., Jon Pauls, and Tomaso Poggio, “Shape representation in the inferior temporal cortex of monkeys,” Current Biology, 5,5 (1995), pp. 552–563.
Lowe, David G., “Three-dimensional object recognition from single two dimensional images,” Artificial Intelligence, 31,3 (1987), pp. 355–395.
Lowe, David G., “Fitting parameterized three-dimensional models to images,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 13,5 (1991), pp. 441–450.
Lowe, David G., “Object recognition from local scale-invariant features,” International Conference on Computer Vision, Corfu, Greece (September 1999), pp. 1150–1157.
Mel, Bartlett W., “SEEMORE: Combining color, shape, and texture histogramming in a neurally-inspired approach to visual object recognition,” Neural Computation, 9,4 (1997), pp. 777–804.
Murase, Hiroshi, and Shree K. Nayar, “Visual learning and recognition of 3-D objects from appearance,” International Journal of Computer Vision, 14,1 (1995), pp. 5–24.
Perrett, David I., and Mike W. Oram, “Visual recognition based on temporal cortex cells: viewer-centered processing of pattern configuration,” Zeitschrift für Naturforschung C, 53c (1998), pp. 518–541.
Schiele, Bernt, and James L. Crowley, “Recognition without correspondence using multidimensional receptive field histograms,” International Journal of Computer Vision, 36,1 (2000), pp. 31–50.
Schmid, C., and R. Mohr, “Local grayvalue invariants for image retrieval,” IEEE PAMI, 19,5 (1997), pp. 530–534.
Swain, M., and D. Ballard, “Color indexing,” International Journal of Computer Vision, 7,1 (1991), pp. 11–32.
Tanaka, Keiji, “Neuronal mechanisms of object recognition,” Science, 262 (1993), pp. 685–688.
Tanaka, Keiji, “Mechanisms of visual object recognition: monkey and human studies,” Current Opinion in Neurobiology, 7 (1997), pp. 523–529.
Tovee, Martin J., Edmund T. Rolls, and V.S. Ramachandran, “Rapid visual learning in neurones of the primate temporal visual cortex,” NeuroReport, 7 (1996), pp. 2757–2760.
Treisman, Anne M., and Nancy G. Kanwisher, “Perceiving visually presented objects: recognition, awareness, and modularity,” Current Opinion in Neurobiology, 8 (1998), pp. 218–226.
Viola, Paul, “Complex feature recognition: A Bayesian approach for learning to recognize objects,” MIT AI Memo 1591, Massachusetts Institute of Technology (1996).
Wolfe, Jeremy M., and Sara C. Bennett, “Preattentive object files: shapeless bundles of basic features,” Vision Research, 37,1 (1997), pp. 25–43.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lowe, D.G. (2000). Towards a Computational Model for Object Recognition in IT Cortex. In: Lee, SW., Bülthoff, H.H., Poggio, T. (eds) Biologically Motivated Computer Vision. BMCV 2000. Lecture Notes in Computer Science, vol 1811. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45482-9_3
Download citation
DOI: https://doi.org/10.1007/3-540-45482-9_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67560-0
Online ISBN: 978-3-540-45482-3
eBook Packages: Springer Book Archive