Generic shape learning and recognition

François, Alexandre R. J.; Medioni, Gérard

doi:10.1007/3-540-61750-7_34

Alexandre R. J. François¹ &
Gérard Medioni¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1144))

Included in the following conference series:

International Workshop on Object Representation in Computer Vision

Abstract

We address the problem of generic shape recognition, in which exact models are not available. We propose an original approach, in which learning and recognition are intimately linked, as recognition is based on previous observation.

The input to our system is in the form of segmented descriptions of objects in terms of parts. In 2-D, the shape is incrementally decomposed into parts suggested by curvature sign changes, and for each part an axial description is derived from both local and global information. The parts are organized into a connection hierarchy. For 3-D objects, we intend to use segmented tridimensional descriptions, the parts being modeled by generalized cylinders. In this case, the connection graph is not necessarily a hierarchy, but can still be used with our algorithms.

The part description obtained at this point is still too detailed and fine grained in order to easily categorize and compare shapes. Hence, we use a simplified description of parts, capturing part local geometry and connection with the superpart information. The local geometry parameters are qualitative and symbolic, and are quasi-invariants under projection and viewpoint change. Both types of parameters take discrete values derived from the available fine description. The connection parameters are normalized to be scale-independent. These simplified part descriptions are organized into a connection hierarchy as provided by the original decomposition. The parameters are chosen to ensure that the information carried in these descriptions is sufficient to perform shape recognition.

Actual shape descriptions are stored in a data-base, from which they must be efficiently and specifically retrieved when a new shape is proposed for recognition. We define a hierarchical indexing system based on the structure of the descriptions and the local description of parts. This mechanism allows for dynamic updating of the data-base with a minimum computing cost.

When a shape is submitted for recognition, the data-base is searched for the closest known shapes. A partial match, based on the connection structure and the aggregation of dissimilarities between parts, is computed incrementally level by level between the shape and the possible candidates. The combination of the incremental process with the hierarchical indexing makes the number of shapes processed at each step decrease rapidly, therefore dramatically reducing the average complexity of the retrieval. The selected retrieved shape(s) are used to give a classification for the submitted shape.

This approach to recognition is influenced by the Case-Based Reasoning (CBR) paradigm, which embeds all the characteristics required to meet our goals, such as the ability to process noisy, incomplete and new data. It also provides an interesting framework for higher-level intelligent processing (e.g. justified interpretation, automatic learning).

We describe our implementation for 2-D shapes recognition and present results. The current implementation should also work on 3-D descriptions as described above, with minor changes. We also intend to use this system as the core of a higher-level vision-based reasoning system.

This research was supported in part by the Advanced Research Projects Agency of the Department of Defense and was monitored by the Air Force Office of Scientific Research under Contract No. F49620-90-C-0078 and/or Grant No F49620-93-1-0620. The United States Government is authorized to reproduce and distribute reprints for governmental purposes notwithstanding any copyright notation hereon.

Sponsored for this research by NOESIS S.A., Immeuble Ariane, Domaine Technologique de Saclay, 4 rue René Razel - Saclay, F-91892 Orsay Cedex, FRANCE.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R. Bareiss. Exemplar-based knowledge acquisition: a unified approach to concept representation, classification and learning. Academic Press, 1989.
Google Scholar
I. Biederman. Human image understanding: recent research and a theory. In Computer vision, graphics and image understanding, vol. 32, no. 1, pp. 29–73, October 1985.
Google Scholar
R.C. Bolles and R.A. Cain. Recognizing and locating partially visible objects: the local-feature-focus method. In International Journal of Robotics Research, vol. 1, no. 3, pp. 57–82, 1982.
Google Scholar
A. Califano and R. Mohan. Multidimensional indexing for recognizing visual shapes. In Proc. IEEE Computer Vision and Pattern Recognition, pp. 28–34, Maui, Hawaii, June 1991.
Google Scholar
G. J. Ettinger. Large hierarchical object recognition using libraries of parametrized model subparts. In Proc. IEEE Computer Vision and Pattern Recognition, pp. 32–41, Ann Arbor, Michigan, June 1988.
Google Scholar
W. E. L. Grimson. Object recognition by computer — The role of geometric constraints. MIT Press, Cambridge, Massachusetts, 1990.
Google Scholar
W. E. L. Grimson and T. Lozano-Perez. Model-based recognition and localization from sparse range or tactile data. In International Journal of Robotics Research, vol. 3, no. 3, pp. 3–35, 1984.
Google Scholar
W. E. L. Grimson and T. Lozano-Perez. Localizing overlapping parts by searching the interpretation tree. In IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 9, no. 4, pp. 469–482, 1987.
Google Scholar
P. Havaldar, G. Medioni and F. Stein. Extraction of groups for recognition. In Proc. European Conference on Computer Vision, vol. 1, pp. 251–261, Stockholm, Sweden, May 1994.
Google Scholar
A. Kalvin, E. Schonberg, J.T. Schwartz and M. Sharir. Two-dimensional, model-based, boundary matching using footprints. In International Journal of Robotics Research, vol. 5, no. 4, pp. 38–55, 1986.
Google Scholar
J. Kolodner. An introduction to Case-Based Reasoning. In Artificial Intelligence review, vol. 6, pp. 3–34, 1992.
Google Scholar
Y. Lamdan and H. J. Wolfson. Geometric hashing: a general and efficient model-based recognition scheme. In Proceedings of IEEE International Conference on Computer Vision, pp. 218–249, Tampa, Florida, december 1988.
Google Scholar
H. Murase and S. K. Nayar. Visual learning and recognition of 3-D objects from appearance. In International Journal of Computer Vision, vol. 14, no. 1, pp. 5–24, January 1995.
Google Scholar
R. Nevatia and Th. O. Binford. Description and recognition of curved objects. In Artificial Intelligence, vol. 8, no. 1, pp. 77–98, February 1977.
Google Scholar
G. Provan, P. Langley and Th. O. Binford. Probabilistic learning of three-dimensional object models. In Proc. Image Understanding Workshop, pp. 1403–1413, Palm Springs, California, February 1996.
Google Scholar
K. Rao. Shape description from sparse and imperfect data. PhD Thesis. University of Southern California, December 1988. IRIS Technical Report 250.
Google Scholar
H. Rom and G. Medioni. Hierarchical decomposition and axial shape description. In IEEE Transactions on Pattern Analysis and Machine Intelligence. vol. 15, no. 10, pp. 973–981, October 1993.
Google Scholar
S. Slade. Case-Based Reasoning: a research paradigm. In AI Mag., pp. 42–55, Spring 1991.
Google Scholar
F. Stein and G. Medioni. Structural indexing: efficient two dimensional object recognition. In IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1198–1204, February 1992.
Google Scholar
M. Zerroug and R. Nevatia. From an intensity image to 3-D segmented descriptions. In Proc. IEEE International Conference on Pattern Recognition, vol. 1, pp. 108–113, Jerusalem, Israel, October 1994.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Robotics and Intelligent Systems, University of Southern California, 90089-0273, Los Angeles, California
Alexandre R. J. François & Gérard Medioni

Authors

Alexandre R. J. François
View author publications
You can also search for this author in PubMed Google Scholar
Gérard Medioni
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Jean Ponce Andrew Zisserman Martial Hebert

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

François, A.R.J., Medioni, G. (1996). Generic shape learning and recognition. In: Ponce, J., Zisserman, A., Hebert, M. (eds) Object Representation in Computer Vision II. ORCV 1996. Lecture Notes in Computer Science, vol 1144. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61750-7_34

Download citation

DOI: https://doi.org/10.1007/3-540-61750-7_34
Published: 02 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61750-1
Online ISBN: 978-3-540-70673-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics