Abstract
We address the problem of generic shape recognition, in which exact models are not available. We propose an original approach, in which learning and recognition are intimately linked, as recognition is based on previous observation.
The input to our system is in the form of segmented descriptions of objects in terms of parts. In 2-D, the shape is incrementally decomposed into parts suggested by curvature sign changes, and for each part an axial description is derived from both local and global information. The parts are organized into a connection hierarchy. For 3-D objects, we intend to use segmented tridimensional descriptions, the parts being modeled by generalized cylinders. In this case, the connection graph is not necessarily a hierarchy, but can still be used with our algorithms.
The part description obtained at this point is still too detailed and fine grained in order to easily categorize and compare shapes. Hence, we use a simplified description of parts, capturing part local geometry and connection with the superpart information. The local geometry parameters are qualitative and symbolic, and are quasi-invariants under projection and viewpoint change. Both types of parameters take discrete values derived from the available fine description. The connection parameters are normalized to be scale-independent. These simplified part descriptions are organized into a connection hierarchy as provided by the original decomposition. The parameters are chosen to ensure that the information carried in these descriptions is sufficient to perform shape recognition.
Actual shape descriptions are stored in a data-base, from which they must be efficiently and specifically retrieved when a new shape is proposed for recognition. We define a hierarchical indexing system based on the structure of the descriptions and the local description of parts. This mechanism allows for dynamic updating of the data-base with a minimum computing cost.
When a shape is submitted for recognition, the data-base is searched for the closest known shapes. A partial match, based on the connection structure and the aggregation of dissimilarities between parts, is computed incrementally level by level between the shape and the possible candidates. The combination of the incremental process with the hierarchical indexing makes the number of shapes processed at each step decrease rapidly, therefore dramatically reducing the average complexity of the retrieval. The selected retrieved shape(s) are used to give a classification for the submitted shape.
This approach to recognition is influenced by the Case-Based Reasoning (CBR) paradigm, which embeds all the characteristics required to meet our goals, such as the ability to process noisy, incomplete and new data. It also provides an interesting framework for higher-level intelligent processing (e.g. justified interpretation, automatic learning).
We describe our implementation for 2-D shapes recognition and present results. The current implementation should also work on 3-D descriptions as described above, with minor changes. We also intend to use this system as the core of a higher-level vision-based reasoning system.
This research was supported in part by the Advanced Research Projects Agency of the Department of Defense and was monitored by the Air Force Office of Scientific Research under Contract No. F49620-90-C-0078 and/or Grant No F49620-93-1-0620. The United States Government is authorized to reproduce and distribute reprints for governmental purposes notwithstanding any copyright notation hereon.
Sponsored for this research by NOESIS S.A., Immeuble Ariane, Domaine Technologique de Saclay, 4 rue René Razel - Saclay, F-91892 Orsay Cedex, FRANCE.
Preview
Unable to display preview. Download preview PDF.
References
R. Bareiss. Exemplar-based knowledge acquisition: a unified approach to concept representation, classification and learning. Academic Press, 1989.
I. Biederman. Human image understanding: recent research and a theory. In Computer vision, graphics and image understanding, vol. 32, no. 1, pp. 29–73, October 1985.
R.C. Bolles and R.A. Cain. Recognizing and locating partially visible objects: the local-feature-focus method. In International Journal of Robotics Research, vol. 1, no. 3, pp. 57–82, 1982.
A. Califano and R. Mohan. Multidimensional indexing for recognizing visual shapes. In Proc. IEEE Computer Vision and Pattern Recognition, pp. 28–34, Maui, Hawaii, June 1991.
G. J. Ettinger. Large hierarchical object recognition using libraries of parametrized model subparts. In Proc. IEEE Computer Vision and Pattern Recognition, pp. 32–41, Ann Arbor, Michigan, June 1988.
W. E. L. Grimson. Object recognition by computer — The role of geometric constraints. MIT Press, Cambridge, Massachusetts, 1990.
W. E. L. Grimson and T. Lozano-Perez. Model-based recognition and localization from sparse range or tactile data. In International Journal of Robotics Research, vol. 3, no. 3, pp. 3–35, 1984.
W. E. L. Grimson and T. Lozano-Perez. Localizing overlapping parts by searching the interpretation tree. In IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 9, no. 4, pp. 469–482, 1987.
P. Havaldar, G. Medioni and F. Stein. Extraction of groups for recognition. In Proc. European Conference on Computer Vision, vol. 1, pp. 251–261, Stockholm, Sweden, May 1994.
A. Kalvin, E. Schonberg, J.T. Schwartz and M. Sharir. Two-dimensional, model-based, boundary matching using footprints. In International Journal of Robotics Research, vol. 5, no. 4, pp. 38–55, 1986.
J. Kolodner. An introduction to Case-Based Reasoning. In Artificial Intelligence review, vol. 6, pp. 3–34, 1992.
Y. Lamdan and H. J. Wolfson. Geometric hashing: a general and efficient model-based recognition scheme. In Proceedings of IEEE International Conference on Computer Vision, pp. 218–249, Tampa, Florida, december 1988.
H. Murase and S. K. Nayar. Visual learning and recognition of 3-D objects from appearance. In International Journal of Computer Vision, vol. 14, no. 1, pp. 5–24, January 1995.
R. Nevatia and Th. O. Binford. Description and recognition of curved objects. In Artificial Intelligence, vol. 8, no. 1, pp. 77–98, February 1977.
G. Provan, P. Langley and Th. O. Binford. Probabilistic learning of three-dimensional object models. In Proc. Image Understanding Workshop, pp. 1403–1413, Palm Springs, California, February 1996.
K. Rao. Shape description from sparse and imperfect data. PhD Thesis. University of Southern California, December 1988. IRIS Technical Report 250.
H. Rom and G. Medioni. Hierarchical decomposition and axial shape description. In IEEE Transactions on Pattern Analysis and Machine Intelligence. vol. 15, no. 10, pp. 973–981, October 1993.
S. Slade. Case-Based Reasoning: a research paradigm. In AI Mag., pp. 42–55, Spring 1991.
F. Stein and G. Medioni. Structural indexing: efficient two dimensional object recognition. In IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1198–1204, February 1992.
M. Zerroug and R. Nevatia. From an intensity image to 3-D segmented descriptions. In Proc. IEEE International Conference on Pattern Recognition, vol. 1, pp. 108–113, Jerusalem, Israel, October 1994.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
François, A.R.J., Medioni, G. (1996). Generic shape learning and recognition. In: Ponce, J., Zisserman, A., Hebert, M. (eds) Object Representation in Computer Vision II. ORCV 1996. Lecture Notes in Computer Science, vol 1144. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61750-7_34
Download citation
DOI: https://doi.org/10.1007/3-540-61750-7_34
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61750-1
Online ISBN: 978-3-540-70673-1
eBook Packages: Springer Book Archive