Shape recognition based on Kernel-edit distance
Introduction
Computer vision aims at building artificial systems able to understand scenes and to recognize automatically all present objects. Objects have several properties like shape, texture, color, etc. that can used for recognition. Among those features, probably shape is the most important property which can be perceived and used for recognition and classification. Because of this property of shapes, it is useful to develop algorithms for shape recognition. One important issue for the recognition of objects is dealing with the partial occlusions. As the proposed method is mainly for the recognition of segmented shapes from the background, these shapes might come from another module which does the segmentation task. Normally, the segmentation module cannot provide perfect shapes due to occlusions, so some parts of an object might be missing or might be combined with a part of background or another object. To have a successful method for shape recognition this point has to be considered in designing an algorithm. As we will see the proposed method has considered this point and it is able to work well with partially occluded shapes.
In this paper we present a new kernel method for shape recognition based on a previously published method [1] that uses edit distance metric for the similarity measure of a pair of shapes. With this kernel method, it is possible to use support vector machine (SVM) for the classification while for the old approach nearest-neighbor classifier was used. The new kernel method performs better than previous approaches on a variety of shape databases.
Section snippets
Previous approaches
There are two main approaches for the shape recognition in the literature namely contour-based approaches, considering the outline of shapes as input for recognition and surface-based methods, considering shapes as a whole for recognition. Both approaches use local or global representation. Maybe contour-based approaches that they use local representation are the most successful methods that can deal with the problems we have mentioned in the introduction part. In addition, the approaches can
Symbolic representation for a pair of shapes
Here we describe the way we transform a pair of shape contours from a database into a string of symbols. Let suppose we have selected N points over the contour of two shapes using a sampling method. In order to be able to compute the similarity between the two shapes, we need to find the best correspondence between point sets from the two shapes. We used the shape context as a similarity measure between points from the two shapes. The shape context for each point over the contour is a polar
Distance between strings (edit distance)
As we have two sequences of strings, there are several ways to compute the distance between them. One way is using the edit distance [15]. The edit distance is a general form of Hamming distance that is applied when the length of two strings are identical. The edit distance is the minimum number of necessary operations applied to one of the strings to make it identical to the other one. Allowed operation is an insertion, deletion or substitution of a single character. We have proposed [1] to
Kernel-edit distance
Kernel methods have been successfully used for pattern recognition. The key idea of kernel methods is to map the data into a high dimensional feature space in which each coordinate corresponds to one feature of data points. In the new high dimensional space it is possible to define the mathematical operations that in the original space have not been defined. The kernel methods normally do not operate directly in the space but rather they use inner products of all pairs of the data. This makes
Experimental results
In this section we test our method for shape recognition on a variety of shape databases and compare the results with the state of the art approaches for the shape recognition, available for those database in the literature. The databases were used here are consisting of Kimia-99 database, Chicken Piece Dataset, Natural Silhouette database, Marine database, Gesture dataset, MPEG-7 shape dataset and ETH-80 object database.
Conclusions
Here a kernel approach to shape recognition was proposed. The Kernel-edit distance proposed here, was tested on a variety of shape databases including Kimia-99, Chicken Pieces, Natural Silhouettes, Gesture database, Marine database, MPEG-7 shape database and ETH-80 object database and the recognition rate was superior to all of the previous approaches that they have used these databases. The results show that the method is robust to the partial occlusions, which is a necessary part for a shape
References (39)
- et al.
Robust shape similarity retrieval based on contour segmentation polygonal multiresolution and elastic matching
Pattern Recogn.
(2005) - et al.
Active shape models – their training and application
Comput. Vision Image Understand.
(1995) - et al.
Edit distance-based kernel functions for structural pattern classification
Pattern Recogn.
(2006) - et al.
Robust symbolic representation for shape recognition and retrieval
Pattern Recogn.
(2008) Computer processing of line-drawing images
Comput. Surv.
(1974)- et al.
Run-length chain coding and scalable computation of a shape’s moments using reconfigurable optical buses
IEEE Trans. Syst. Man Cybern., Part B
(2004) - et al.
Shape matching and object recognition using shape contexts
IEEE Trans. PAMI
(2002) - H. Blum, A transformation for extracting new descriptors of shape, in: W. Walthen-Dunn (Ed.), Models for the Perception...
- et al.
Recognition of shapes by editing their shock graphs
IEEE Trans. PAMI
(2004) - et al.
Skeletonization of Ribbon-like shapes based on regularity and singularity analyses
IEEE Trans. Syst. Man Cybern., Part B
(2001)
A new method for analyzing local shape in three-dimensional images based on medial axis transformation
IEEE Trans. Syst. Man Cybern., Part B
Shape classification using the inner-distance
IEEE Trans. PAMI
Matching and retrieval of distorted and occluded shapes using dynamic programming
IEEE Trans. PAMI
Learning string-edit distance
IEEE Trans. PAMI
The Nature of Statistical Learning Theory
Nonlinear component analysis as a kernel eigenvalue problem
Neural Comput.
Cited by (45)
Multi-level contour combination features for shape recognition
2023, Computer Vision and Image UnderstandingRecognition of occluded objects by slope difference distribution features[Formula presented]
2022, Applied Soft ComputingAn enhanced and interpretable feature representation approach to support shape classification from binary images
2021, Pattern Recognition LettersA robust approach for object matching and classification using Partial Dominant Orientation Descriptor
2017, Pattern RecognitionCitation Excerpt :Height function is also used by [21] to generate the shape descriptors, where the correspondence between shapes is performed using dynamic programming algorithm. Other studies such the ones in [22,23] have used linear interpolation to transform the shape contours into a symbolic representation, where edit-distance and kernel Support Vector Machine (SVM) are both used for classification. Furthermore, BoW concept is used by Wang et al. [24] to develop a shape representation called Bag of Contours Fragments; here, the shape classification is performed using linear SVM classifier.
Unseen object categorization using multiple visual cues
2017, Neurocomputing