Shape recognition based on Kernel-edit distance

https://doi.org/10.1016/j.cviu.2010.07.002Get rights and content

Abstract

In this paper a kernel method for shape recognition is proposed. The approach is based on the edit distance between pairs of shapes after transforming them into symbol strings. The transformation of shapes into symbol strings is invariant to similarity transforms and can handle partial occlusions. Representation of shape contours uses the shape contexts and applies dynamic programming for finding the correspondence between points over shape contours. Corresponding points are then transformed into symbolic representation and the normalized edit distance computes the dissimilarity between pairs of strings in the database. Obtained distances are then transformed into suitable kernels which are classified using support vector machines. Experimental results over a variety of shape databases show that the proposed approach is suitable for shape recognition.

Introduction

Computer vision aims at building artificial systems able to understand scenes and to recognize automatically all present objects. Objects have several properties like shape, texture, color, etc. that can used for recognition. Among those features, probably shape is the most important property which can be perceived and used for recognition and classification. Because of this property of shapes, it is useful to develop algorithms for shape recognition. One important issue for the recognition of objects is dealing with the partial occlusions. As the proposed method is mainly for the recognition of segmented shapes from the background, these shapes might come from another module which does the segmentation task. Normally, the segmentation module cannot provide perfect shapes due to occlusions, so some parts of an object might be missing or might be combined with a part of background or another object. To have a successful method for shape recognition this point has to be considered in designing an algorithm. As we will see the proposed method has considered this point and it is able to work well with partially occluded shapes.

In this paper we present a new kernel method for shape recognition based on a previously published method [1] that uses edit distance metric for the similarity measure of a pair of shapes. With this kernel method, it is possible to use support vector machine (SVM) for the classification while for the old approach nearest-neighbor classifier was used. The new kernel method performs better than previous approaches on a variety of shape databases.

Section snippets

Previous approaches

There are two main approaches for the shape recognition in the literature namely contour-based approaches, considering the outline of shapes as input for recognition and surface-based methods, considering shapes as a whole for recognition. Both approaches use local or global representation. Maybe contour-based approaches that they use local representation are the most successful methods that can deal with the problems we have mentioned in the introduction part. In addition, the approaches can

Symbolic representation for a pair of shapes

Here we describe the way we transform a pair of shape contours from a database into a string of symbols. Let suppose we have selected N points over the contour of two shapes using a sampling method. In order to be able to compute the similarity between the two shapes, we need to find the best correspondence between point sets from the two shapes. We used the shape context as a similarity measure between points from the two shapes. The shape context for each point over the contour is a polar

Distance between strings (edit distance)

As we have two sequences of strings, there are several ways to compute the distance between them. One way is using the edit distance [15]. The edit distance is a general form of Hamming distance that is applied when the length of two strings are identical. The edit distance is the minimum number of necessary operations applied to one of the strings to make it identical to the other one. Allowed operation is an insertion, deletion or substitution of a single character. We have proposed [1] to

Kernel-edit distance

Kernel methods have been successfully used for pattern recognition. The key idea of kernel methods is to map the data into a high dimensional feature space in which each coordinate corresponds to one feature of data points. In the new high dimensional space it is possible to define the mathematical operations that in the original space have not been defined. The kernel methods normally do not operate directly in the space but rather they use inner products of all pairs of the data. This makes

Experimental results

In this section we test our method for shape recognition on a variety of shape databases and compare the results with the state of the art approaches for the shape recognition, available for those database in the literature. The databases were used here are consisting of Kimia-99 database, Chicken Piece Dataset, Natural Silhouette database, Marine database, Gesture dataset, MPEG-7 shape dataset and ETH-80 object database.

Conclusions

Here a kernel approach to shape recognition was proposed. The Kernel-edit distance proposed here, was tested on a variety of shape databases including Kimia-99, Chicken Pieces, Natural Silhouettes, Gesture database, Marine database, MPEG-7 shape database and ETH-80 object database and the recognition rate was superior to all of the previous approaches that they have used these databases. The results show that the method is robust to the partial occlusions, which is a necessary part for a shape

References (39)

  • A. Bonnassie et al.

    A new method for analyzing local shape in three-dimensional images based on medial axis transformation

    IEEE Trans. Syst. Man Cybern., Part B

    (2003)
  • H. Ling et al.

    Shape classification using the inner-distance

    IEEE Trans. PAMI

    (2007)
  • X. Yang, X. Bai, L.J. Latecki, Z. Tu, Improving shape retrieval by learning graph transduction, in: Proc. European...
  • X. Yang, S. Koknar-Tezel, L.J. Latecki, Locally constrained diffusion process on locally densified distance spaces with...
  • E.G.M. Petrakis et al.

    Matching and retrieval of distorted and occluded shapes using dynamic programming

    IEEE Trans. PAMI

    (2002)
  • E.S. Ristad et al.

    Learning string-edit distance

    IEEE Trans. PAMI

    (1998)
  • V. Vapnik

    The Nature of Statistical Learning Theory

    (1995)
  • B. Scholkopf et al.

    Nonlinear component analysis as a kernel eigenvalue problem

    Neural Comput.

    (1998)
  • S.J. Kim, A. Magnani, S. Boyd, Optimal kernel selection in kernel fisher discriminant analysis, in: Proc. of the 23rd...
  • Cited by (45)

    • Multi-level contour combination features for shape recognition

      2023, Computer Vision and Image Understanding
    • A robust approach for object matching and classification using Partial Dominant Orientation Descriptor

      2017, Pattern Recognition
      Citation Excerpt :

      Height function is also used by [21] to generate the shape descriptors, where the correspondence between shapes is performed using dynamic programming algorithm. Other studies such the ones in [22,23] have used linear interpolation to transform the shape contours into a symbolic representation, where edit-distance and kernel Support Vector Machine (SVM) are both used for classification. Furthermore, BoW concept is used by Wang et al. [24] to develop a shape representation called Bag of Contours Fragments; here, the shape classification is performed using linear SVM classifier.

    View all citing articles on Scopus
    View full text