Elsevier

Pattern Recognition

Volume 49, January 2016, Pages 79-88
Pattern Recognition

Landmarks inside the shape: Shape matching using image descriptors

https://doi.org/10.1016/j.patcog.2015.07.013Get rights and content

Highlights

  • An internal-landmark based planar shape matching scheme is proposed.

  • The shape landmarks are obtained using the recently proposed SPEM representation.

  • Properties of SPEM that allow the use of image descriptors inside shapes are discussed.

  • Statistical evidence indicates that the number of matches can be used as a similarity measure.

Abstract

In the last few decades, significant advances in image matching are provided by rich local descriptors that are defined through physical measurements such as reflectance. As such measurements are not naturally available for silhouettes, existing arsenal of image matching tools cannot be utilized in shape matching. We propose that the recently presented SPEM representation can be used analogous to image intensities to detect local keypoints using invariant image salient point detectors. We devise a shape similarity measure based on the number of matching internal regions. The performance of the similarity measure in planar shape retrieval indicates that the landmarks inside the shape silhouettes provide a strong representation of the regional characteristics of 2D planar shapes.

Introduction

In the past two decades, a large number of shape matching and retrieval methods have been proposed. Perhaps the two most important tasks for shape matching are the representation of shape and the extraction of shape descriptors. A need to measure shape similarity arises in many contexts both generic and application specific as in retrieving the most similar shapes from a database to a given query shape. Shape descriptors are the key to measuring similarity or equivalently distance between two or more shapes, and the choice of shape representation naturally affects the choice of shape descriptors as different representations make different properties explicit. In this paper, we propose internal landmarks based on a set of consistent continuous shape fields, SPEM [1], that facilitate the use of invariant image descriptors for planar shape description (see Fig. 1).

The idea of matching shapes from the same class using correspondences that can be related via geometric transformations, stemming from the work of D׳Arcy Thompson [2], motivates landmark-based methods for shape matching. Typically, the landmark points are acquired by sampling either from shape boundary [3], [4] or medial axes [5], [6], [7], [8], [9], [10]. Such landmark-based shape representations have been successfully used in shape matching, also several robust methods for point-based matching are presented [11], [12], [13], [14].

In a different and very active line of research, local detectors that are covariant to a class of transformations as support regions to compute invariant descriptors have proven to be very successful in image retrieval and object recognition [15], [16], [17], [18], [19], [20]. Matching using such a representation is commonly followed by a spatial verification procedure [21], where a planar homography transformation hypothesis is formed and agreeing matches are kept as inliers. This powerful framework progressively evolved over the years for matching images of objects.

There are, nonetheless, applications where matching silhouette data rather than image data is needed. For example, in computational anatomy, alignment of shapes (via silhouettes or silhouette boundaries) is crucial in constructing anatomical atlases for organs or characterizing change that may be a precursor to certain diseases or defects. Furthermore, availability of depth images via RGB-D cameras made it possible to extract the silhouette data for problems where color and texture may be uninformative. Hence, adopting the highly evolved body of works available for matching images to matching silhouettes is important for not re-inventing equivalent tools for silhouette matching. For instance, Obuchi et al. [22] presented a SIFT based framework for 3D shape retrieval, where the SIFT features are obtained from 2D range images of a 3D model. Yet for 2D silhouettes no information is provided in order to calculate salient local regions inside the shape domain.

On one hand, smooth distance fields given for instance by solutions to the Poisson equation are consistent, but they are highly smooth and not sufficiently expressive; hence, the schemes based on Poisson type shape representations have to employ secondary level representations in the form of skeletons [23] or descriptors such as weighted moments [24]. The reason that the proposed framework can work with planar silhouettes is the underlying shape representation that smoothly encodes pure shape information on the shape domain, is consistent and has rich and revealing characteristics.

In this paper, we show that the recently proposed Screened Poisson Encoding Maps (SPEM) [1] representation is both consistent and expressive in a way that descriptors and geometric assumptions proposed for object images can be used for object silhouettes. As the SPEM representation entails the full regional shape description in the form of a compact set of shape maps, each of which is analogous to an image intensity function, we resort to the robust image feature descriptor SIFT [25]. Our approach allows landmarks to be obtained inside the shape, which is novel. We provide statistical evidence that the landmarks inside shapes and corresponding SIFT descriptors can be used for shape matching. To demonstrate the effectiveness of the approach, we perform shape retrieval experiments on widely experimented common shape datasets and compare retrieval accuracy using precision and Bull׳s eye scores. The promising retrieval performance provides motivation for the use of proposed internal landmarks in a variety of shape analysis applications. In the following sections we will briefly describe the SPEM shape representation and why it is sound to use it with SIFT descriptors, introduce the SIFT-based internal landmark matching framework, which is for the first time utilized in the planar shape-matching problem.

Section snippets

Planar shape matching using SPEM and SIFT

We propose a shape matching procedure where corresponding internal parts of shapes are matched under the constraints of having similar regional shape characteristics and a geometrically likely configuration. In order to find a pairwise similarity measure between a given query shape and all of the shapes in a given database, we utilize the matching procedure depicted in Fig. 2.

SIFT [25] descriptors on each projection of SPEM [1] for both input and target shapes are calculated, where detection of

Shape retrieval experiments

In this section, we present our results in comparison to results of the popular shape retrieval methods on commonly employed datasets of Gorelick [24] and Tari [23], [35]. Gorelick׳s dataset is a collection of natural silhouettes expanded by variable classes from Kimia Dataset [36] and contains 12 classes with unevenly distributed number of samples in each class. The natural silhouettes in the same class contain large topological variations and articulations. The Tari dataset by Aslan and Tari

Conclusions

Due to both the holistic and the regional nature of the SPEM representation, a SIFT-based image matching framework could be used in planar shape matching. In contrast to existing boundary-based approaches, proposed region-based representation allows the landmarks to be obtained from the whole shape domain, which leads to robustness to artifacts that can occur in shape boundaries. Moreover, in addition to matching projection characteristics, in order to enforce geometrically likely

Conflict of interest

None declared.

Acknowledgments

Authors R.A. Guler and G. Unal are supported by the TUBITAK (The Scientific and Technological Research Council of Turkey) Research Grant no. 112E320.

Rıza Alp Güler received B.S. and M.S. degrees from Sabanci University, Electronics Engineering Program in 2012 and 2014 respectively. He worked as a PhD student and graduate assistant at Sabanci University in 2015. He received the ISRA VISION ‘Computer Vision Award’ in 2014 with his M.S. thesis work. Starting in Fall 2015, he is going to pursue his PhD degree at École Centrale Paris as part of INRIA Galen Research Team. His main research interest has been in regional shape representations.

References (48)

  • S. Tari et al.

    Extraction of shape skeletons from grayscale images

    Comput. Vis. Image Underst.

    (1997)
  • K. Siddiqi et al.

    Shock graphs and shape matching

    Int. J. Comput. Vis.

    (1999)
  • S.-C. Zhu

    Stochastic jump-diffusion process for computing medial axes in Markov random fields

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1999)
  • T.B. Sebastian et al.

    Recognition of shapes by editing their shock graphs

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2004)
  • B. Jian, B.C. Vemuri, A robust algorithm for point set registration using mixture of Gaussians, in: Tenth IEEE...
  • Y. Zheng et al.

    Robust point matching for nonrigid shapes by preserving local neighborhood structures

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2006)
  • H. Guo, A. Rangarajan, S. Joshi, L. Younes, Non-rigid registration of shapes via diffeomorphic point matching, in: IEEE...
  • D.G. Lowe

    Distinctive image features from scale-invariant keypoints

    Int. J. Comput. Vis.

    (2004)
  • R. Fergus, P. Perona, A. Zisserman, Object class recognition by unsupervised scale-invariant learning, in: IEEE...
  • K. Mikolajczyk et al.

    Scale and affine invariant interest point detectors

    Int. J. Comput. Vis.

    (2004)
  • K. Mikolajczyk et al.

    A comparison of affine region detectors

    Int. J. Comput. Vis.

    (2005)
  • C. Schmid et al.

    Local grayvalue invariants for image retrieval

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1997)
  • K. Mikolajczyk, C. Schmid, Indexing based on scale invariant interest points, in: Eighth IEEE International Conference...
  • J. Philbin, O. Chum, M. Isard, J. Sivic, A. Zisserman, Object retrieval with large vocabularies and fast spatial...
  • Cited by (0)

    Rıza Alp Güler received B.S. and M.S. degrees from Sabanci University, Electronics Engineering Program in 2012 and 2014 respectively. He worked as a PhD student and graduate assistant at Sabanci University in 2015. He received the ISRA VISION ‘Computer Vision Award’ in 2014 with his M.S. thesis work. Starting in Fall 2015, he is going to pursue his PhD degree at École Centrale Paris as part of INRIA Galen Research Team. His main research interest has been in regional shape representations.

    Sibel Tari received her B.S. degree from Hacettepe University Computer Science Engineering in 1989 and Ph.D. degree from Northeastern University in 1997. Currently, she is a professor at the Department of Computer Engineering at Middle East Technical University. Her research interest is in symmetry based shape representation, variational and PDE methods in image and shape analysis.

    Gozde Unal received her Ph.D. degree from ECE Department of North Carolina State University in 2002. She held research faculty and research scientist positions at Georgia Institute of Technology in 2002–2003, and at Siemens Corporate Research, Princeton, NJ, in 2003–2007, respectively. She was an assistant professor (2007-2011) and an associate professor at Sabanci University (2011-2015). Currently, she is an associate professor at the Department of Computer Engineering, Istanbul Technical University. Her research interests are in computer vision, segmentation, registration, and shape analysis techniques with applications to medical imaging.

    View full text