Similarity retrieval based on group bounding and angle sequence matching in shape database systems

https://doi.org/10.1016/S0164-1212(00)00022-4Get rights and content

Abstract

In this paper, a new method for retrieving similar shapes from a shape database is proposed. The shapes in the database are indexed by their CPA-strings. When a query shape is submitted to the system, it is converted to a CPA-string from which both the lower and the upper bounds of the locations of the potentially matched shapes are computed. This will restrict the search space to a reasonable small proportion of the whole database. At the second stage, an angle sequence matching algorithm is invoked to compute the Weighted Levensthein Distances between the query CPA-string and the selected database CPA-strings. The shapes that have distances less than a given threshold are finally retrieved. Experimental results show that our approach is robust, accurate and efficient in terms of finding the desired shapes within a reasonable time, even if the shapes are rotated, scaled, and may have boundary noise.

Introduction

Similarity retrieval is one of the most important functions in image database systems. Techniques of using spatial relations among objects for similarity retrieval have been widely studied in the past few years (Chang et al., 1987, Chang et al., 1988; Lee et al., 1989; Lee and Hsu, 1992; Huang and Jean, 1994, Huang and Jean, 1996). There are many image database applications where an image contains only a single object. Typical examples include target shapes in defense applications; part or tool shapes in manufacturing applications; and shape of body parts, medical tools, and biomedical signals in medical applications (Mehrotra and Gary, 1995). In such applications, shape feature becomes the most important information used to discriminate objects. The shape of an object is described through an image or drawing. Large collections of object images or drawings are called shape databases. The process of selecting a subset of objects' images or drawings, which are visually similar to the query shape, is called shape similarity retrieval.

There are three important issues involved in shape similarity retrieval: (1) how to represent a shape in an abstract and efficient way while still preserving important shape feature; (2) what are the measures for evaluating both the difference and similarity between shapes?; (3) how to index and organize the shapes in the database so that searching effort for desired shapes can be reduced?

The shape of an object can be described quantitatively using numeric shape descriptors such as roundness, curveness, rectangularity, compactness, direction, elongatedness and eccentricity (Sonka et al., 1993). These shape descriptors provide a global description of object shape, but lack detailed variations. On the other hand, shape feature can be extracted from critical points because the main information of a shape is concentrated at those points with high curvature. Existing critical points detection techniques were summarized in (Fu and Yan, 1997).

Methods of shape similarity retrieval depend on the shape representation schemes used. So far, there is no technique of general validity to measure shape similarity. Evaluating diversities among an unpredictably wide range of shapes probably requires complete shape descriptions about objects. In despite of such difficulties, Mehrotra and Gary (1995) proposed a shape indexing method by transforming a sequence of critical points of a shape into a single point in a multidimensional space. All points in this multidimensional space are then organized into a dynamic structure such as K–D–B tree (Robinson, 1981). Searching for desired shapes in a K–D–B tree is much more efficient than exhaustive search. However, this method is very sensitive to noises on shapes. It is also difficult to determine the dimension of a suitable K–D–B tree because of varieties of shapes. Besides, the implementation cost of a K–D–B tree is extremely high. Another method of shape similarity measuring is to compute the string distance by applying a string-matching technique directly to chain-coded contours (Abe and Sugita, 1982; Cortelazzo et al., 1994). This method is also too sensitive for non-smooth parts of contours which are not the real `features' of the shapes.

In this paper, we present a robust and efficient method for retrieving similar shapes from a shape database. Our approach is robust because it can discriminate objects by measuring their similarities even if objects are rotated, scaled and have noise on the boundary. It is also efficient because the search area for locating potentially matched shapes is controlled within two bounds, which can be dynamically calculated according to the shape feature of the query object. The effectiveness of our approach has been well demonstrated by an experimental system implemented in C++ on IBM compatible PC.

The remainder of this paper is organized as follows. In Section 2, we present our method for extracting and representing the shape feature of an object. In Section 3, we describe how to extend the Weighted Levensthein Distance (WLD) method (Okuda et al., 1976) to become a similarity (or dissimilarity) measure for shape recognition and retrieval. In Section 4, we show the method of organizing the shape database and mapping a shape query into a specific area of the shape database by computing the lower bound and upper bound for that area. The results demonstrating the performance of our experimental system are shown in Section 5. Finally, conclusions are given in Section 6.

Section snippets

Shape feature extraction and representation

The process of shape feature extraction and representation is divided into the following two steps: (1) locating a set of critical points as the vertices of a polygon which approximates the shape of a given object; (2) calculating the sizes of angles at the critical points and forming a sequence of angles to represent the shape of the object.

Shape similarity measurement

One good method for measuring the difference between two strings of symbols is to compute their WLD (Okuda et al., 1976; Abe and Sugita, 1982). The WLD is based on the concept that a string A={a1,a2,…,an} can be rewritten to become another string B={b1,b2,…,bm} by three symbol operations: substitution, insertion and deletion. Suppose that the costs associated with substitution, insertion and deletion are p,q and r, respectively. Also assume that there are N different ways of rewriting string A

Shape similarity retrieval

Given a query shape, our task is to find all the database shapes which are similar to the query shape. Using WLD to measure the difference (or similarity) between the query shape and each database shape is still very time-consuming. To reduce the search time, we organize the CPA-strings of the shapes in the database into several groups such that all CPA-strings of the same group will have equal length.

Let {Gi1,Gi2,…,Gil} be an ordered sequence of groups such that i1<i2<⋯<il, where ik is the

Experimental results

The database of our experimental system contains 2068 shapes. Some of the shapes in the database are shown in Fig. 3. The shapes may have different orientations and scales. The CPA-strings for these 2068 shapes were generated and stored in the database as the indices to their corresponding shapes. All 2068 CAP-strings were classified into 43 groups according to their string lengths as shown in Table 1.

In our experiment, we picked different objects from the database as the query shapes. Each

Conclusions

Object shape plays an important role in modeling the real world for many important database applications. The traditional database approach to modeling the real world by alphanumeric data yields a severe loss of information. Content-based retrieval by directly analyzing the feature of shapes and measuring the similarity and dissimilarity among shapes becomes a better alternative.

In our approach, we used a CPA-string to capture the shape information of an object. The method of automatically

Acknowledgements

This research was supported by National Science Council of ROC under contract NSC 88-2211-E-005-021.

P.W. Huang has been with Texas Instruments as a member of technical staff, software development manager, and senior system engineer from 1979 to 1990. From 1991 to 1997, he was an associate professor in the Computer Science Group of Applied Mathematics Department at National Chung-Hsing University, Taiwan. He was the chairman of the Institute of Computer Science at National Chung-Hsing University from 1992 to 1993. Currently, he is a full professor in the Department of Applied Mathematics at

References (16)

There are more references available in the full text version of this article.

Cited by (3)

P.W. Huang has been with Texas Instruments as a member of technical staff, software development manager, and senior system engineer from 1979 to 1990. From 1991 to 1997, he was an associate professor in the Computer Science Group of Applied Mathematics Department at National Chung-Hsing University, Taiwan. He was the chairman of the Institute of Computer Science at National Chung-Hsing University from 1992 to 1993. Currently, he is a full professor in the Department of Applied Mathematics at National Chung-Hsing University. His research interests include pattern recognition, image database systems, distributed systems, artificial intelligence, decision support and expert systems, computer-integrated manufacturing. He received the B.S. degree in Applied Mathematics from National Chung-Hsing University in 1973, the M.S. degree in Mathematics from Texas Tech University in 1978, and his Ph.D. degree in Computer Science from Southern Methodist University in 1989.

P.L. Lin received her B.S. degree in Engineering Science from National Cheng-Kung University in 1973, M.S. in Mathematics from Texas Tech University in 1978, M.S. and Ph.D. in Electrical Engineering from Southern Methodist University in 1992 and 1994, respectively. She was with the Corporate Manufacturing Technology Center of Texas Instruments as a senior engineer and a project manager from 1979 to 1992. Since 1994, she has been an associate professor in the Department of Computer Science and Information Management at Providence University. She holds two U.S. patents in mobile robot guiding systems. Her research interests includes network security, cryptography, image processing, and factory automation. She is a senior member of SME.

S.K. Dai is a part-time graduate student in the Department of Applied Mathematics at National Chung-Hsing University. He is also an engineer of Chunghwa Telecom Co. at Taichung, Taiwan. His research interests are in computer graphics, pattern recognition, and image processing.

R.T. Kuo is currently a professor in the Department of Applied Mathematics at National Chung-Hsing University. He received a Ph.D. degree in Mathematics from University of Missouri at Columbus in 1981.

View full text