Content-based trademark retrieval system using a visually salient feature

doi:10.1016/S0262-8856(98)00060-2

Image and Vision Computing

Volume 16, Issues 12–13, 24 August 1998, Pages 931-939

https://doi.org/10.1016/S0262-8856(98)00060-2 Get rights and content

Abstract

The ever-increasing number of registered trademarks has created greater demand for an automatic trademark retrieval system. In this paper, we present a method for such a system based on the image content, using a shape feature. Zernike moments of an image are used for a feature set. To retrieve similarly shaped trademarks quickly, we introduced the concept of a `visually salient feature' that dominantly affects the global shape of the trademarks. Experiments have been conducted on a database of 3000 trademark images. The retrieval speed was very fast and similar-shaped trademark retrieval results were very promising.

Introduction

An image retrieval system based on image content is a key area for building and managing large multimedia databases such as trademark and copyright, art galleries and museum, picture archiving and communication system (PACS), to name a few [1]. So, interest in the subject of content-based image retrieval has greatly increased for the past few years.

In this paper, we address the problem of visually similar trademark retrieval from a large trademark database using shape features. Trademarks are considered valuable intellectual properties and a key component of the goodwill of a business, since they represent not only the quality of actual products and services, but also the reputation of the manufacturer or the company. A registered trademark is protected through legal proceedings from misuse or imitation. Until now, since the total number of registered trademarks is over a million, the task of designing and registering a new trademark becomes more difficult without inadvertent infringement of copyright. So far, the current practice to classify trademarks is first by grouping the trademarks into several similar shapes according to a specific class order, followed by performing the matching process manually by human operator [2]. Therefore, the development of an on-line automatic trademark retrieval system for similar shapes becomes crucial.

In this paper, Zernike moment magnitudes (ZMMs) are used as a feature set. ZMMs are robust to noise or small variance of a pattern, and have rotation invariant characteristics. With a proper normalization method, scale invariance has also been achieved [3]. To retrieve similar shapes, we developed the `visually salient feature' that dominantly affects the global shape of the trademarks by ignoring minor details. The visually salient feature was determined by the probabilistic distribution model of a trademark database. To verify the performance of our proposed similar-shaped trademark retrieval system, several trademarks were submitted as a query image to a trademark database that contains 3000 trademarks.

We also considered pseudo-Zernike moments as a feature set. Pseudo-Zernike moments have properties analogous to Zernike moments. The performance of pseudo-Zernike moments was very similar to that of Zernike moments.

A trademark is a complex pattern, consisting of various text and image patterns. Trademarks can be divided into four types as shown in Fig. 1. Word-in-mark is a trademark that contains only characters or words in the mark. Character recognition or manual annotation is required to handle the type because the linguistic property (word structures and phonetics) is the key component of the type. On the other hand, a device-mark contains graphical or figurative elements only. Thus, the geometric shape is the key component for the type. Composite-mark consists of characters or words and graphical elements, while a complex-mark contains a complex image. Our current system focuses on retrieving device-mark types only.

Content-based image retrieval can be categorized into three parts: color-, texture- and shape-based retrieval. A number of techniques have appeared in the literature that deal with retrieval based on shape similarity. The QBIC (Query by Image Content) system allows queries on a large image database using various image contents such as color, texture, shape and position [4]. Jagadish proposed a similar shape retrieval method using the rectangular cover description 5, 6. Bigün et al. proposed an image retrieval system using orientation radiograms which are similar to the histogram of the edge directions [7]. Bimbo et al. presented an image retrieval system using a hierarchical model of the curve which is derived from its multi-scale analysis [8]. Mokhtarian et al. proposed the similar shape retrieval method using the maxima of curvature zero-crossing contours in the curvature scale space [9].

Several researchers have applied shape-based retrieval techniques to trademark images. Kato introduced a content-based similar shaped-trademark retrieval system [10]. This system used graphical features such as spatial outline of the overall figures, spatial, frequency, local correlation measure and local contrast measure. Cortelazzo et al. presented the trademark shape description method using a string matching technique [11]. Jain et al. proposed a hierarchical image retrieval system and tested the system on a trademark database 12, 13, 14. Their system uses a two-stage hierarchy: a fast screening stage using a histogram of the edge directions and invariant moments and a detailed matching stage using deformable template matching [15]. Eakins presented the SAFARI (shape analysis for automatic retrieval of images) system with curvature-based feature [16]. He developed a later version of SAFARI, so called ARTISAN (automatic retrieval of trademark images by shape analysis) that utilized more complex features: circularity, aspect ratio, discontinuity angle irregularity etc. [17]. Lam et al. presented a trademark retrieval system, STAR (system for trademark archival and retrieval) [18]. The system consisted of two parts to handle device-marks and word-in-marks. For device-marks, invariant moments and Fourier descriptors extracted from manually isolated distinct objects were used for shape features and the similarities among the trademarks are measured by a fuzzy thesaurus. For word-in-marks, the system performed sub-string matching and phonetics matching to retrieve trademarks that have similar linguistic properties.

Boundary based techniques such as boundary matching 11, 16, 17, Fourier descriptors [18] and multiscale curve matching 8, 9 may not be suitable for similar-shaped trademark retrieval, as the boundary shape can be changed drastically when there is a small crack like an opening or an object touching neighboring objects. For example, the shapes shown in Fig. 2(a) and (c) are very similar in human perception. The boundaries of these shapes, as shown in Fig. 2(b) and (d), however, are very different whether or not the inner star touches the outer circle. Furthermore, while most Fourier descriptor or curvature-based methods are based on a single boundary, a trademark consists of a complex pattern that has more than one boundary. Morphology-based preprocessing can be applied to remedy the problem, but it is not easy to determine the number of operations such as erosion or dilation to yield the optimum result for all trademark images. In addition, the resulting number of contours may also be very sensitive to the number of preprocessing steps.

A histogram of the edge directions 7, 12, 13, 14 has also been used in many systems. The drawback of this technique lies in the lack of discernment, because the histogram alone does not contain the information of edge location. For example, the images shown in Fig. 3(a)–(c), although their shapes are very different, have similar histograms of the edge directions as illustrated in Fig. 3(d).

The rest of this paper is organized as follows. In Section 2, we overview Zernike moments as a feature set. In Section 3, we present a probabilistic distribution model of the feature. Then, our retrieval method is described in Section 4. Experimental results are given in Section 5, and Section 6summarizes the paper.

Section snippets

Zernike moments as a feature set

Zernike moments are complex orthogonal moments whose magnitude has rotational invariant property 19, 20, 21, 22. Teh et al. compared several moments in terms of:

1.
sensitivity to image noise;
2.
aspects of information redundancy;
3.
capability for image representation.

They reported that Zernike and pseudo-Zernike moments outperform the other moments, such as regular moments, Legendre moments, rotational moments and complex moments, in all aspects [23]. Kim et al. has shown that Zernike moments

Trademark data collection

Three thousand Korean and world trademarks were collected from the reference 24, 25 by scanner. All trademark images were binarized and normalized to the size of 100×100 pixels by maximum extent circle (MEC) method [26]. Color was not considered in our current system. ZMMs were computed by the lookup-table method [26] and stored in a database up to the order of n=17. The total number of moments corresponding to n=17, is 90 [23].

Distribution model

The distribution model of features plays an important role in our

Retrieving similar trademarks using the most salient feature

With 90 Zernike moment features to use for retrieving similar trademarks from database, one of the common practice is to make use of the Euclidean distance in feature space, along with a proper weight on each feature [22]. The one whose distance to the query is the minimum will be selected. However, when the number of patterns in a database to compare is very large, the number of features should also be increased, and this naive approach may pose a computational problem. In addition, as the

Experimental results

To verify the performance of our proposed similar-shaped trademark retrieval scheme, several trademarks shown in Fig. 9 were submitted as query image to the trademark database that consists of 3000 trademarks. The performance is estimated by the following subjective and objective criteria;

1.
How well can similar-shaped trademarks be retrieved in accordance with the human perception.
2.
How well can the same trademarks be retrieved in the presence of noise or deformation.

When a query was submitted to

Summary and discussion

In this paper, we presented a new content-based similar shape retrieval method for trademarks using Zernike moments. The advantages of using MSF are twofold: quick retrieval of similar trademarks, and robustness to the minor transformation of the shape.

Since the radial complexity and the degree of circular symmetry of the shape are reflected in the MSF, the retrieved trademarks using the MSF will have similar characteristics. The MSF of the trademark was barely affected by noise or deformation

Acknowledgements

This work was supported by the Electronics and Telecommunications Research Institute under grant 97202.

References (28)

G. Cortelazzo et al.
Trademark shapes description by stringmatching techniques
Pattern Recog.
(1994)
A.K. Jain et al.
Image retrieval using color and shape
Pattern Recog.
(1996)
R.J. Prokop et al.
A survey of moment-based techniques for unoccluded object representation and recognition
CVGIP: Graph. Models Image Process.
(1992)
A. Khotanzad et al.
Rotation invariant image recognition using features selected via a systematic method
Pattern Recog.
(1990)
V.N. Gudivada et al.
Content-based image retrieval systems
IEEE Comput.
(1995)
B. Andrews, U.S. Patent and Trademark Office ORBIT Trademark Retrieval System, T-term User Guide, Examining Attorney's...
Kim W.Y., Yuan P.O., A practical pattern recognition system for translation, scale and rotation invariance, in:...
M. Flickner et al.
Query by image and video content: the QBIC system
IEEE Comput.
(1995)
H.V. Jagadish, A retrieval technique for similar shapes, in: Proceedings of ACM SIGMOD, 1991, pp....
S.K. Chang et al.
A new method of image compression using irreducible covers of maximal rectangles
IEEE Trans. Software Engng
(1988)

J. Bigün, S.K. Bhattacharjee. Michel S., Orientation radiograms for image retrieval: an alterative to segmentation, in:...

A.D. Bimbo, P. Pala, Image indexing using shape-based visual features, in: Proceedings of IEEE International Conference...

Mokhtarian, S. Abbasi and J. Kittler, Efficient and robust retrieval by shape content through curvature scale space,...

T. Kato, Database architecture for content-based image retrieval, in: Proceedings of SPIE Conference on Image Storage...

Cited by (109)

Finite multi-dimensional generalized Gamma Mixture Model Learning for feature selection
2020, Learning Control: Applications in Robotics and Complex Dynamical Systems
Model-based approaches have been widely utilized to investigate multi-dimensional positive features for the purpose of gaining beneficial knowledge. In fact, feature selection is a critical and challenging task when modeling data that are represented in high-dimensional features spaces, which is due to the fact that some of these features maybe irrelevant to the learning process. As a result, a statistical mixture model, which is capable of dealing with multi-dimensional vectors, is developed to tackle the issue of clustering positive vectors. Simultaneously, we take into consideration the irrelevant features that could compromise the proposed model accuracy. The maximum likelihood (ML) method is performed via expectation maximization (EM), and employed for estimating the parameters of the proposed model. Furthermore, experiments are conducted using real-life applications, which include texture, shape and scene images to investigate the performance of our approach.
A deep one-shot network for query-based logo retrieval
2019, Pattern Recognition
Citation Excerpt :
Thereafter many logo-related works were carried out with the methods of content-based indexing and retrieval in trademark databases. The main goal is to assist in trademark infringement detection by checking a newly designed trademark with registered logos in archives [30–33]. The task of trademark recognition in videos is inherently harder due to loss of quality of original logos during processing (e.g. color sub-sampling, video interlacing, motion blur, etc.).
Logo detection in real-world scene images is an important problem with applications in advertisement and marketing. Existing general-purpose object detection methods require large training data with annotations for every logo class. These methods do not satisfy the incremental demand of logo classes necessary for practical deployment since it is practically impossible to have such annotated data for new unseen logo. In this work, we develop an easy-to-implement query-based logo detection and localization system by employing a one-shot learning technique using off the shelf neural network components. Given an image of a query logo, our model searches for logo within a given target image and predicts the possible location of the logo by estimating a binary segmentation mask. The proposed model consists of a conditional branch and a segmentation branch. The former gives a conditional latent representation of the given query logo which is combined with feature maps of the segmentation branch at multiple scales in order to obtain the matching location of the query logo in a target image. Feature matching between the latent query representation and multi-scale feature maps of segmentation branch using simple concatenation operation followed by 1 × 1 convolution layer makes our model scale-invariant. Despite its simplicity, our query-based logo retrieval framework achieved superior performance in FlickrLogos-32 and TopLogos-10 dataset over different existing baseline methods.
Improved shape matching and retrieval using robust histograms of spatially distributed points and angular radial transform
2017, Optik
In this paper, the problem of shape based image retrieval is addressed by proposing a hybrid shape descriptor. The proposed descriptor conforms to human visual perception along with its low computational complexity. Since global features are related to the holistic characteristics of images, whereas local features describe the finer details within objects of images, in the proposed hybrid descriptor both global and local features of images are used to describe the entire aspects of image shape. For global features extraction, we use angular radial transform, which is also adopted by MPEG-7 as a region based shape descriptor. On the other hand, for local feature extraction, a novel local descriptor is proposed, which is referred to as histograms of spatially distributed points (HSDP). It is based on two components: radial distance and differential coefficient, which are used to build 2D histograms. Global and local features are combined using effective distance measures viz. Min-Max and Bray-Curtis. Their superiority is validated by experimental results. Apart from that, an extensive range of image databases is employed to assess the performance of the proposed hybrid descriptor. These databases represent several characteristics of shape such as partial occlusion, distortion, subject change, gray scale objects, rotated and noise affected objects, unstructured images, trademarks, blurred images, Corel images, etc. The results of wide range of experiments reveal that the fusion of ART and HSDP significantly improves the image retrieval accuracy and provides a robust and invariant solution for effective shape matching.
Multi-faceted assessment of trademark similarity
2016, Expert Systems with Applications
Trademarks are intellectual property assets with potentially high reputational value. Their infringement may lead to lost revenue, lower profits and damages to brand reputation. A test normally conducted to check whether a trademark is highly likely to infringe other existing, already registered, trademarks is called a likelihood of confusion test. One of the most influential factors in this test is establishing similarity in appearance, meaning or sound. However, even though the trademark registration process suggests a multi-faceted similarity assessment, relevant research in expert systems mainly focuses on computing individual aspects of similarity between trademarks. Therefore, this paper contributes to the knowledge in this field by proposing a method, which, similar to the way people perceive trademarks, blends together the three fundamental aspects of trademark similarity and produces an aggregated score based on the individual visual, semantic and phonetic assessments. In particular, semantic similarity is a new aspect, which has not been considered by other researchers in approaches aimed at providing decision support in trademark similarity assessment. Another specific scientific contribution of this paper is the innovative integration, using a fuzzy engine, of three independent assessments, which collectively provide a more balanced and human-centered view on potential infringement problems. In addition, the paper introduces the concept of degree of similarity since the line between similar and dissimilar trademarks is not always easy to define especially when dealing with blending three very different assessments. The work described in the paper is evaluated using a database comprising 1400 trademarks compiled from a collection of real legal cases of trademark disputes. The evaluation involved two experiments. The first experiment employed information retrieval measures to test the classification accuracy of the proposed method while the second used human collective opinion to examine correlations between the trademark scoring/rating and the ranking of the proposed method, and human judgment. In the first experiment, the proposed method improved the F-score, precision and accuracy of classification by 12.5%, 35% and 8.3%, respectively, against the best score computed using individual similarity. In the second experiment, the proposed method produced a perfect positive Spearman rank correlation score of 1.00 in the ranking task and a pairwise Pearson correlation score of 0.92 in the rating task. The test of significance conducted on both scores rejected the null hypotheses of the experiment and showed that both scores correlated well with collective human judgment. The combined overall assessment could add value to existing support systems and be beneficial for both trademark examiners and trademark applicants. The method could be further used in addressing recent cyberspace phenomena related to trademark infringement such as customer hijacking and cybersquatting.
Logo and seal based administrative document image retrieval: A survey
2016, Computer Science Review
Citation Excerpt :
Based on this categorization outline, local features used for logo recognition include: features extracted from local zone [54], differential invariants [5], negative shape features [6], primitives (line segments) [55], curvature and distance from centroid point [12,7], SIFT and SURF descriptors derived from Hessian-affine interest points [56–58], horizontal gaps per total area, vertical gaps per total area, ratio of hole area to total area [59,60], color [61], Delaunay triangulation of components/local features [61,60], bag-of-words features [60], edge based features extracted using GHT [62], Fourier coefficients of segmented boundary curves [63], rectangle features extracted from integral image [64], etc. Global features utilized in literature for logo recognition are: different moments (Zernike, Tchebichef, invariant, radial) [54,11,59,65,2,7], projection profiles [66], bispectral [66], gradient features extracted from contour points [67–69], algebraic invariants [5], wavelet-based features [6,70], circularity, eccentricity and rectangularity [59], geometric topology features extracted from components [61], area, isolation, deviation, symmetry, centralization, complexity and 2-level contour representation strings [71], global shape based features (circle, rectangle, triangle, ellipse, polygon, and B-spline) extracted from Fourier descriptor [62,72], shape context [73,3], features extracted from raw image/vector data [74], curvature [22], template matching [75], etc. Different classification methods employed for logo (well segmented) recognition/ classification can be categorized into non-parametric and parametric classification techniques.
With the advance of technology, business offices and organizations together with their clients create a massive amount of administrative documents every day. Administrative documents commonly contain some salient entities such as logos, stamps or seals as the means of their authentication and proprietorship. These salient entities provide quite discriminative information, which can effectively be used for different tasks of document image retrieval, classification and recognition in document-based applications. Thus, proper detection/recognition of these entities in document images increases the performance of such applications in terms of document retrieval, classification, and recognition. To present the state-of-the-art research on the retrieval of administrative document images, this paper deals with a survey of administrative document image retrieval in relation to seals and logos. All the available datasets, feature extraction and classification techniques for logo and seal detection/recognition are discussed systematically. The shortcomings of the present technologies on logo and seal based document processing are also highlighted. Avenues of the future works are further given for the benefit of readers. To the best of authors’ knowledge, there is no survey on administrative document image retrieval and hence the authors hope that this work will be helpful to the researchers of the document analysis community.
An effective vector model for global-contrast-based saliency detection
2015, Journal of Visual Communication and Image Representation
Citation Excerpt :
This selective visual ability allows brain and visual system to break through the bottleneck of information-processing, because it is hypothesized that human visual system only concentrates on the most unusual parts of the massive sensory incoming information [1]. In the computer vision field, it is critical to simulate this ability to extract saliency maps because the maps are key to the applications in images and videos including perceptual video retargeting [2,3], perceptual object segmentation [4,5], adaptive coding [6], object recognition [7,8], and image retrieval [9]. Visual saliency is a perceptual state or quality that makes an item prominent from its neighborhoods.
The saliency detection methods based on global contrast can generate full-resolution saliency map with uniformly highlighted regions and defined boundaries. For the images consisting of large salient objects, the use of unweighted sum of the color distances in the existing global-contrast-based methods may result in the detection of the background instead of the outstanding objects. In this paper, we propose a new global-contrast-based saliency detection method, called LRSW method, by deriving a new vector model which uses the weighted mean vector and contains the features of CIELAB color, chromatic double opponency, and similarity distribution. By using the vector model, the proposed method can significantly increase the detection precision and suppress the background in the saliency map, especially for large salient objects. The experimental results on the MSRA benchmark images show the effectiveness of the proposed method which outperforms the existing methods on visual saliency detection in terms of precision and recall.

View all citing articles on Scopus

View full text

Content-based trademark retrieval system using a visually salient feature

Abstract

Introduction

Section snippets

Zernike moments as a feature set

Trademark data collection

Distribution model

Retrieving similar trademarks using the most salient feature

Experimental results

Summary and discussion

Acknowledgements

Pattern Recog.

Pattern Recog.

CVGIP: Graph. Models Image Process.

Pattern Recog.

Content-based image retrieval systems

IEEE Comput.

Query by image and video content: the QBIC system

IEEE Comput.

A new method of image compression using irreducible covers of maximal rectangles

IEEE Trans. Software Engng