FOCUS: A system for searching for multi-colored objects in a diverse image database

https://doi.org/10.1016/j.cviu.2003.10.018Get rights and content

Abstract

We describe a new multi-phase, color-based image retrieval system (FOCUS) which is capable of identifying multi-colored query objects in an image in the presence of significant, interfering backgrounds. The query object may occur in arbitrary sizes, orientations, and locations in the database images. Scale and rotation invariant color features have been developed to describe an image, such that the matching process is fast even in the case of complex images. The first phase of processing matches the query object color with the color content of an image computed as the peaks in the color histogram of the image. The second phase matches the spatial relationships between color regions in the image with the query using a spatial proximity graph (SPG) structure designed for the purpose. Processing at coarse granularity is preferred over pixel-level processing to produce simpler graphs, which significantly reduces computation time during matching. The speed of the system and the small storage overhead make it suitable for use in large databases with online user interfaces. Test results with multi-colored query objects from man-made and natural domains show that FOCUS is quite effective in handling interfering backgrounds and large variations in scale. The experimental results on a database of diverse images highlights the capabilities of the system.

Introduction

With the advent of large image databases with complex images, efficient content-based retrieval of images has become an important issue. The aim of content-based retrieval is to find images in a database which contain the object represented in a query image. When the database has images of multi-colored objects which can be recognized on the basis of their distinctive color signatures alone, the color of the object is an obvious choice for indexing. The simpler problem of finding global similarity between a query image and candidate images based on color has been addressed in [9], [13], [22], [32]. However, multi-colored objects usually will not occur by themselves in an image. There may be significant, interfering background clutter in the images or the object may occupy a very small portion of the overall image, making global similarity measures unsuitable for retrieval.

In the general case studied here, the target image may contain the queried object at a scale vastly different from the representation in the query image, at any location and with any orientation. The background in the target image is unconstrained and may include other objects with similar colors. However, it is assumed that the queried object appears in the target images with no occlusion and under similar photometric conditions. Since the proposed method is intended for use with online user interfaces, speed is an important criterion and a response time of a few seconds or less is targeted.

There are many image databases in domains where object color can be a basis for retrieval—flags, logos, consumer products, textile patterns, and postal stamps among man-made objects and flowers, birds, fish, and butterflies as example image databases in the natural domain. The database on which FOCUS has been tested consists of 1200 diverse color images. There are 400 advertisements from magazines and 800 color images from nature including birds, fish, flowers, animals, and vegetables. Advertisements are the most challenging component of the database where the goal is to retrieve all advertisements of a product shown in a query image. This is a particularly complex task since the queried object may appear in candidate images in various sizes and orientations with a wide variety of background colors and forms as shown in Fig. 1. The difference in scale is as large as 1:25 in some query–target pairs in our database. Unlike other databases on which color-based retrieval has been tried (flags, logos, and products), in most of the advertisement images the products do not spatially dominate the image, nor are they necessarily in the center. There is no concept of foreground and background—what is background clutter for this application may actually be the foreground of the image. Consequently, no focus-of-attention pre-segmentation is possible. In the rest of the paper, background refers to all objects and context in the image which are not a part of the queried object.

Our choice of domain gives us some advantages in offsetting the difficulty of the problem. Advertisers want consumers to see their product clearly, so occlusion of the product is rare, and typically the same aspect of the product is presented in all advertisements. There may be small out-of-plane rotations causing some occlusion, but all major colors of the object remain visible. Also, the advertisers take care that their products are printed with their true colors so there is very little color distortion across different advertisements of the same product. However, the color constancy problem still arises because light and shadow effects in images create differences in the apparent color of objects. Fortunately, the color variations are not severe and can be handled by the selection of a robust color representation and by allowing some tolerance in the matching strategy. A brief description of this work appeared in [8].

Section snippets

Related work

There has been work in color-based retrieval since the early 1990s. The earliest significant work in color-based retrieval is by Swain and Ballard who used color histograms for indexing [32]. They introduced the histogram intersection technique for matching the histograms of a query image and a candidate image. More efficient histogram indexing strategies have been developed in [9], [10]. Histograms have been used for segmenting images far earlier. An overview of the use of histograms in image

Overview of the system

To meet the requirement for speed while maintaining the ability to detect the color signature of an object at widely varying scale and in the presence of interfering backgrounds, we have used two scale- and orientation-invariant color features, combining them in a two-phase matching strategy to achieve fast and accurate retrieval. An overview of the FOCUS system is shown in Fig. 2. The emphasis in the first phase of matching is on speed of retrieval, and the second phase aims at removal of

Phase I: peak matching

The first phase of matching is intended to produce a candidate image list as quickly as possible. The simplest requirement for an image to contain a multi-colored query object is the presence of all the colors of the query. The location of the histogram maxima, unlike the bin counts, are stable under large differences in scale and some viewpoint change, both conditions that are expected in this domain. Therefore, distinct colors in the image are computed as the peaks in the 3-D color histogram

Phase II: matching strategy

A ranked list of candidate images is obtained at the end of the first phase of matching. A more computationally intensive matching strategy can be applied at this stage since these operations need to be carried out only on the candidate images from the first phase and not the whole database. Even with this reduction, pixel level processing of the images would make this phase of matching too slow for online user interfaces. In response to this problem, we have developed a new graph description

Query construction and processing

FOCUS currently has an interactive user interface where the user can select a query from a variety of images in the database by marking a sub-image which covers the object of interest. The query image should not contain any background colors but it is not necessary to include the whole object exactly; including the salient colors of the object is sufficient. An example of query selection from a “Macintosh” advertisement is shown in Fig. 12. A part of the advertisement is selected as query using

Results

The test database for this work has 1200 images of various sizes up to 2.5 Mb. The overhead incurred in storing the required color information extracted offline is about 5% of the size of the database. The average number of target images for each query image is 3.3. Fig. 10 shows the first 10 retrieved images after the first phase of retrieval and the top five images after completion of the second phase of processing for a typical retrieval. Some of the false matches in the original retrieved

Conclusion

We have presented a fast, background-independent color image retrieval system which produces good results with multi-colored query objects. The main contributions of this work are to propose two scale- and orientation-invariant features which can be combined to produce good retrieval results even with database images with significant background clutter where the query object appears at different scales, orientations, and location in the candidate images. The speed of the system and the small

References (33)

  • A.K. Jain et al.

    Image retrieval using color and shape

    Pattern Recog.

    (1996)
  • M.S. Kankanhalli et al.

    Cluster-based color matching for image retrieval

    Pattern Recog.

    (1996)
  • J. Matas et al.

    The multimodal neighborhood signature for modeling object color appearance and applications in object recognition and image retrieval

    Comput. Vision Image Understanding

    (2002)
  • G.P. Babu et al.

    Color indexing for efficient image retrieval

    Multimedia Tools Appl.

    (1995)
  • J.R. Bach et al.

    The virage image search engine: an open framework for image management

    SPIE Conf. on Storage and Retrieval for Still Image and Video Databases

    (1996)
  • S. Belongie, C. Carson, H. Greenspan, J. Malik, Color- and texture-based segmentation using em and its application to...
  • S. Berretti et al.

    Efficient matching and indexing of graph models in content-based retrieval

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2001)
  • J.R. Beveridge et al.

    Segmenting images using localized histograms and region merging

    Int. J. Comput. Vision

    (1989)
  • S. Cohen, Finding color and shape patterns in images, Ph.D. thesis, Computer Science, Stanford University...
  • S. Cohen, L. Guibas, The earth mover’s distance under transformation sets, in: Seventh Internat. Conf. on Computer...
  • M. Das et al.

    Focus: Searching for multi-colored objects in a diverse image database

    IEEE Conf. Comput. Vision Pattern Recog.

    (1997)
  • Y. Gong et al.

    Image indexing and retrieval based on color histograms

    Multimedia Tools Appl.

    (1996)
  • J. Hafner et al.

    Efficient color histogram indexing for quadratic form distance functions

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1995)
  • J. Huang et al.

    Spatial color indexing and applications

    Int. J. Comput. Vision

    (1999)
  • P. Kelly et al.

    Query by image example: the candid approach

    SPIE Proc. on Storage and Retrieval for Image and Video Databases

    (1995)
  • H.F. Korth et al.

    Database System Concepts

    (1991)
  • Cited by (2)

    This material is based on work supported in part by the National Science Foundation, Library of Congress and Department of Commerce under Cooperative Agreement No. EEC-9209623, in part by the National Science Foundation under Grant No. IRI-9619117 and in part by NSF Multimedia CDA-9502639. Any opinions, findings, and conclusions or recommendations expressed in this material are the author(s) and do not necessarily reflect those of the sponsor(s).

    1

    Present address: Eastman Kodak Company, Rochester, NY, USA.

    View full text