Interactive image retrieval using fuzzy sets

https://doi.org/10.1016/S0167-8655(01)00045-9Get rights and content

Abstract

We present an image retrieval system which permits the user to submit a coarse initial query and continuously refine it. The user's relevance feedbacks is modeled by fuzzy sets, and is used to discover and use the more discriminatory features for the given query. The proposed system uses a dissimilarity measure based on the fuzzy integral.

Introduction

The recent increase in the size of multimedia information repositories has made Content-based image retrieval (CBIR) one of the most active research areas in the past few years. Unlike traditional database techniques which retrieve images based on exact matching of keywords, CBIR systems represent the information content of an image by visual features such as color, texture, and shape, and retrieve images based on similarity of features. Therefore, a good similarity (or equivalently a dissimilarity) measure is essential for the effective retrieval in such systems. Unfortunately, the meaning of similarity is rather vague and difficult to define. In general, different similarity measures capture different aspects of perceptual similarity between images. What makes the problem even harder is the fact that different features do not contribute equally, and therefore, cannot be considered equally important for computing the similarity (or dissimilarity) between images. For example, when a user perceives two images as being similar, what he or she really means is that the images are similar either in an individual feature, or in some combination of features. In this paper, we present a novel method that models the user's positive and negative feedbacks by fuzzy sets. These fuzzy sets are then used to learn the feature relevance weights. We show that the learned weights can be used as fuzzy densities in a fuzzy integral based retrieval system. We also show that the learned feature relevance values can be used as weights in a weighted Euclidean distance-based retrieval system.

The rest of this paper is organized as follows: Section 2 summarizes related work on similarity measures and feature relevance. Section 3 gives an overview of fuzzy integrals and fuzzy measures. Section 4 discusses how to use the Choquet integral as a dissimilarity measure. Section 5 describes how to model the user's feedback using fuzzy sets and how to update the feature relevance values. Experimental results over more than 3000 images for testing the learning behavior of the system are given in Section 6. Concluding remarks are given in Section 7.

Section snippets

Similarity measures

The retrieval process of images involves evaluating the degree of similarity between the query image feature vector and the feature vectors of the images stored in the database. Several similarity measures have been used in the past few years. The Euclidean (or weighted Euclidean) distance has been used in several CBIR systems (Flickner et al., 1995, Bach et al., 1998). A hybrid neural network algorithm is used in (Ma and Manjunath, 1996) to learn the visual similarity by clustering patterns in

Fuzzy measures

Let X={x1,…,xn} be an arbitrary set. A set function g:2X→[0,1] is a fuzzy measure if it satisfies the following three axioms:

  • 1.

    Boundary conditions: g(∅)=0,g(X)=1,

  • 2.

    Monotonicity: if A,B⊂X, and AB, then g(A)⩽g(B),

  • 3.

    Continuity: if {Ai} is an increasing subsequence of subsets of X, thenlimi→∞g(Ai)=gi=1Ai.

    A fuzzy measure is a Sugeno measure (or a λ-fuzzy measure) if it satisfies the following additional condition for some λ>−1

  • 4.

    ∀A,B⊂X with AB=∅g(A∪B)=g(A)+g(B)+λg(A)g(B).

For a finite set X, the fuzzy

The Choquet integral as a dissimilarity measure

The fuzzy integral can be interpreted as a fuzzy expectation (Sugeno, 1977), the maximal grade of agreement between two opposite tendencies (Wierzchon, 1989), or the maximal grade of agreement between the objective evidence and the expectation (Tahani and Keller, 1990). In this work, we develop a retrieval system that uses the Choquet integral as a dissimilarity measure. The system can be easily modified to use the Sugeno integral.

Let X={f1,…,fn} be a set of the n features used in the CBIR

Feature relevance

Let gi=g({fi}) represent the relevance of feature i. The relevance values will also serve as the fuzzy densities to be used in the Choquet integral. In this work, we restrict gi to be a λ-measure. This restriction will simplify the system considerably since only the fuzzy densities need to be learned in order to compute the fuzzy integral. Other measures, such as possibility measure, which provide a way to compute the measure of a union set from a pair of component measures, can also be used in

Data and features

The image database for our experiments consists of 3584 images of size 64×64 obtained by dividing 512×512 texture images into 64 subimages. The 56 original images1 are texture images from the Brodatz album (Brodatz, 1966), and are shown in Fig. 4. We use 24 Gabor filters (4 scales and 6 orientations) for feature extraction. The normalized mean and standard deviation of

Conclusion

In this paper, we have proposed an image retrieval system where human and computer can interact with each other to improve the retrieval performance of a CBIR system. This approach permits the user to submit a coarse initial query and continuously refine it using positive and negative relevance feedbacks. For both the Choquet integral-based dissimilarity and the Euclidean distance, our system requires the user to provide only a vague and more natural description of the retrieved images. These

Acknowledgements

The authors would like to thank the anonymous reviewers for their valuable comments. Partial support of this research was provided by a grant from the University of Memphis New Faculty Research Initiation Awards. This support does not necessarily imply endorsement by the University of research conclusions.

References (25)

  • M. Grabisch et al.

    Fuzzy measure of fuzzy events defined by fuzzy integrals

    Fuzzy Sets and Systems

    (1992)
  • T. Murofushi et al.

    A theory of fuzzy measures: representations, the Choquet integral, and null sets

    J. Math Anal. Appl.

    (1991)
  • Agnew, B., Faloutsos, C., Wang, Z., Welch, D., Xue, X., 1997. Multimedia indexing over the web. In: SPIE Conference,...
  • Bach, J.R., Fuller, C., Gupta, A., Hampapur, A., Horowitz, B., Jain, R., Shu, C., 1998. The virage image search engine:...
  • Bhanu, B., Peng, J., Qing, S., 1998. Learning feature relevance and similarity metrics in image databases. In: IEEE...
  • Bouet, M., Djeraba, C., 1998. Visual content based retrieval in an image database with relevant feedback. In: Internat....
  • P. Brodatz

    Textures: A Photographic Album for Artists & Designers

    (1966)
  • Buckley, C., Salton, G., 1995. Optimization of relevance feedback weights. In:...
  • A. Celentano et al.

    Feature integration and relevance feedback analysis in image similarity evaluation

    J. Electronic Imaging

    (1998)
  • M. Flickner et al.

    Query by image and video content: the qbic system

    IEEE Comput.

    (1995)
  • M. Grabisch

    Fuzzy integrals for classification and feature extraction

  • M. Grabisch et al.

    Fundamentals of Uncertainty Claculi with Applications to Fuzzy Inference

    (1995)
  • Cited by (24)

    • TMR: Towards an efficient semantic-based heterogeneous transportation media big data retrieval

      2016, Neurocomputing
      Citation Excerpt :

      To solve this problem, a feasible solution is to use fuzzy matching. In fact, fuzzy approaches have been applied in many fields of information retrieval such as interactive image retrieval [8], content-based image retrieval [3], decision support system [2] and Web resources retrieval [7]. In current approaches, semantic features are stored in the server׳s database and separated from the media documents.

    • A new fuzzy logic based ranking function for efficient Information Retrieval system

      2015, Expert Systems with Applications
      Citation Excerpt :

      Chen (2011) presented a new ranking function based on the geometric mean averaging operator to handle the similarity problems of generalized fuzzy numbers. Although, this research work is limited to fuzzy Information Retrieval and fuzzy datasets (Chen & Chen, 2003; Chen, Horng, & Lee, 2001; Chiang, Chow, & Hsien, 1997; Devedzic & Pap, 1999; Frigui, 2001). It is evident that, very few works are reported in literature to develop fuzzy logic based ranking function during last one decade only despite of advantages already mentioned above.

    • An evaluation of several fusion algorithms for anti-tank landmine detection and discrimination

      2012, Information Fusion
      Citation Excerpt :

      Exhaustive search can be used to assign weights for small collections of algorithms, but is too computationally burdensome for large collections. The Choquet integral has been investigated for information fusion by many researchers [84–89,45,90–93]. This integral defines a family of generally nonlinear aggregation operators on some function of the algorithm confidence values, which we will refer to as a decision statistic.

    • Application of artificial intelligence methods to content-based image retrieval

      2011, Advances in Imaging and Electron Physics
      Citation Excerpt :

      Conversely, there are also those who oppose the interference and proposed auto-relevance feedback methods that did not require human involvement at any time (see Section 3). Frigui (2001), for example, modeled the user's positive and negative feedback by using fuzzy sets to discover the more discriminatory features through relevance weights for a given query. The system used a dissimilarity measure based on the Choquet integral and applied it to the learned feature relevance values of each iteration, which are considered to be fuzzy densities.

    • A relevance feedback CBIR algorithm based on fuzzy sets

      2008, Signal Processing: Image Communication
      Citation Excerpt :

      Regarding the second topic, an interesting approach can be found in [9], where the authors propose a dissimilarity measure based on the fuzzy integral which performs better than the Euclidean distance, and then construct an image retrieval system based on this measure.

    View all citing articles on Scopus
    View full text