Elsevier

Neurocomputing

Volume 116, 20 September 2013, Pages 165-181
Neurocomputing

A simple computational model for image retrieval with weighted multifeatures based on orthogonal polynomials and genetic algorithm

https://doi.org/10.1016/j.neucom.2012.05.030Get rights and content

Abstract

This paper proposes a simple and new image retrieval method with weighted multifeature set based on multiresolution enhanced orthogonal polynomials model and genetic algorithm. In the proposed method, initially the orthogonal polynomials model coefficients are computed and reordered into multiresolution subband like structure. Then the statistical, directional, perceptual and invariant texture, shape and color features are directly extracted from the subband coefficients. The extracted texture, shape and color features are integrated into linear multifeature set and the significance of each feature in the multifeature set is determined by assigning appropriate weight. This paper also proposes a method to compute the optimized weight for each feature in the integrated linear multifeature multi feature set using genetic algorithm. Then the obtained optimized weight is multiplied with the corresponding features in the multifeature set and the weighted Manhattan distance metric is used for retrieving similar images. The efficiency of the proposed method is experimented on the standard subset of Corel and Caltech database images. The performance of the proposed method is compared with other existing retrieval methods such as Haar wavelet and Contourlet Transform based retrieval schemes. The proposed method yields high average recall and precision of 92.6% and 71% for Corel database and 90.5% and 72.3% of Caltech database images when compared with other existing methods.

Introduction

With the rapid growth of digital and information technologies, more and more multimedia data are generated and made available in digital form. Searching and retrieving relevant images in this huge volume of data is a difficult task and has created an urgent need to develop new tools and techniques. One such solution is the Content Based Image Retrieval (CBIR). As the image databases grow larger, the traditional keyword-based approach for retrieving a particular image becomes inefficient and suffers from the following limitations: (i) vast amount of labor is required for manual image annotation and (ii) limited capacity for retrieving the visual content of the image and subjectivity of human perception. Hence, to overcome these difficulties of manual annotation approach, content based image retrieval has emerged. CBIR is a collection of techniques and algorithms which enable querying the image databases with low level image content such as color, texture, objects and their geometries rather than textual attributes such as image name or other keywords [1]. Many image retrieval systems have been developed using all or some of these features. It includes Chabot [2], Photobook [3], QBIC [4], Virage [5], VisualSeek [6], MARS [7], Netra [8] and Excalibur [9]. The extensive literature and the state of art methods about content based image retrieval can be found in [10], [11], [12], [13], [14], [15]. Though some of the image retrieval applications such as trademark retrieval, character recognition and leaf image retrieval [16], [17] are implemented based on single feature (either texture, shape or color), but the single feature is found to be insufficient for natural, web based image retrieval applications as it affects the retrieval performance. Hence recently general-purpose CBIR systems concentrate on multiple features such as color, texture and shape along with some domain specific features for improving the performance of the image retrieval.

Sklansky [18] defined the texture as a set of local properties in the image region with a constant, slowly varying or approximately periodic pattern and it is measured using its distinct properties such as periodicity, coarseness, directionality and pattern complexity for efficient image retrieval particularly on the aspects of orientation and scale [19], [20]. In a typical CBIR system, identification of the proper features that maximizes the differentiation of the texture is an important step. There are many categories of methods that exist for identifying and manipulating the texture: (i) statistical methods (Gray level Co occurrence matrix (GLCM) [21]), (ii) model based methods such as Markov Random Fields (MRF) [22], Simultaneous Auto Regression (SAR) [23], Wold decomposition [24] and (iii) signal processing methods (Gabor filters [25], Wavelet Transforms [26], [27]). Some of these techniques depend on the comparison values of second order statistics obtained from query and stored images [28], [27] for measuring the texture similarity.

The shape of an image is effectively perceived by the human eye than color or texture. Hence shape-based searching and retrieving has gained much attention in CBIR. Shape-based retrieval involves three primary issues: shape representation, shape similarity measure and shape indexing. Among them, shape representation is the most important issue in shape based image retrieval. The shape representation methods reported in the literature can be classified into two categories: region based and contour based [29]. Region based techniques have frequently used moment descriptors to obtain shape representation [30], [31], [32], [33] which considers all the pixels inside the shape to compute the shape features. Contour based shape representation [29] only exploits shape boundary information and are classified into continuous approach (global) and discrete approach (structural). Both region and contour based representation methods compute shape features either in spatial or frequency domain. Spatial domain descriptors are sensitive to noise and not robust. In addition, it requires intensive computation during similarity calculation, due to the hard normalization of rotation invariance [34]. As the result, these spatial representations need further processing using spectral transform such as Fourier transform and wavelet transform. Wavelet descriptors have the advantage over Fourier descriptors in that they achieve localization of shape features in joint-space, i.e., in both spatial and frequency domains.

Early color based retrieval systems have used the global RGB histogram information such as the Local Color histogram [35], histogram difference approach [36], histogram intersection [37], [38], [39] and quadratic histogram comparison [40]. Though the color histogram based approaches are extremely easy to compute and insensitive to small changes in viewing positions and partial occlusion, they do not capture local spatial color information. Hence this approach is liable to false positives and is not robust to large appearance changes. Several recent schemes viz, Color Coherence Vector [41], Color Correlogram [42] and Binary Color Set [43], [44] incorporate spatial correlation of color regions as well as the global distribution of local spatial correlation of colors to improve upon the histogram method. Though these techniques perform better than traditional histograms, they require intensive computation. Generally color spatial techniques are classified into three categories [45]: (i) partition based approach (ii) signature based approach and (iii) cluster based approach.

Saber and Tekalp [46] have introduced a region based algorithm for automatic image annotation and retrieval with color, edge, shape and texture features. The regions are extracted based on edge information and Bayesian techniques with texture and shape features are computed on these regions. Region-wise feature extraction becomes computationally intensive and degrades the retrieval performance. The combination of structure, color and texture features for image retrieval is reported in [47]. But in this method the automatic adjustment of weight for the features are missing and obtaining texture feature using gabor filter becomes computationally difficult. Howe and Huttenlocher [48] have used a technique that integrates diverse and expandable set of image properties such as color, texture and location in a retrieval framework. The retrieval performance of this method is fairly better (68.3%) compared to other methods and the substantial control is given to end user for retrieving relevant images. Wavelet transform based color, shape and texture feature extraction for image retrieval is reported in [49], [50]. The multifeatures such as color, shape and texture are extracted from low and high frequency band of wavelet transform and the weight for each feature is determined using relevance feedback technique. Katare et al. [51] have integrated the shape and color feature for multi object image retrieval. In this method, the object with different orientation could not be identified with active contour based shape feature representation and it degrades the retrieval performance. Xiaojuan et al. [52] have established a method for image retrieval based on multifeatures with color, shape and texture and also introduced a method for normalization of the multifeature set. In this work, the multifeature normalization is performed in two stages: (i) Internal Normalization and (ii) Exterior Normalization and the authors themselves claimed that it is computationally intensive. An online application called garment image retrieval has used multifeatures for retrieving similar garment images [53]. Since the physical meaning, importance and value ranges of each feature are different in the linear combination of multifeature set, the similarity score computation with single distance metric becomes a series problem and degrades the retrieval accuracy. This problem can be solved using two methods viz., (i) relevance feedback [54], [55], [56] and (ii) appropriate weight generation [57], [58]. Relevance feedback is computationally high demanding and difficult to incorporate human into the loop. The latter method can be viewed as an optimization problem and a suitable optimization technique has to be incorporated to generate the weight in an adaptive manner for effective discrimination and retrieval with less computational cost. Hence this paper proposes a method for optimized weight generation using genetic algorithm for multifeature representation with multiresolution enhanced orthogonal polynomials model for efficient image retrieval. This paper is organized as follows. The orthogonal polynomials model and reordering the transformed coefficients into multiresolution subband like structure have been described in Section 2. The multi feature extraction is presented in Section 3. The evolution and the process of genetic algorithm are presented in Section 4. The genetic algorithm based optimized weight generation process is presented in Section 5. The performance metric is described in Section 6. Experiments and results are discussed in Section 7 and conclusion is drawn in Section 8.

Section snippets

Multiresolution reordering with orthogonal polynomials model coefficients

In this section the orthogonal polynomials model and the reordering of the orthogonal polynomials coefficients into multiresolution subband structure is presented. Multiresolution analysis plays a vital role in Human Visual System (HVS) and the experimental studies have shown that the eye's sensitivity to a visual stimulus strongly depends upon the spatial frequency contents of this stimulus. Hence, HVS motivates the use of multiscale image decompositions as a front end to complex image

Proposed rotation invariant feature extraction

In this section, the feature extraction process of the image under analysis based on the orthogonal polynomials model with the multiresolution reordered subbands is presented. In the orthogonal polynomials model, it is observed that when the block of an image is rotated the coefficient's magnitude remains unaltered but their position and the sign vary. At the same time the absolute difference between the corresponding coefficients of the original and the rotated block in the zig-zag sequence is

Evolution of genetic algorithm

This section describes the general procedure for computing Genetic Algorithm (GA). The basic concept behind GA is the survival of the fittest and is inspired by and named after biological processes of inheritance, mutation, natural selection and the genetic crossover that occurs when parents mate to produce offspring [66]. GA searches the solution by maintaining a population of solutions from which better solutions are created, whereas the conventional non-linear optimization techniques making

Proposed GA based weight generation

In this section, GA based feature weight generation process for the proposed multifeature representation scheme using texture and shape is presented. In the proposed weight generation method, initially the database (DB) images considered for experimentation are subjected to (i) Training and (ii) Testing. In the training portion, training pair (TP) is generated as described below:

Training Pair: A training Pair (TP) is the pair which consists of query image IQ and the user defined best matched

Performance measure

In this section the performance evaluation metric for the proposed method is presented. The performance of the proposed method is measured in terms of overall retrieval accuracy and accuracy of retrieval for a particular class. Accuracy of retrieval RetAcci for a particular class is the number of top images retrieved corresponding to the query image class:RetAcci=1|DBi|Hj=0DBik=0Hh(DBji,Rk)where DB denotes the database and DBjidenote the jth image belonging to ith class and H is the number

Experiments and results

The retrieval efficiency of the proposed multiresolution enhanced orthogonal polynomials with weighted multifeature using genetic algorithm is experimented with a subset of popular image databases COREL [69] and Caltech [70]. The experimental results are presented in this section. The images in these databases are of color images and are resized into (256×256). The databases contain various classes of images and some of the sample images from each database are shown in Fig. 5.

During

Conclusion

In this paper a new image retrieval method with weighted multi feature in multiresolution enhanced orthogonal polynomials model and genetic algorithm is proposed. The transformed coefficients are reordered into multiresolution subband like structure and the statistical and invariant texture, shape and color features are directly extracted from them. The extracted texture, shape and color features are integrated into linear multifeature set and the significance of each feature in the

Dr. R. Krishnamoorthi received his M.Tech in Computer Science and Engineering at IIT, Kanpur and Ph.D. from IIT, Kharagpur. He has 27 years of teaching and research experience and currently working as Professor and Dean, in Anna University of Technology Trichirappalli, Tamilnadu, India. He has published 24 papers in International journals and presented 51 Conference papers in various International Conferences. He is also an active reviewer for many international journals such as IEEE

References (70)

  • V.E. Ogle et al.

    Chabot: retrieval from a relational database of images

    IEEE Comput.

    (1995)
  • A. Pentland et al.

    Photobook : content based manipulation of image databases

    Int. J. Comput. Vision

    (1996)
  • M. Flickner et al.

    Query by image and video content: the QBIC system

    IEEE Comput.

    (1995)
  • J.R. Batch, C. Fuller, A. Gupta, A. Hampapur, B. Horowitz, B. Horowitz, R. Humphery, R. Jain, C.F. Shu, The virage...
  • J.R. Smith et al.

    Querying by Color Region using the VisualSEEK Content Based Visual Query System, Intelligent Multimedia Information Retrieval

    (1997)
  • T.S. Huang, S. Mehrotra, K. Ramachandran, Multimedia Analysis and Retrieval System(MARS) Project, in: Proceedings of...
  • W.Y. Ma, B.S. Manjunath, Netra: a toolbox for navigating large image databases, in: Proceedings of IEEE International...
  • J. Feder

    Towards image content based retrieval for world wide web

    J. Adv. Imaging

    (1997)
  • R. Datta, J. Li, J.Z. Wang, Content based image retrieval—approaches and trends of the new age, Proceedings of the 7th...
  • A. Smeulders et al.

    Content based image retrieval at the end of the early years

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2000)
  • M.L. Kherfi et al.

    Image retrieval from the world wide web: issues, techniques and systems

    ACM Comput. Surv.

    (2004)
  • M.S. Lew et al.

    Content-based multimedia information retrieval: state of the art and challenges

    ACM Trans. Multimedia Comput. Commun. Appl.

    (2006)
  • M. Kokare et al.

    A survey on current content based image retrieval methods

    IETE J. Res.

    (2002)
  • J. Sklansky

    Image segmentation and feature extraction

    IEEE Trans. Syst. Man Cybern.

    (1978)
  • H. Tamura et al.

    Texture features corresponding to visual perception

    IEEE Trans. Syst. Man Cybern.

    (1976)
  • W. Niblack, R. Barber, W. Equitz, M.D. Flickner, E.H. Glasman, D. Petkovicr, C. Faloutsos, G. Taubin, The QBIC Project:...
  • R.M. Haralick, Statistical and structural approaches to texture, Proceedings of IEEE, vol. 67, no. 5, 1979, pp....
  • G. Cross et al.

    Markov random field texture models

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1983)
  • F. Liu et al.

    Periodicity, directionality and randomness: wold features for image modeling and retrieval

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1996)
  • A.K. Jain et al.

    Unsupervised texture segmentation using Gabor filters

    Pattern Recognition

    (1991)
  • T. Chang et al.

    Texture analysis and classification with tree structured wavelet transform

    IEEE Trans. Image Process.

    (1992)
  • A. Laine et al.

    Texture classification by wavelet packet signatures

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1993)
  • J.P. Eakins, M.E. Gratam, CBIR: A Report to the JISC Technology Applications Program....
  • C.H. Teh et al.

    On image analysis by the methods of moments

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1988)
  • G. Taubin, D.B. Cooper, Recognition and positioning of rigid objects using algebraic moment invariants, International...
  • Cited by (11)

    • Utilization of rotation-invariant uniform LBP histogram distribution and statistics of connected regions in automatic image annotation based on multi-label learning

      2017, Neurocomputing
      Citation Excerpt :

      Caicedo and Jaafar BenAbdallah, etc. presented a method based on non-negative matrix factorization to generate multimodal image representations that integrate visual features and text information [1]. Krishnamoorthi et al. proposed a image retrieval method with weighted multifeature set based on multiresolution enhanced orthogonal polynomials model and genetic algorithm [18]. In order to reduce the semantics gap brought by the incompliance between the machine understanding of structural image difference by machine learning mechanism [24] and that of human beings perception of difference and similarity in semantic cognitions, accurate image annotation marked by semantic labels can work out the semantic gap puzzle [25] somewhat via whether professional labors or with the help of automatic image semantics annotation in which most of the time can be saved.

    • Quaternion polar complex exponential transform for invariant color image description

      2015, Applied Mathematics and Computation
      Citation Excerpt :

      The second one decomposes the color image into three channels, and then calculates the moment invariants of these three channels separately. Obviously, these two approaches cannot capture the correlation among color channels, and can hardly produce the most compact and effective feature description of color image [15–19]. In recent years, quaternion has been utilized more and more in color image processing, and the use of quaternion-based moment functions to color image has been investigated [17–19].

    • Research on Image Retrieval with Multi-features

      2019, Journal of Physics: Conference Series
    • Weighted feature voting technique for content-based image retrieval

      2018, International Journal of Computational Vision and Robotics
    View all citing articles on Scopus

    Dr. R. Krishnamoorthi received his M.Tech in Computer Science and Engineering at IIT, Kanpur and Ph.D. from IIT, Kharagpur. He has 27 years of teaching and research experience and currently working as Professor and Dean, in Anna University of Technology Trichirappalli, Tamilnadu, India. He has published 24 papers in International journals and presented 51 Conference papers in various International Conferences. He is also an active reviewer for many international journals such as IEEE Transactions on Systems, Man and Cybernetics, IEEE Transactions on Pattern Analysis and Machine Intelligence, Pattern Recognition and Pattern Recognition Letters. He received a sum of Rs. 63.15 Lakhs from various sponsoring agency to promote research. His area of interest is Image Annotation and Retrieval, Digital watermarking, Compression and Biometric.

    Mrs. S. Sathiya Devi received her BE in computer Science Engineering from Madras University and M.Tech in Computer Science and Engineering from Pondicherry University. She has 13 years of experience in teaching and 5 years of research experience. She is doing research in Intelligent Image Retrieval. Currently working as an Assistant Professor in Anna University of Technology, Trichirappalli, Tamilnadu, India. Her research interest includes Image retrieval, Web mining and application of soft computing techniques in remote sensing and medical image processing.

    View full text